Show HN: Autocrit – an agent loop that builds and tests web prototypes

https://github.com/adiun/pi-autocrit
2•4di•2h ago
Hi HN, this is my first Show HN submission.

I've been thinking about how product development will change with AI. The earliest stages of product development have so much ambiguity. Because code was expensive to write, we spent a lot of time writing specs and doing user research.

I thought I'd try an experiment after (a) seeing advancements in evaluation systems, especially for UX, (b) realizing that AI can create a reasonably good enrichment of a persona/end user, and (c) seeing Karpathy's autoresearch project.

Autocrit is a pi extension. Start the pi harness and run the autocrit skill. It will ask you for a high-level app idea and a definition of the target user. Autocrit creates a persona definition, has that persona create evaluation tasks, and then starts a loop in which a coding agent builds a prototype and a persona agent tries to use it in a real browser. The persona agent judges the prototype against the tasks, giving scores and verbatim feedback. The coding agent then creates a plan to fix things, keeps improvements, reverts bad ideas, etc. The loop runs overnight.
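
To make the keep-or-revert idea concrete, here is a minimal Python sketch of that kind of loop. It is not Autocrit's actual code: `build_candidate`, `evaluate`, `run_loop`, and the numeric "quality" field are all hypothetical stand-ins, and a random nudge takes the place of the real coding agent and browser-driving persona agent.

```python
import random
from dataclasses import dataclass


@dataclass
class Evaluation:
    score: float         # overall score from the (simulated) persona agent
    feedback: list[str]  # verbatim-style comments, one per task


def build_candidate(prototype: dict, feedback: list[str], rng: random.Random) -> dict:
    """Hypothetical coding agent: propose a changed copy of the prototype.

    A real agent would read the feedback and edit code; here a random
    nudge to a "quality" number stands in for that change.
    """
    candidate = dict(prototype)
    candidate["quality"] = prototype["quality"] + rng.uniform(-1.0, 1.0)
    candidate["revision"] = prototype["revision"] + 1
    return candidate


def evaluate(prototype: dict, tasks: list[str]) -> Evaluation:
    """Hypothetical persona agent: score the prototype against the tasks."""
    score = prototype["quality"]
    feedback = [f"{task}: scored {score:.2f}" for task in tasks]
    return Evaluation(score, feedback)


def run_loop(tasks: list[str], iterations: int, seed: int = 0):
    """Keep a candidate only if it scores higher; otherwise revert to the best so far."""
    rng = random.Random(seed)
    best = {"quality": 0.0, "revision": 0}
    best_eval = evaluate(best, tasks)
    history = [best_eval.score]
    for _ in range(iterations):
        candidate = build_candidate(best, best_eval.feedback, rng)
        candidate_eval = evaluate(candidate, tasks)
        if candidate_eval.score > best_eval.score:
            best, best_eval = candidate, candidate_eval  # keep the improvement
        # Otherwise the candidate is dropped, which is the "revert" step.
        history.append(best_eval.score)
    return best, history


if __name__ == "__main__":
    best, history = run_loop(["sign up", "create a note"], iterations=20, seed=1)
    print(best["revision"], history[-1])
```

Because a change is only kept when the score goes up, the best score never decreases from one iteration to the next. That property is what makes it reasonable to leave a loop like this running unattended.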

The goal is to get a better understanding of where to take the product at an early stage (e.g. paper prototypes), before actually starting to build the product. The evaluation loop of prototyping and getting feedback is automated here, but humans provide the persona definition, the app idea / product goals, and the hypotheses that need validation.

Comments

jsonfitzface•38m ago
The ambiguity doesn't go away with specs; it just gets deferred to when the code is written and you discover nobody wanted the thing you specified. If AI can shrink the loop between 'I think this is a problem' and 'someone who has this problem told me it isn't', that's genuinely useful. That's the part that kills most early products, not the building itself.
4di•35m ago
Well said. Yes, it’s more about validating hypotheses about what you’re trying to build.

My own hypothesis is that we could create a rough representation of your end user and build an agent loop around it to validate those hypotheses. It won’t work for every stage of product development, since some stages need feedback from real humans, but it is likely useful in the beginning stages.

Ask HN: LLM-Based Spam Filter

1•michidk•7m ago•0 comments

Show HN: Built a model-agnostic, desktop-native, research studio for local files

https://old.reddit.com/r/LLMDevs/comments/1sbusn8/new_pdfviewer_notes_panel_search_downloader_tool/
1•ieuanking•8m ago•0 comments

Josefina Aguilar, master clay artisan, died at 80

https://www.nytimes.com/es/2026/04/02/espanol/cultura/josefina-aguilar-artesana.html
1•paulpauper•13m ago•0 comments

The CA Minimum Wage Increase: Summing Up

https://marginalrevolution.com/marginalrevolution/2026/04/the-ca-minimum-wage-increase-summing-up...
2•paulpauper•13m ago•0 comments

What if everything still ran on vacuum tubes? [video]

https://www.youtube.com/watch?v=mEpnRM97ACQ
1•marklit•14m ago•0 comments

Smartphones, Online Music Streaming, and Traffic Fatalities

https://www.nber.org/papers/w34866
1•naves•16m ago•0 comments

Claude Code skill to preserve traditional Unix style conventions

https://github.com/agiacalone/unix-conventions
2•agiacalone•16m ago•1 comment

How Close Is Too Close? Applying Fluid Dynamics Research Methods to PC Cooling

https://www.lttlabs.com/articles/2026/04/04/how-close-is-too-close-applying-fundamental-fluid-dyn...
1•LabsLucas•17m ago•1 comment

DIY Air Drums

https://www.instructables.com/SpaceDrums-Play-Drums-in-the-Air/
1•nlarion•20m ago•0 comments

Marc Andreessen on why "this time is different" in AI

https://www.latent.space/p/pmarca
3•theorchid•21m ago•0 comments

Microsoft Hasn't Had a Coherent GUI Strategy Since Petzold

https://www.jsnover.com/blog/2026/03/13/microsoft-hasnt-had-a-coherent-gui-strategy-since-petzold/
5•naves•22m ago•0 comments

The $1B perfect bracket challenge likely cost less than a dollar

https://joshpearlson.com/articles/posts/impossible-bracket/impossible-bracket.html
4•jcpearlson•25m ago•0 comments

Satellite mirror plans could disrupt sleep and ecosystems worldwide

https://www.theguardian.com/science/2026/apr/05/satellite-mirror-plans-could-disrupt-sleep-and-ec...
3•mitchbob•25m ago•0 comments

Outdoor Recreation Data Portal

https://data.hereandthere.club
2•toomuchtodo•26m ago•0 comments

Reaffirming our commitment to child safety in the face of European Union inaction

https://blog.google/company-news/inside-google/around-the-globe/google-europe/reaffirming-commitm...
5•upofadown•29m ago•1 comment

Sora: A Solution Without a Problem

https://kaptur.co/sora-a-solution-without-a-problem/
2•herbertl•29m ago•0 comments

In 2026, We Are Friction-Maxxing

https://www.thecut.com/article/brooding-friction-maxxing-new-years-2026-resolution.html
2•wjb3•31m ago•0 comments

Building an AI Image Creator Skill for Claude Code

https://ai.georgeliu.com/p/building-an-ai-image-creator-skill
2•vbtechguy•32m ago•1 comment

People Are Not Friction

https://daverupert.com/2026/03/people-are-not-friction/
4•herbertl•34m ago•0 comments

Database triggers to clean text inputs

https://sive.rs/clean1
2•theorchid•34m ago•0 comments

Running Google Gemma 4 Locally with LM Studio's New Headless CLI and Claude Code

https://ai.georgeliu.com/p/running-google-gemma-4-locally-with
2•vbtechguy•36m ago•1 comment

Show HN: Media Den – Photo/video app with client-side encryption and your cloud

https://apps.apple.com/ca/app/media-den/id6761245161
1•ryanisnan•36m ago•0 comments

The Unsettling Vision of Rei Kawakubo (2005)

https://www.newyorker.com/magazine/2005/07/04/the-misfit
1•v9v•37m ago•0 comments

The Therac-25 software radiation disaster

https://en.wikipedia.org/wiki/Therac-25
2•bithavoc•38m ago•0 comments

Show HN: JustWorkflowIt, workflow orchestration platform plus code marketplace

https://justworkflowit.com/
1•nkorai•40m ago•0 comments

Uber engineering manager alleges firing after chemo leave and harassment report

https://www.teamblind.com/post/uber-female-engineering-manager-fired-following-chemo-treatment-af...
2•nickvec•40m ago•0 comments

From birds to brains: My path to the fusiform face area (2024)

https://www.kavliprize.org/nancy-kanwisher-autobiography
7•everbody•41m ago•0 comments

Show HN: VibeAround – Talk to Claude Code from Telegram and Hand over Sessions

https://github.com/jazzenchen/VibeAround
1•jazzen•41m ago•0 comments

Is consciousness the brain's consistency model? [pdf]

https://users.cs.utah.edu/~vijay/papers/waci26.pdf
2•maralom•43m ago•0 comments

Errand, a task runner with a syntax inspired by Terraform/OpenTofu

https://github.com/nuvrel/errand
1•calmondev•46m ago•2 comments