frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

A Brief History of Ralph

https://www.humanlayer.dev/blog/brief-history-of-ralph
27•dhorthy•2h ago

Comments

f311a•1h ago
Just look at the code quality produced by these loops. That's all you need to know about it.

It's complete garbage, and since it runs in a loop, the amount of garbage multiplies over time.

dhorthy•1h ago
I don’t think anyone serious would recommend it for serious production systems. I respect the Ralph technique as a fascinating learning exercise in understanding llm context windows and how to squeeze more performance (read: quality) from today’s models

Even if in the absolute the ceiling remains low, it’s interesting the degree to which good context engineering raises it

ossa-ma•47m ago
How is it a “fascinating learning exercise” when the intention is to run the model in a closed loop with zero transparency. Running a black box in a black box to learn? What signals are you even listening to to determine whether your context engineering is good or whether the quality has improved aside from a brief glimpse at the final product. So essentially every time I want to test a prompt I waste $100 on Claude and have it an entire project for me?

I’m all for AI and it’s evident that the future of AI is more transparency (MLOPs, tracing, mech interp, AI safety) not less.

alansaber•27m ago
Current transparency is rubbish but people will continue to put up with it if they're getting decent output quality
skerit•1h ago
I do not understand where this Ralph hype is coming from. Back when Claude 4.0 came out and it began to become actually useful, I already tried something like this. Every time it was a complete and utter failure.

And this dream of "having Claude implement an entire project from start to finish without intervention" came crashing down with this realization: Coding assistants 100% need human guidance.

Veen•51m ago
You probably wouldn't use it for anything serious, but I've Ralphed a couple of personal tools: Mac menu bar apps mostly. It works reasonably well so long as you do the prep upfront and prepare a decent spec and plan. No idea of the code quality because I wouldn't know good swift code from a hole in the head, but the apps work and scratch the itch that motivated them.
ossa-ma•1h ago
So it took the author 6 months and several 1-to-1s with the creator to get value from this. As in he literally spent more time promoting it than he did using it.

And it all ends with the grift of all grifts: promoting a crypto token in a nonchalant 'hey whats this??!!??' way...

dhorthy•1h ago
the note about the crypto token was intended to “okay this is now hype slop and it’s time to move on”
skybrian•1h ago
There's a lot of irrelevant detail, but the article never actually explains what "Ralph" does or how it works.
wild_egg•1h ago
It's explained under the July 2025 heading with link to the blog post where it was first shared.

The key bit is right under that though. Ralph is literally just this:

    while :; do cat PROMPT.md | npx --yes @sourcegraph/amp ; done
skybrian•1h ago
Thanks!
msla•42m ago
Surely that would be better written as

    cat PROMPT.md | cat | npx --yes @sourcegraph/amp
dhorthy•1h ago
there are hundreds of useful resources, including many linked in the article itself
Juvination•1h ago
I've been working with the Ralphosophy? for iterative behavior in my workflow and it seems pretty promising for cutting out a few manual steps.

I still have a manual part which is breaking the design document down into multiple small gh issues after a review but I think that is fine for now.

Using codex exec, we start working on a github issue with a supplied design document, creating a PR on completion. Then we perform a review using a review skill madeup which is effectively just a "cite your sources" skill on the review along with Open Questions.

Then we iterate through open questions doing a minimum of 3 reviews (somewhat arbitrary but sometimes multiple reviews catch things). Then finally I have I have a step in for checking Sonarcloud, fixing them and pushing the changes. Realistically this step should be broken out into multiple iterations to avoid large context rot.

What I miss the most is output, seeing whats going on in either Codex or Claude in real time. I can output the last response but it just gets messy until I make something a bit more formal.

articulatepang•1h ago
This is so poorly written. What is "Ralph"? What is its purpose? How does it work? A single sentence at the top would help. The writer imagines that the reader cares enough to have followed their entire journey, or to decode this enormously distended pile of words.

More generally, I've noticed that people who spend a lot of time interacting with LLMs sometimes develop a distinct brain-fried tone when they write or talk.

alansaber•29m ago
"develop a distinct brain-fried tone when they write or talk" - I find that using an LLM as a writing copilot seriously degrades the flow of short form content
dang•26m ago
Please don't post shallow dismissals of other people's work (this is in the site guidelines: https://news.ycombinator.com/newsguidelines.html) and especially please don't cross into personal attack.
jes5199•21m ago
I forked the anthropic Ralph Wiggum plugin: https://github.com/jes5199/chief-wiggum

there’s some debate about whether this is in the spirit of the _original_ Ralph, because it keeps too much context history around. But in practice Claude Code compactions are so low-quality that it’s basically the same as clearing the history every few turns

I’ve had good luck giving it goals like “keep working until the integration test passes on GitHub CI” - that was my longest run, actually, it ran unattended for 24 hours before solving the bug

The Coming Industrialisation of Exploit Generation with LLMs

https://sean.heelan.io/2026/01/18/on-the-coming-industrialisation-of-exploit-generation-with-llms/
1•janpio•1m ago•0 comments

How to Master State Management in Modern React

https://jsdev.space/react-state-management/
1•javatuts•3m ago•0 comments

Greenland Crisis

https://en.wikipedia.org/wiki/Greenland_crisis
2•belter•3m ago•0 comments

The Irreversible Action Boundary in Autonomous Systems

https://github.com/indyh91/execution-governance
1•Shaehenderson•4m ago•1 comments

Could Europe leverage its $12.6T pile of US assets?

https://www.ft.com/content/beeaf869-ca12-4178-95a1-bfb69ee27ae4
2•mraniki•5m ago•1 comments

Understanding C++ Ownership System

https://blog.aiono.dev/posts/understanding-c++-ownership-system.html
1•todsacerdoti•6m ago•0 comments

UltraThink Is Dead. Long Live Extended Thinking

https://decodeclaude.com/ultrathink-deprecated/
1•handfuloflight•7m ago•0 comments

Upregulation of reward mesolimbic activity and immune response to vaccination

https://www.nature.com/articles/s41591-025-04140-5
1•plmpsu•9m ago•0 comments

Bitesize – Turn agentic conversations into free, encrypted, ephemeral websites

https://bitesize.online
1•chasenorton•10m ago•0 comments

Ensure correctness by changing when the first failure occurs

https://doliver.org/articles/ensure-correctness-by-changing-when-the-first-failure-occurs
2•d0liver•11m ago•0 comments

Show HN: I built a firewall for agents because prompt engineering isn't security

https://github.com/cordum-io/cordum
4•yaront111•13m ago•1 comments

Elon Musk accused of making up math to squeeze $134B from OpenAI, Microsoft

https://arstechnica.com/tech-policy/2026/01/elon-musk-accused-of-making-up-math-to-squeeze-134b-f...
5•SilverElfin•13m ago•0 comments

Show HN: PaperBot FM – Turn research papers into 3-host audio podcasts

https://www.trypaperbot.com/
2•mohasarc•13m ago•0 comments

WebGPU Water

https://jeantimex.github.io/webgpu-water/
5•astlouis44•14m ago•0 comments

Weight Transfer for RL Post-Training in under 2 seconds

https://research.perplexity.ai/articles/weight-transfer-for-rl-post-training-in-under-2-seconds
1•jxmorris12•14m ago•0 comments

Firehound: Explore the Insecure App Store

https://firehound.covertlabs.io/
3•kevin061•15m ago•0 comments

Revisiting Brat Summer

https://thelastwave.substack.com/p/revisiting-brat-summer
1•johanam•16m ago•0 comments

Show HN: Build Knowledge Graphs with AI

https://edge.dog/docs
1•castalian•16m ago•0 comments

Turn user friction into increased retention/lower customer churn

https://www.usercompass.tech/
1•VladCovaci•16m ago•0 comments

The quiet way AI normalizes foreign influence

https://cyberscoop.com/the-quiet-way-ai-normalizes-foreign-influence/
2•anigbrowl•17m ago•0 comments

Fix macOS 26 (Tahoe) exaggerated rounded corners

https://github.com/makalin/CornerFix
1•guessmyname•19m ago•0 comments

Show HN: Podcast App Detects Ads on iPhone

https://earsay.fm/
1•earsayapp•19m ago•0 comments

The Frogs Who Desired a King

https://aesopsfables.wordpress.com/the-frogs-who-desired-a-king/
1•jruohonen•21m ago•0 comments

AI Boosts Research Careers, but Flattens Scientific Discovery

https://spectrum.ieee.org/ai-science-research-flattens-discovery
2•Loquebantur•21m ago•1 comments

The Dandy' Review: The Threads of Modernity

https://www.wsj.com/arts-culture/books/the-dandy-review-the-threads-of-modernity-34cb2d0e
1•Caiero•23m ago•0 comments

Americans Are the Ones Paying for Tariffs, Study Finds

https://www.wsj.com/economy/trade/americans-are-the-ones-paying-for-tariffs-study-finds-e254ed2e
27•throw0101d•24m ago•9 comments

What's been your experience with Scrum Master?

1•ghostinit•25m ago•1 comments

Show HN: Opengenepool, MolBio IDE Plugin

https://opengenepool.vidalalabs.com
1•dnautics•27m ago•0 comments

Fast Static Symbol Table (FSST): efficient random-access string compression

https://github.com/cwida/fsst
1•tosh•27m ago•0 comments

Styleframe: Typesafe CSS

https://github.com/styleframe-dev/styleframe
1•handfuloflight•27m ago•0 comments