frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: RLM-based local debugger for AI agent traces

https://github.com/context-labs/halo
10•mikepollard_dev•7h ago
We built HALO (Hierarchal Agent Loop Optimizer), an open-source tool for debugging and optimizing AI agents using their execution traces.

It’s a loop. Run your agent, feed the traces to HALO, get the report, apply the fixes, then re-run your agent.

HALO takes in OTEL compliant traces from AI agents using tracing frameworks such as Langfuse, Arize/OpenInference, or even just plain JSONL. It uses an RLM (Recursive Language Model) to more efficiently break trace analysis into smaller subproblems in order to find recurring patterns across large amounts of data and fix systemic issues that regular LLMs might typically miss.

You can also optionally provide a path to where your agent code lives to give the engine more context so it can more concretely provide useful insights.

The repo also includes a desktop app that you can run locally without having to sign up for anything or configure anything complex.

Check out the readme in the repo for more in depth information on what HALO is and how you can use it to your benefit :)

Comments

funfunfunction•1h ago
Cool project. A team at work was building something similar to internal use.

I'm curious how this compares to just using Claude Code directly and giving it a dump of the agent traces? It seems like Claude could probably do some of the same diagnostics / trace grouping to identify failure patterns. Why use a custom harness?

mikepollard_dev•1h ago
Yeah, fair question. For a small number of traces just dumping them into Claude Code can work well.

However, once you're at production scale the problem changes. You can't always fit 10,000+ traces in Claude Code and still have it be effective especially when the relevant pattern of agent failures may only become apparent when you pass that many in. That's where the RLM based methodology helps. HALO recursively decomposes the trace data into smaller investigations, analyzes those sub-pieces, and then synthesizes those up to determine the recurring harness-level failure modes better than Claude Code or Codex ever could at a large scale.

Vulnerability reports are not special anymore

https://words.filippo.io/vuln-reports/
54•goranmoomin•1h ago•9 comments

Jerry's Map

http://www.jerrysmap.com/the-map
331•turtleyacht•7h ago•48 comments

A man was gifted his dream car by Kevin Mitnick, who he helped put in prison

https://www.thedrive.com/news/this-man-was-gifted-his-dream-car-by-the-notorious-hacker-he-put-in...
64•mauvehaus•1d ago•24 comments

FUTO Swipe – A new swipe typing model

https://swipe.futo.tech/
271•futohq•7h ago•87 comments

Printing Gaussian Splats

https://www.patreon.com/DanyBittel/posts/printing-splats-161333338
169•ilnmtlbnm•2d ago•16 comments

Swift Package Index joins Apple

https://swiftpackageindex.com/blog/swift-package-index-joins-apple
170•JDevlieghere•7h ago•52 comments

Usbliter8: an A12/A13 SecureROM Exploit

https://ps.tc/pages/blog-usbliter8.html
65•givinguflac•5d ago•15 comments

In memory of the man who put red and green squiggles under words

https://devblogs.microsoft.com/oldnewthing/20260622-00/?p=112451
146•saikatsg•7h ago•14 comments

Extreme Heat conference cancelled due to extreme heat warning

https://www.lse.ac.uk/granthaminstitute/events/extreme-heat-improving-governance-and-strengthenin...
123•rendx•2h ago•43 comments

Show HN: TikZ Editor – WYSIWYG editor for figures in LaTeX

https://tikz.dev/editor/
327•DominikPeters•11h ago•61 comments

The worthlessness of Vitamin D is mildly exaggerated

https://dynomight.net/vitamin-d/
204•surprisetalk•9h ago•149 comments

The Coming Loop

https://lucumr.pocoo.org/2026/6/23/the-coming-loop/
320•ingve•14h ago•226 comments

Inventing the Future, One Lisp Machine at a Time

https://www.patrickdomanico.com/bpm/2026/06/16/inventing-the-future-one-lisp-machine-at-a-time/
70•pamoroso•1d ago•4 comments

I can haz smoller NixOS ISOs?

https://natkr.com/2026-06-19-nixos-but-smol/
19•logickkk1•4d ago•6 comments

Show HN: Y – A malleable coding-agent desktop app built with Electron

https://github.com/y-times-y/y
10•HetPatel106•1h ago•9 comments

QSOE: QNX-inspired OS with dual-kernel architecture

https://qsoe-dev.blogspot.com/2026/06/qsoe-project-v01-is-released.html
26•ymz5•1d ago•8 comments

Rhombus Language 1.0

https://blog.racket-lang.org/2026/06/rhombus-v1.0.html
68•Decabytes•1d ago•8 comments

Unlimited OCR: One-shot long-horizon parsing

https://github.com/baidu/Unlimited-OCR
438•ingve•14h ago•101 comments

F* file system – file search that reads SSD directly bypassing OS kernel

https://github.com/dmtrKovalenko/ffs
37•neogoose•2d ago•31 comments

Wolves are reconquering Europe. Can people learn to live with them?

https://www.science.org/content/article/wolves-are-reconquering-europe-can-people-learn-live-them
39•stared•1d ago•35 comments

Meta Pauses Employee-Tracking Program Following Internal Data Leak

https://www.wired.com/story/meta-pauses-employee-tracking-program-following-internal-security-bre...
26•1vuio0pswjnm7•1h ago•1 comments

Millimeter wave technology drills 100 meters into granite

https://www.thinkgeoenergy.com/quaise-energy-achieves-100-meters-of-drilling-using-millimeter-wav...
92•Jimmc414•3d ago•22 comments

Five monitors on a Commodore 128 [video]

https://www.youtube.com/watch?v=ul5hC3PY1Yg
106•EvanAnderson•1d ago•20 comments

Trains halted across Germany because of communication system problem

https://apnews.com/article/germany-trains-halted-communications-radio-problem-deutsche-bahn-e8fd9...
142•sva_•4h ago•140 comments

Don't verify email addresses by sending spam to them

https://milek7.pl/mailverifyspam/
141•garaetjjte•5h ago•43 comments

Dirty Little Zine – a tool for making an 8 page printable Zine

https://dirtylittlezine.com/
58•cianmm•3d ago•3 comments

The Low-Tech AI of Elden Ring

https://nega.tv/posts/low-tech-ai-of-elden-ring.html
106•g0xA52A2A•14h ago•55 comments

Samsung demonstrates 3D stacked FETs with triple nanosheet channels at 42nm

https://semiconductor.samsung.com/news-events/tech-blog/from-gaa-to-3d-stacked-fet-expanding-the-...
96•its_ajseven•4d ago•29 comments

Show HN: FastUbu – An Ultrafast Video Archive

https://fastubu.com/
10•lukeigel•1d ago•0 comments

Fired by Google for creating the Google workspace CLI

https://twitter.com/JPoehnelt/status/2069482265953087602
284•justinwp•7h ago•185 comments