Show HN: 83 browser-use trajectories, visualized

https://trails-red.vercel.app/viewer
7•wayy•8h ago
Hey all, Justin here. I previously built Phind, the AI search engine for developers.

One of the biggest problems we had there was figuring out what went wrong with bad searches. We had tons of searches per day, but fewer than 1% of users gave any explicit feedback. So we were either manually digging through searches or making general system improvements and hoping they helped.

This problem gets harder with agents. Traces are longer and more complex, so they take more effort to review. I'm building a tool that lets you analyze LLM outputs directly, to help developers of LLM apps and agents understand where things are breaking and why.

I've put together a demo using browser-use agent traces (gpt-5): https://trails-red.vercel.app/viewer

It's early, but I have lots of ideas: live querying of past failures for currently running agents, and preference models to expand sparse signal data.

Would love feedback on the demo. Also if you're building agents and have 10k+ traces per day that you're not looking at but would like to, I'd love to talk.

Comments

Johnny_Bonk•5h ago
This is a cool project. I've also been trying to find some sort of leaderboard or benchmark for comparison. I personally really like the Claude in Chrome agent, but unfortunately I don't think I can build it into projects yet.

Show HN: Whosthere: A LAN discovery tool with a modern TUI, written in Go

https://github.com/ramonvermeulen/whosthere
221•rvermeulen98•16h ago•74 comments

Show HN: Text-to-video model from scratch (2 brothers, 2 years, 2B params)

https://huggingface.co/collections/Linum-AI/linum-v2-2b-text-to-video
131•schopra909•1d ago•23 comments

Show HN: Zsweep – Play Minesweeper using only Vim motions

https://zsweep.com
70•oug-t•5d ago•28 comments

Show HN: New 3D Mapping website - Create heli orbits and "playable" map tours.

https://www.easy3dmaps.com/gallery
28•dobodob•11h ago•15 comments

Show HN: Flux, A Python-like language in Rust to solve ML orchestration overhead

https://github.com/cmc-labo/flux
3•hpscript•2h ago•2 comments

Show HN: isometric.nyc – giant isometric pixel art map of NYC

https://cannoneyed.com/isometric-nyc/
1268•cannoneyed•1d ago•229 comments

Show HN: S2-lite, an open source Stream Store

https://github.com/s2-streamstore/s2
73•shikhar•2d ago•18 comments

Show HN: BrowserOS – "Claude Cowork" in the browser

https://github.com/browseros-ai/BrowserOS
82•felarof•1d ago•34 comments

Show HN: Dwm.tmux – a dwm-inspired window manager for tmux

https://github.com/saysjonathan/dwm.tmux
2•saysjonathan•4h ago•0 comments

Show HN: Teemux – Zero-config log multiplexer with built-in MCP server

https://teemux.com/
10•gajus•12h ago•6 comments

Show HN: I've been using AI to analyze every supplement on the market

https://pillser.com/
84•lilouartz•1d ago•44 comments

Show HN: Obsidian Workflows with Gemini: Inbox Processing and Task Review

https://gist.github.com/juanpabloaj/59bc13fbed8a0f8e87791a3fb0360c19
11•juanpabloaj•10h ago•1 comment

Show HN: Interactive physics simulations I built while teaching my daughter

https://www.projectlumen.app/
84•anticlickwise•4d ago•21 comments

Show HN: 83 browser-use trajectories, visualized

https://trails-red.vercel.app/viewer
7•wayy•8h ago•1 comment

Show HN: We added a CLI for receiving webhooks locally (no ngrok required)

https://hookverify.com
2•phntmdz•6h ago•1 comment

Show HN: AdaL Web, a local “Claude co-work” [video]

https://www.youtube.com/watch?v=smfVGCI08Yk
5•meame2010•4h ago•8 comments

Show HN: Txt2plotter – True centerline vectors from Flux.2 for pen plotters

https://github.com/malvarezcastillo/txt2plotter
33•tsanummy•4d ago•7 comments

Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete

https://huggingface.co/sweepai/sweep-next-edit-1.5B
524•williamzeng0•2d ago•149 comments

Show HN: Synesthesia, make noise music with a colorpicker

https://visualnoise.ca
36•tevans3•1d ago•13 comments

Show HN: A social network populated only by AI models

https://aifeed.social
10•capela•16h ago•8 comments

Show HN: First Claude Code client for Ollama local models

https://github.com/21st-dev/1code
44•SerafimKorablev•1d ago•22 comments

Show HN: Rails UI

https://railsui.com/
204•justalever•2d ago•109 comments

Show HN: Mastra 1.0, open-source JavaScript agent framework from the Gatsby devs

https://github.com/mastra-ai/mastra
213•calcsam•3d ago•69 comments

Show HN: ChartGPU – WebGPU-powered charting library (1M points at 60fps)

https://github.com/ChartGPU/ChartGPU
662•huntergemmer•2d ago•211 comments

Show HN: Bible translated using LLMs from source Greek and Hebrew

https://biblexica.com
50•epsteingpt•1d ago•64 comments

Show HN: CLI for working with Apple Core ML models

https://github.com/schappim/coreml-cli
46•schappim•1d ago•5 comments

Show HN: yolo-cage – AI coding agents that can't exfiltrate secrets

https://github.com/borenstein/yolo-cage
59•borenstein•2d ago•74 comments

Show HN: Startups.in: An in-development "global" startup intelligence database

https://startups.in
4•Startups_in•10h ago•3 comments

Show HN: AskUCP – UCP protocol explorer showing all products on Shopify

https://askucp.com/
10•possiblelion•4d ago•5 comments

Show HN: Claude Tutor – an open source engineering tutor

https://twitter.com/michaelraspuzzi/status/2014756546195148988
3•mraspuzzi•10h ago•1 comment