frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Browser Harness – simplest way to give AI control of real browser

https://github.com/browser-use/browser-harness
5•gregpr07•2h ago
Hey HN,

We got tired of browser frameworks restricting the LLM, so we removed the framework and gave the LLM maximum freedom to do whatever it's trained on. We gave the harness the ability to self correct and add new tools if the LLM wants (is pre-trained on) that.

Our Browser Use library is tens of thousands of lines of deterministic heuristics wrapping Chrome (CDP websocket). Element extractors, click helpers, target managemenet (SUPER painful), watchdogs (crash handling, file downloads, alerts), cross origin iframes (if you want to click on an element you have to switch the target first, very anoying), etc.

Watchdogs specifically are extremely painful but required. If Chrome triggers for example a native file popup the agent is just completely stuck. So the two solutions are to: 1. code those heuristics and edge cases away 1 by 1 and prevent them 2. give LLM a tool to handle the edge case

As you can imagine - there are crazy amounts of heuristics like this so you eventually end up with A LOT of tools if you try to go for #2. So you have to make compromises and just code those heuristics away.

BUT if the LLM just "knows" CDP well enough to switch the targets when it encounters a cross origin iframe, dismiss the alert when it appears, write its own click helpers, or upload function, you suddenly don't have to worry about any of those edge cases.

Turns out LLMs know CDP pretty well these days. So we bitter pilled the harness. The concepts that should survive are: - something that holds and keeps CDP websocket alive (deamon) - extremely basic tools (helpers.py) - skill.md that explains how to use it

The new paradigm? SKILL.md + a few python helpers that need to have the ability to change on the fly.

One cool example: We forgot to implement upload_file function. Then mid-task the agent wants to upload a file so it grepped helpers.py, saw nothing, wrote the function itself using raw DOM.setFileInputFiles (which we only noticed that later in a git diff). This was a relly magical moment of how powerful LLMs have become.

Compared to other approaches (Playwright MCP, browser use CLI, agent-browser, chrome devtools MCP): all of them wrap Chrome in a set of predefined functions for the LLM. The worst failure mode is silent. The LLM's click() returns fine so the LLM thinks it clicked, but on this particular site nothing actually happened. It moves on with a broken model of the world. Browser Harness gives the LLM maximum freedom and perfect context for HOW the tools actually work.

Here are a few crazy examples of what browser harness can do: - plays stockfish https://x.com/shawn_pana/status/2046457374467379347 - sets a world record in tetris https://x.com/shawn_pana/status/2047120626994012442 - figures out how to draw a heart with js https://x.com/mamagnus00/status/2046486159992480198?s=20

You can super easily install it by telling claude code: `Set up https://github.com/browser-use/browser-harness for me.`

Repo: https://github.com/browser-use/browser-harness

What would you call this new paradigm? A dialect?

Show HN: How LLMs Work – Interactive visual guide based on Karpathy's lecture

https://ynarwal.github.io/how-llms-work/
218•ynarwal__•9h ago•53 comments

Show HN: Gova – The declarative GUI framework for Go

https://github.com/NV404/gova
92•aliezsid•10h ago•16 comments

Show HN: Atomic – Local-first, AI-augmented personal knowledge base

https://atomicapp.ai/
23•kenforthewin•4h ago•6 comments

Show HN: leaf – a terminal Markdown previewer with a GUI-like experience

https://github.com/RivoLink/leaf
11•RivoLink•5h ago•3 comments

Show HN: Tolaria – Open-source macOS app to manage Markdown knowledge bases

https://github.com/refactoringhq/tolaria
260•lucaronin•18h ago•118 comments

Show HN: Browser Harness – simplest way to give AI control of real browser

https://github.com/browser-use/browser-harness
5•gregpr07•2h ago•0 comments

Show HN: Agent Vault – Open-source credential proxy and vault for agents

https://github.com/Infisical/agent-vault
132•dangtony98•2d ago•44 comments

Show HN: Honker – Postgres NOTIFY/LISTEN Semantics for SQLite

https://github.com/russellromney/honker
286•russellthehippo•1d ago•69 comments

Show HN: Headless terminal - Allow agents to run any interactive TUI or CLI

https://github.com/montanaflynn/headless-terminal
3•anonfunction•2h ago•1 comments

Show HN: Learn conflict resolution through a 90-second interactive story

https://app.questly.academy/try/play?story=conflict
5•danish00111•9h ago•2 comments

Show HN: Broccoli, one shot coding agent on the cloud

https://github.com/besimple-oss/broccoli
71•yzhong94•2d ago•47 comments

Show HN: Safer – Sleep better while AI agents have shell access

https://github.com/crufter/safer
4•friendly_chap•9h ago•0 comments

Show HN: GoModel – an open-source AI gateway in Go

https://github.com/ENTERPILOT/GOModel/
206•santiago-pl•3d ago•76 comments

Show HN: AgentSearch – Self-hosted search and MCP for AI agents, no API keys

https://github.com/brcrusoe72/agent-search
4•bricrusoe•22h ago•0 comments

Show HN: LocalLLM – Recipes for Running the Local LLM (Need Contributors)

https://locallllm.fly.dev
11•Igor_Wiwi•1d ago•2 comments

Show HN: easl – Instant hosting for AI agents

https://github.com/AdirAmsalem/easl
4•Adir•18h ago•0 comments

Show HN: Pdfnative – zero-dependency TypeScript PDF engine

https://www.npmjs.com/package/pdfnative
4•nizoka•18h ago•0 comments

Show HN: RustNmap

2•greatwallisme•13h ago•0 comments

Show HN: Run coding agents in microVM sandboxes instead of your host machine

https://github.com/superhq-ai/superhq
59•phoenixranger•1d ago•1 comments

Show HN: Mediator.ai – Using Nash bargaining and LLMs to systematize fairness

https://mediator.ai/
160•sanity•4d ago•74 comments

Show HN: A Clean Room RFC for NTFS Structural Repair

https://github.com/seb3773/ntfs-repair-rfc/
3•seb3773•15h ago•1 comments

Show HN: VidStudio, a browser based video editor that doesn't upload your files

https://vidstudio.app/video-editor
297•kolx•3d ago•107 comments

Show HN: Ctx – a /resume that works across Claude Code and Codex

https://github.com/dchu917/ctx
72•dchu17•3d ago•28 comments

Show HN: SQL Protocol – learn SQL by running real queries, with 1v1 PvP

https://sqlprotocol.com
2•ItaiZeilig•16h ago•1 comments

Show HN: Algorithmic String Art, accessible to all

https://string-loom.pages.dev
12•s_e__a___n•1d ago•4 comments

Show HN: Daemons – we pivoted from building agents to cleaning up after them

https://charlielabs.ai/
70•rileyt•3d ago•34 comments

Show HN: Stash – CLI to search over your team's coding agent sessions

https://github.com/Fergana-Labs/stash
7•samzliu•18h ago•1 comments

Show HN: Endo Familiar, an O-cap based JavaScript agent sandbox

https://dcfoundation.io/containing-ai-agents-the-endo-familiar-demo/
13•zmanian•18h ago•3 comments

Show HN: Real-Real-Time Chat

https://kraa.io/kraa/trees
9•levmiseri•1d ago•9 comments

Show HN: Tron Hilbert Curve Macro

https://github.com/EricNelson12/retrocycles-hilbert
2•i_am_a_squirrel•20h ago•0 comments