frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Tiny VM sandbox in C with apps in Rust, C and Zig

https://github.com/ringtailsoftware/uvm32
71•trj•4h ago•4 comments

Show HN: I made a spreadsheet where formulas also update backwards

https://victorpoughon.github.io/bidicalc/
63•fouronnes3•1d ago•19 comments

Show HN: Tripwire: A new anti evil maid defense

https://github.com/fr33-sh/Tripwire
73•DoctorFreeman•1d ago•38 comments

Show HN: Autofix Bot – Hybrid static analysis and AI code review agent

32•sanketsaurav•1d ago•12 comments

Show HN: EdgeVec – Sub-millisecond vector search in the browser (Rust/WASM)

https://github.com/matte1782/edgevec
4•matteo1782•3h ago•1 comments

Show HN: PharmVault – Secure Notes with Spring Boot and JWT

https://github.com/nifski/PharmVault
3•nifemi1234•3h ago•3 comments

Show HN: I built a GitHub application that generates documentation automatically

https://codesummary.io
4•jerrodcodes•5h ago•1 comments

Show HN: Local Privacy Firewall-blocks PII and secrets before ChatGPT sees them

https://github.com/privacyshield-ai/privacy-firewall
104•arnabkarsarkar•3d ago•54 comments

Show HN: Verani – Socket.io-like realtime SDK for Cloudflare

https://github.com/v0id-user/verani
4•v0id_user•5h ago•0 comments

Show HN: PhenixCode – Added admin dashboard for multi-server management

https://github.com/nesall/phenixcode
2•nesall•6h ago•0 comments

Show HN: Sim – Apache-2.0 n8n alternative

https://github.com/simstudioai/sim
229•waleedlatif1•1d ago•57 comments

Show HN: Jottings; Anti-social microblog for your thoughts

https://jottings.me/
23•vishalvshekkar•17h ago•12 comments

Show HN: Storyloom – Deterministic Storytelling Framework

https://jcpsimmons.github.io/storyloom/
3•joshcsimmons•7h ago•0 comments

Show HN: I'm building an open-source Amazon

https://openship.org
6•theturtletalks•8h ago•0 comments

Show HN: ESLint Plugin for styled-jsx

https://github.com/sushichan044/eslint-plugin-styled-jsx
3•sushichan044•8h ago•0 comments

Show HN: Epstein's emails reconstructed in a message-style UI (OCR and LLMs)

https://github.com/Toon-nooT/epsteins-phone-reconstructed
41•toon-noot•13h ago•8 comments

Show HN: Wirebrowser – A JavaScript debugger with breakpoint-driven heap search

https://github.com/fcavallarin/wirebrowser
67•fcavallarin•2d ago•15 comments

Show HN: Euporie-lite, Jupyter notebooks in terminal in the browser

https://euporie.readthedocs.io/en/latest/_static/lite.html
3•joouha•9h ago•1 comments

Show HN: Dbxlite – Query 100M+ rows in a browser tab, no install

https://sql.dbxlite.com/?share=gist:f0377982ccd68ac7f61a7faef8ff513e&run=true
2•hfmsio•9h ago•1 comments

Show HN: Gemini Pro 3 imagines the HN front page 10 years from now

https://dosaygo-studio.github.io/hn-front-page-2035/news
3317•keepamovin•3d ago•957 comments

Show HN: A zero-to-hero, spaced-repetition guide to WebGL2 and GLSL

https://github.com/GregStanton/webgl2-glsl-primer
3•HigherMathHelp•9h ago•1 comments

Show HN: A 2-row, 16-key keyboard designed for smartphones

https://k-keyboard.com/Why-QWERTY-mini
81•QWERTYmini•2d ago•68 comments

Show HN: Automated license plate reader coverage in the USA

https://alpranalysis.com
237•sodality2•2d ago•146 comments

Show HN: GPULlama3.java Llama Compilied to PTX/OpenCL Now Integrated in Quarkus

23•mikepapadim•1d ago•5 comments

Show HN: Gotui – a modern Go terminal dashboard library

https://github.com/metaspartan/gotui
41•carsenk•1d ago•13 comments

Show HN: An endless scrolling word search game

https://endless-wordsearch.com
25•marcusdev•1d ago•14 comments

Show HN: Open-source, offline voice typing and live captions for Android

https://github.com/notune/android_transcribe_app
3•leumon•12h ago•0 comments

Show HN: AlgoDrill – Interactive drills to stop forgetting LeetCode patterns

https://algodrill.io
177•henwfan•3d ago•106 comments

Show HN: I built a system for active note-taking in regular meetings like 1-1s

https://withdocket.com
175•davnicwil•4d ago•130 comments

Show HN: 360css CSS library inspired by the xbox360 dashboard

https://tarmo1.github.io/360css/
3•Tarmo362•13h ago•0 comments
Open in hackernews

Show HN: Workflow Use – Deterministic, self-healing browser automation (RPA 2.0)

https://github.com/browser-use/workflow-use
69•gregpr07•7mo ago
Hey HN – Gregor & Magnus here again.

A few months ago, we launched Browser Use (https://news.ycombinator.com/item?id=43173378), which let LLMs perform tasks in the browser using natural language prompts. It was great for one-off tasks like booking flights or finding products—but we soon realized enterprises have somewhat different needs:

They typically have one workflow with dynamic variables (e.g., filling out a form and downloading a PDF) that they want to reliably run a million times without breaking. Pure LLM agents were slow, expensive, and unpredictable for these high-frequency tasks.

So we just started working on Workflow Use:

- You show the browser what to do (by manually recording steps; show don’t tell).

- An LLM converts these recordings into deterministic scripts with variables (scripts include AI steps as well, where it’s 100% agentic)

- Scripts run reliably, 10x faster, and ~90% cheaper than Browser Use.

- If a step breaks, workflow will fallback to Browser Use and agentically run the step. (This self-healing functionality is still very early.)

This project just kicked off, so lots of things will break, it’s definitely not production-ready yet, and plenty of stuff is still missing (like a solid editor and proper self-healing). But we wanted to share early, get feedback, and figure out what workflows you’d want to automate this way.

Try it out and let us know what you think!

Comments

pzullo•7mo ago
Cool stuff!
cdolan•7mo ago
This is amazing. We've been using BrowserUser to try and create deterministic playwright scripts for months with mixed results.

So, so, so excited to see this

MagMueller•6mo ago
In the main library this feature could help you with that: https://github.com/browser-use/browser-use/pull/1437
deepdarkforest•7mo ago
Very cool. 1) How do you deal with timings? If a step includes clicking on a link or something that needs loading, then if you just fire off the generated playwright code at at once, some steps might fail because the xpath is not there yet. So to be safe, i'm guessing you would have to wait using the difference in the timestamps in the json. 2. For self healing, we worked on something similar, and we found it's very easy to get off the rails if one step fails because if your assertions of if the fix was correct are off, then the next and next steps will also fail etc. The most stable way was to just regenerate all steps if a step fails in 2 attempts (2 runs of the flow) consecutively. If the xpath is broken for a step, very likely the subsequent ones won't be worth healing individually.
gregpr07•7mo ago
1) we made this sick function in browser use library which analyses when there are no more requests going through - so we just reuse that!

2) yeah good question. The end goal is to completely regenerate the flow if it breaks (let browser use explore the “new” website and update the original flow). But let’s see, soo much could be done here!

What did you work on btw?

deepdarkforest•7mo ago
1. Oh yes right. I remember trying it out thinking it was going to be brittle because of analytics etc but it filters for those surprisingly well.

2. We are working on https://www.launchskylight.com/ , agentic QA. For the self onboarding version we are using pure CUA without caching. (We wanted to avoid playwright to make it more flexible for canvas+iframe based apps,where we found HTML based approaches like browser-use limited, and to support desktop apps in the future).

We are betaing caching internally for customers, and releasing it for the self-onboarding soon. We use CUA actions for caching instead of playwright. Caching with pixel native models is def a bit more brittle for clicks and we focus on purely vision based analysis to decide to proceed or not. I think for scaling though you are 100% right, screenshots every step for validating are okay/worth it, but running an agent non-deterministically for actions is def an overkill for enterprise, that was what we found as well.

Geminis video understanding is also an interesting way to analyze what went wrong in more interactive apps. Apart from that i think we share quite a bit of the core thinking, would be interested to chat, will DM!

crazymoka•7mo ago
So I can use this and it will pull new data from a database I can use to have it fill out a form? And I can trigger it when I need it to run with the updated form information?
MagMueller•6mo ago
Yes so you can run the same form over and over again with different input variables, very reliable, fast and cheap
petethomas•7mo ago
It's not mentioned here but Kapwork contributed the beginnings of this work in a PR a couple weeks ago: https://github.com/browser-use/browser-use/pull/1437. Thank you Gregor & Magnus for the great tech and all you're doing for the community.
rammy1234•7mo ago
seems similar to selenium plugin for firefox, minus the scripting it generates.
vasusen•7mo ago
Very cool evolution!

Really great to see the fallback to the agentic run when the automation breaks. For our e2e testing browser automation at Donobu, we independently arrived at the same pattern and have been impressed with how well it works. Automatic self-healed PR example here: https://github.com/donobu-inc/playwright-flows/pull/6/files

edit: typo

ProofHouse•7mo ago
Can utilizing Chrome Extensions be added? It seems no one has and would be a critical bridge to many browser tasks
nico•7mo ago
Yes. Also, would love something that can run directly in my browser with my sessions

There’s a lot of websites that are super hostile to automation and make it really hard to do simple, small, but repetitive stuff with things like playwright, selenium, chromedriver

casslin•6mo ago
we built a chrome extension supporting browser use and can run locally in ur browser: https://github.com/nanobrowser/nanobrowser, feel free to implement workflow use on top of our code base in typescript. Love to learn how it works!
casslin•6mo ago
we built a chrome extension supporting browser use: https://github.com/nanobrowser/nanobrowser, feel free to implement workflow use on top of our code base in typescript. Love to learn how it works!
Stiopa•7mo ago
Haha was just thinking last week there should be a tool called “show don’t tell” that infers a routine from recording, great minds think alike :)) Awesome feature guys, looking forward to playing around!
ListeningPie•7mo ago
We use selenium for RPAs. Saving this as an alternative to explore.
Richardson143•7mo ago
This is a really interesting direction, Gregor & Magnus! You're spot on about enterprises needing more robust and self-healing solutions for their high-frequency automation.

It's true that many are looking into self-healing for existing automation scripts; from what I've seen, tools like Healenium are gaining some traction in this space. However, I agree that a Browser Use-like approach also holds a lot of promise here.

My thinking on how this could be achieved with AI agents like Browser Use is to run the existing automation scripts as usual. If a script breaks due to an "element not found" exception or similar issues, the AI agent could then be triggered to analyze the page, identify the correct new locator for the problematic element, and dynamically update or "heal" the script. I've actually put together a small proof-of-concept demonstrating this idea using Browser Use: https://www.loom.com/share/1af87d78d6814512b17a8f949c28ef13?...

I had explored a similar concept previously with Lavague setup here: https://www.loom.com/share/9b0c7cf0bdd6492f885a2c974ca8a4be?...

Another avenue, particularly relevant for existing test suites, is how many QA teams manage their locators. Often, these are centralized in files like POM.xml (for Java/Maven projects) or external spreadsheets/CSVs. An AI agent could potentially be used to proactively scan the application and update these locator repositories.

For instance,

I've experimented with a workflow where Browser Use updates a CSV file of locators weekly based on changes detected on the website: https://www.loom.com/share/821f80fcb0694be4bd4d979e94900990?...

Excited to see how Workflow Use evolves, especially the self-healing aspects!

casslin•6mo ago
If any one wanna try it in browser can try implement it on our chrome extension code base in typescript: https://github.com/nanobrowser/nanobrowser, we support browser use and re wrote its code in typescript, not yet supporting workflow use and love to hear how it works from community!
dmos62•6mo ago
Care to share your thoughts on muscle-mem, the recent, related "meta-tool" for caching "agent trajectories"? https://github.com/pig-dot-dev/muscle-mem
gregpr07•6mo ago
I think it's awesome (we are close friends with Erik from Pig so slightly biased) - one extreme is Browser Use, which is just an agent that does everything for the first time, the other extreme is Workflow Use, which is almost deterministic. I think the winner product lies somewhere in the middle - Browser Use + Cache is easier to do for browser trajectories than for pure images! We will definitely try this direction!
joshstrange•6mo ago
I love the use of reusable non-LLM code for the "happy path" and only using the LLM to create a workflow or repair it.

One area for improve seems to be the checkboxes/radio buttons (possibility other input types?) given the demo didn't pick up on that (it did make the same selections but it didn't recognize this was a multiple-choice input). It might be useful to have a step before creating the JSON where it asks the user some follow up questions like, "Here are the inputs I found and my understanding of their data type". And then go through each input asking for a default value and maybe even clarification on "Should we even prompt for this?" (Example, always select country X).

I wonder if, for workflow repair purposes, it would be helpful to, at recording time, save more contextual information about the fields you are filling/clicking on. "This a country selector", "This is the birthdate field", etc. So that if the xpath/css/etc fails you can give the LLM doing the repair work a description of what it's looking for.

I'm excited to see more efforts in QA testing with things like this. Brittle e2e tests are the bane of my (limited) automated experience and the ability to auto-heal and/or deal with minor deviations would be wonderful.

mparis•6mo ago
This project resonates with me a lot. Call me old-fashioned, but I still appreciate a nice ole' deterministic program that I can fully understand and operate reliably.

With that said, there is undoubtedly still room to innovate on the long-tail of RPA. In the healthcare domain, for example, there are 1000s of sites that might need to be scraped occasionally, somewhat transactionally as e.g. a new patient comes in. However, there are other sites that need regular attention and even the smallest of errors can be catastrophic.

The combination of browser-use & workflow-use seems like a really natural fit for such use cases. Nice work!

We've also experimented with the self-healing ideas you are playing with here. In our case, we wrote a chrome extension that connects to an LLM of your choice as well as a process running locally on your machine. You write a description of the job to be done, click around the browser, and then click "go". The extension grabs all the context, asks the LLM to write a typescript program, sends that typescript program to the local process where it is compiled & type-checked against our internal workflow harness, and then immediately allows you to execute the program against your existing, open browser context.

We've found that even this basic loop is outrageously productive. If the script doesn't do what you expect, there is a big "FIX IT" button that lets you tweak and try again. For the record, we're not a competitor and have no intention of trying to sell/offer this extension externally.

I suspect one of the harder parts about this whole ordeal will be how to integrate with the rest of the workflow stack. For us, we've really appreciated the fact that our extension outputs typescript that seamlessly fits into our stack and that is more easily verifiable than JSON. The TS target also allows us to do nice things like tell the self-healing bot which libraries will be available so that e.g. it can use `date-fns` instead of `Date`. We've also thought about adopting more traditional workflow tools like Temporal to manage the core workflow logic, vending out the browser connectivity remotely. Curious how you guys are thinking about this?

Rooting for you guys, we will be sure to keep an eye on your progress and consider adopting the technology as it matures!

PS. If you like things like this, want to work at a growing health-tech startup, and live in Boston, we're hiring! Reach out here: https://wgwx7h7be0p.typeform.com/to/LV0t8OjI