frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Are We Trek Yet?

https://nicolaiarocci.com/are-we-trek-yet/
1•shikharbhardwaj•56s ago•0 comments

Commodore needles Microsoft over end of Windows 10

https://www.tomshardware.com/software/operating-systems/commodore-needles-microsoft-over-end-of-w...
1•rbanffy•58s ago•0 comments

Time Tracker

https://www.timetrackers.online/
1•linyulearn•4m ago•1 comments

Extortion and ransomware drive over half of cyberattacks

https://blogs.microsoft.com/on-the-issues/2025/10/16/mddr-2025/
1•ChrisArchitect•4m ago•1 comments

SiFive 2nd Generation Intelligence Technology

https://www.youtube.com/watch?v=BJis0tkUt8E
1•fork-bomber•5m ago•0 comments

Self-hosting your code on Gitea

https://alicegg.tech//2025/10/16/gitea.html
1•groseje•6m ago•0 comments

Microsoft: RU, China increasingly using AI to escalate cyberattacks on the US

https://apnews.com/article/ai-cybersecurity-russia-china-deepfakes-microsoft-ad678e5192dd747834ed...
1•c420•9m ago•1 comments

Conflict-Free Replicated Data Types (CRDTs): Convergence Without Coordination

https://read.thecoder.cafe/p/crdt
2•0xKelsey•9m ago•0 comments

CDC tormented: HR workers summoned from furlough to lay off themselves, others

https://arstechnica.com/health/2025/10/cdc-tormented-hr-workers-summoned-from-furlough-to-lay-off...
2•rbanffy•11m ago•0 comments

She Faked Her Way into Yale. Then Things Unraveled

https://airmail.news/issues/2025-10-4/she-faked-her-way-into-yale-then-things-unraveled
1•rahimnathwani•12m ago•0 comments

Fuse-ZSTD: mimic transparent compression on ext4

https://github.com/Big-Dig-Data/fuse-zstd
1•gry_gh•12m ago•0 comments

Conference Cheat Sheet: Strategy+Examples for Positive ROI

https://bill.harding.blog/2025/10/16/conference-cheat-sheet-strategyexamples-for-positive-roi-w-o...
1•groseje•12m ago•0 comments

Where Are the Aliens? New Study Suggests They're Stuck Like Us

https://gizmodo.com/where-are-the-aliens-new-study-suggests-theyre-stuck-like-us-2000672801
2•rbanffy•12m ago•0 comments

FlashWorld: High-quality 3D Scene Generation within Seconds

https://imlixinyang.github.io/FlashWorld-Project-Page/
1•smusamashah•13m ago•0 comments

DBT Multi-Adapter Utils

https://github.com/edmondop/dbt-multi-adapter-utils
1•edmondo_por•13m ago•1 comments

Show HN: Chorey – A Type-Safe, Asynchronous Pipeline Framework for Python

https://anwitars.github.io/chorey/
1•anwitars•13m ago•0 comments

Who's Submitting AI-Tainted Filings in Court?

https://cyberlaw.stanford.edu/blog/2025/10/whos-submitting-ai-tainted-filings-in-court/
1•cainxinth•14m ago•1 comments

The 2-pager used to raise $3.5M from the investors behind Lovable, n8n, and Miro

https://www.productmarketfit.tech/p/the-2-pager-ex-klarna-founder-used
1•wslh•16m ago•0 comments

Google background is dark even when in light mode

https://old.reddit.com/r/chrome/comments/1o7q9gl/google_background_is_dark_even_when_in_light_mode/
1•pstadler•17m ago•0 comments

Blind Conductor and Amnesiac Agents-problems no one talks about

https://xor01.substack.com/p/blind-conductor-and-amnesiac-agent
1•xor01•17m ago•2 comments

3D-printed fuel cells could reshape sustainable aerospace applications

https://techxplore.com/news/2025-09-3d-fuel-cells-reshape-sustainable.html
1•PaulHoule•17m ago•0 comments

Working with the Amiga's RAM and Rad Disks

https://www.datagubbe.se/ramdisk/
4•ibobev•18m ago•0 comments

Bulk Operations in Boost.Bloom

http://bannalia.blogspot.com/2025/10/bulk-operations-in-boostbloom.html
1•ibobev•19m ago•0 comments

Western Executives Shaken After Visiting China

https://slashdot.org/story/25/10/16/006222/western-executives-shaken-after-visiting-china
3•andrewl•19m ago•0 comments

PowerShell Universal joins Devolutions: a new chapter in IT automation

https://blog.devolutions.net/2025/10/powershell-universal-joins-devolutions-a-new-chapter-in-it-a...
2•awakecoding•19m ago•0 comments

Improving the Trustworthiness of JavaScript on the Web

https://blog.cloudflare.com/improving-the-trustworthiness-of-javascript-on-the-web/
3•doomrobo•20m ago•1 comments

The evolution of 37signals over 25 years

https://paulsyng.com/blog/37signals-the-company-that-owns-control-and-doesnt-know-it/
1•felineflock•20m ago•0 comments

Procedural Generation with Wave Function Collapse

https://vectrx.substack.com/p/wave-function-collapse
1•ibobev•20m ago•0 comments

Operational Transparency

https://hbr.org/2019/03/operational-transparency
1•bookofjoe•20m ago•0 comments

Show HN: PAO Trainer – A small app I built to practice PAO memory systems [video]

https://www.youtube.com/shorts/S_nBfn87uRo
1•raoufbelakhdar•23m ago•1 comments
Open in hackernews

A stateful browser agent using self-healing DOM maps

https://100x.bot/a/a-stateful-browser-agent-using-self-healing-dom-maps
48•shardullavekar•2h ago

Comments

brianjking•2h ago
Is this able to load for anyone?
shardullavekar•2h ago
It's a chrome extension. Works if you use chrome.
brianjking•2h ago
I couldn't load the article. I was getting a nginx error initially. I'm able to view now. I think they were getting a bit squeezed.
memet_rush•2h ago
they didnt use the agent to self heal
phgn•57m ago
Nope. Their entire website shows up with a white screen for me in the latest Chrome.

There's this error in the console: Failed to load module script: Expected a JavaScript-or-Wasm module script but the server responded with a MIME type of "text/html". Strict MIME type checking is enforced for module scripts per HTML spec.

philo23•2h ago
Maybe this is a lack of understanding on my part, but this bit of the explanation sets off alarm bells for me:

> Under the hood, we're building a client-sourced RAG for the DOM. An agent's first move on a page is to check a vector DB for a known "map." ... This creates a wild side-effect: the system is self-healing for everyone. One person's failed automation accidentally fixes it for the next hundred users.

I think I'd like to know exactly what kind of data is extracted from the DOM to build that shared map.

artpar•2h ago
Agent4 is going to store "stable selectors" that worked (when it performs a task first time most of the time is spent in identifying these css/xpath selectors). Memories are pretty straighforward at this point, they are stored locally in your browser's IndexedDB (you can inspect from chrome inspector).
philo23•2h ago
Good to hear, that’s what I was hoping that it was doing.
erichocean•1h ago
How are you mapping from "click this element" (presumably obtained via a VLM) to the actual DOM locator that refers to it?

I guess Playwright can do it in "record" mode; I'm curious how you do it from a Chrome extension.

Spitballing here, you inject an event filter on the page and when the click happens, grab the element and run some code to synthesize a selector that just refers to that element? (Presumably you could just reuse Playwright's element-to-locator code at this point.)

artpar•1h ago
So when you go into the "selector" mode, the plugin will add event listeners to all the DOM nodes. Based on your click it will try to generate a bunch of selectors statically first (multiple, css and xpath based), and then based on your guidance its the job of agent4 to make stable selectors.
cjr•1h ago
document.elementFromPoint to get the elem at co-ordinates, then use npm package similar to optimal-select to come up with a unique css selector.
tnolet•1h ago
This is, as far as I understand, self healing ONLY if the name of a CSS class changes. Not for anything else. That seems like a very very very very narrow definition of "self healing": there are 9999 other subtle or not so subtle things that can change per session or per update version of a page.

If you run this against let's say a typical e-commerce page where the navigation and all screen elements are super dynamic — user specific data, language etc. — this problems becomes even harder.

artpar•1h ago
Everyone thinks of typical e-commerce pages when its comes "browser agent doing something", but our real use cases are far from shopping for the user. But your point still stands valid. The idea is that maybe there are websites where generating stable selectors/hierarchy maps wouldn't solve, but 80% (from 80-20) of websites are not like that (including a lot of internal dashboard/interfaces) (there will also be issues for websites with proper i18n implementations if the selectors are aria label based)

Self healing css selectors is also only 1 part of the story. The other part is the cohesive interface for the agent itself to use these selectors.

simpaticoder•1h ago
Couldn't you solve this by having the agent do a first pass through a page and generate a (java)script that interacts with the interesting parts of the page, and then prepend the script (if it's short enough) or a list of entry points (if it's not) to the prompt such that subsequent interactions invoke the script rather than interact directly with the page?
artpar•1h ago
If I am reading you correctly, you captured the whole essence of agent4.

So it does the first pass (based on your goals) makes memories (and these are local)

Now you tell the agent you want to do this repeatedly, so it will make a workflow (these workflows are saved on server, currently all public for now but we are working out permissions/group based access) for you based on these memories and interactions.

The problem is many times that the agent thinks is stable isn't really, so there a feedback loop for the agent to test out the workflow and improve them. (its basically claude code/codex sitting in the browser)

Workflow details are appended to prompt based on user query match/opened tabs match.

simpaticoder•48m ago
Okay I read your post more carefully and it seems like you're attempting to build one central script for a given URL. Assuming on-shot script generation is unreliable and requires iterative improvement this makes sense. Of course I'm biased in favor of local-first, privacy preserving and non-distributed solutions if they exist, so I'd be curious to know if/how you measured the reliability of one-shot script generation for a basket of likely web apps.
artpar•43m ago
One shot is pretty much not going to work, both at single step level or if you ask llm to generate workflow in one shot. We haven't measured it as such but even for static websites like hackernews front page it takes a couple tries of to and fro for the llm to get it right. somehow after all the instructions the llm will still "guess" the selector instead of checking the page/dom contents. And then there are lot of other minor details that need to be captured like "you need to wait a couple of second for the auto complete results to show up". If you tell it to just make a workflow, it will generate some garbage and call it a day.
arkmm•1h ago
Neat approach, but seems like the eventual goal of caching DOM maps for all users would be a privacy nightmare?
artpar•1h ago
Yes I can imagine PI somehow being stored in the workflow. I frequently see llms hardcoding tests just to make user happy and this can also happen in the browser version where if something is too hard to scrape but agent is able to infer from screenshot so it might end up making a workflow that seems correct but is just hardcoded with data. We are thinking of multiple guards/blocks to not let user create such a workflow, but the risks that come with an open ended agent are still going to be present.
bogdanoff_2•1h ago
Asking here because it seems related: I'm trying to use cursor to work on a webapp. It gets frustrating because vanilla Cursor is "coding blind" and can't actually see the result of what it is doing, and whether or not it works.

I ask it to fix something. It claims to know what the problem is, changes the code, and then claims it's fixed. I open app, and it's still broken. I have to continuously and way to often repeatedly tell it what it broken.

Now, supposing I'm "vibe coding" and don't really care about the obvious fact that the AI doesn't actually know what it is doing, it's still frustrating that I have to be in the loop just to provide very basic information like that.

Are there any agentic coding setups that allow the agent to interact with the app it's working on to check if it actually works?

tomashubelbauer•1h ago
Look into the Playwright MCP server, it allows coding agents to scrutinize the results of their work in the web browser. There is also an MCP server for the Chrome DevTools protocol AFAIK but I haven't tried it.
artpar•56m ago
I don't know if plywright works without chrome in debug mode, but I tried the MCP for chrome devtools and it requires chrome to be started in debugging mode and that basically means you cant log into a lot of sites (especially google) since it will block you with an "Unsafe" message. Works pretty well if you owe the target website.
kevinsync•53m ago
I was in the same boat on a side project (Electron, Claude Code) -- I considered Playwright but ended up building a simple, focused API instead that allows Claude to connect to the app to inspect logs (main console + browser console), query internal app data + state, and execute arbitrary JS.

It's sped up debugging a lot since I can just give it instructions like "found a bug that does XYZ, I think it's a problem with functionABC(); connect to app, click these four buttons in this order, examine the internal state, then trace through the code to figure out what's going wrong and present a solution"

I was pretty resistant at first of delegating debugging blindly like that, but it's made the workflow pretty smooth to where I can occasionally just open the app, run through it as a human user and take notes on bugs and flow issues that I find, log them with steps to reproduce, then give Claude a list of bugs to noodle on while I'm focusing on stuff LLMs are terrible at (design, UI, frontend work, etc)

shardullavekar•1h ago
a built-in mcp server that takes a look at what's broken and communicates with cursor is on our roadmap. Join discord and we will keep you posted there.
artpar•1h ago
So actually I have this setup (of a bridge server) which I use for agent4 itself (so claude code can talk to agent4), It makes a lot of sense to publish that bridge as well in the MCP form.
JimDabell•59m ago
You can use things like Browser Use and Playwright to hook things like that up, but you’re right, this is a very underdeveloped area. Armin Ronacher has a talk that covers some of this, such as unifying console.log, server logs, SQL, etc. to feed back to the LLM.

https://www.youtube.com/watch?v=nfOVgz_omlU

wahnfrieden•53m ago
This is the way. You can also feed screenshots back to it.
xnx•42m ago
Gemini CLI Chrome devtools MCP addresses this: https://developer.chrome.com/blog/chrome-devtools-mcp
hatmanstack•29m ago
Jump the line and just install it. who needs to read stuff. https://github.com/ChromeDevTools/chrome-devtools-mcp?tab=re...
ripped_britches•45m ago
“One persons map fixes everyone else’s”

Hm somehow I feel like this is a giant step in the wrong direction.

artpar•21m ago
Worst case scenario we can just shut down sharing/public workflows altogether, or do you have something else in mind ?
rco8786•38m ago
This tool seems relevant to my interests, but I gotta say I cannot figure out how to use the extension.

It seems like I'm only able to use the pre-existing/canned workflows that are provided under different "Persona"s? And there's no way for me to just create a new workflow from scratch for my specific use case.

Am I missing something obvious?

shardullavekar•34m ago
We launched Agent4 recently. You can install it from here: https://chromewebstore.google.com/detail/agent4/kipkglfnhnpb...

The one you refer will be taken down soon. Ping me on discord if you need help in trying it.

rco8786•7m ago
Thanks! I installed the new one but am still unable to figure out how to create my own workflow. I see that there a bunch of them on the left panel, but there doesn't seem to be a way for me to create one? When I use the chatbot it seems like it just tries to use the LLM to do whatever task I asked it to do but again, can't seem to save it or modify specific steps.

Basically, how do I use this self-healing DOM that the article is all about?

Related - the new extension only works if I allow it to be my new tab default page? That's pretty intrusive, if I'm honest.