frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

WASM Agents: AI agents running in the browser

https://blog.mozilla.ai/wasm-agents-ai-agents-running-in-your-browser/
127•selvan•4h ago

Comments

raybb•3h ago
Can you bypass the cors issue with a browser extension? I seem to recall CORS doesn't apply to extensions, or at least the part that isn't injected to the webpages.
ape4•3h ago
CORS is mentioned on that page: NOTE: If you want to run tools that get information from some other server into your HTML page (e.g. the visit_webpage tool or the Ollama server itself), you need to make sure that CORS is enabled for those servers. For more information, refer to the troubleshooting section in our GitHub repository.
N_Lens•3h ago
I guess we're at the stage where every permutation of "AI Agents" and X (Where X is technologies & or spaces) must be tried and posted on HN.
handfuloflight•3h ago
That's either the peak of inflated expectations or the slope of enlightenment.

Depending on what side you're on and "only time will tell."

Dilettante_•2h ago
Sometimes you gotta shake the tree to see what falls out.
bravetraveler•2h ago
Or pump the bubble
selvan•2h ago
Ship AI Agents as a web page :-)
FrankyHollywood•1h ago
We are in the early stage :)

https://www.youtube.com/watch?v=gN-ZktmjIfE

latexr•18m ago
That’s a well-remembered video, but I don’t really think it fits with the original comment. What we’re seeing is more akin to already having the plane flying (badly, still crashing frequently and landing in the wrong country) and instead of making it more reliable, everyone is trying different wheels and paint colours.

We’re not really seeing any significant development with this. What LLMs need most desperately (and are far from getting) is reliability and not being convincing liars. Being able to query existing server models from your oven timer is a cool gimmick but not really transformative or advancing anything.

It’s like a reverse JFK Space Effort speech: “We choose to indiscriminately throw at the wall every single LLM-adjacent idea we can think of. Not because it is useful, but because it is easy and potentially profitable”.

spwa4•1h ago
An excellent idea to have those be AI generated and posted! I'll start:

https://chatgpt.com/share/68679564-6a44-8012-b1bd-25819bfbf0...

TekMol•3h ago
It seems the only code that runs in the browser here is the code that talks to LLMs on servers.

Why would you need WASM for this?

politelemon•2h ago
They're using some python libraries like openai-agents so presumably it's to save on development efforts of calling/prompting/managing the HTTP endpoints. But yes this could just be done in regular JS in the browser, they'd have to write a lot of boilerplate for an ecosystem which is mainly Python.
yjftsjthsd-h•2h ago
> But yes this could just be done in regular JS in the browser, they'd have to write a lot of boilerplate for an ecosystem which is mainly Python.

Surely that's a prime use for AI?

m13rar•1h ago
From a quick gander. WASM is not to talk to the servers. WASM can be utilized to run AI Agents to talk to local LLMs from a sandboxed environment through the browser.

For example in the next few years if Operating System companies and PC producers make small local models stock standards to improve the operating system functions and other services. This local LLM engine layer can be used by browser applications too and that being done through WASM without having to write Javascript and using WASM sandboxed layer to safely expose the this system LLM Engine Layer.

benatkin•2h ago
This is trying to use the word agent to make it sound cool, but it doesn't make a case for why it's particularly about agents and not just basic level AI stuff.

> The agent code is nothing more than a Python script that relies on the openai-agents-python library to run an AI agent backed by an LLM served via an OpenAI-compatible API.

The openai-agents-python code is useful for writing agents but it is possible to use it to write code that isn't very agentic. None of the examples are very agentic.

niyyou•1h ago
Of course. Here is a corrected version of your text that fixes the grammar and typos while keeping the colloquial tone:

I'd like to offer a less skeptical view on this, contrary to what I've read here so far. LLMs that act (a.k.a. agents) bring a whole lot of new security and privacy issues. If we were already heading toward a privacy dystopia (with trackers, centralized services, etc.), agents could take that to a whole new level.

That's why I can only cheer when I see a different path where agents are run locally (by the way, Hugging Face has already published a couple of spaces demonstrating that). As a plus, because they're small, their environmental footprint will also be smaller (although, admittedly, I can also see the Jevons Paradox possibly happening here too).

sandGorgon•1h ago
i build an opensource mobile browser - we create ai agents (that run in the background) on the mobile browser. and build an extension framework on top so u can create these agents by publishing an extension.

we hook into the android workmanager framework and do some quirky things with tab handling to make this work. its harder to do this on mobile than on desktop.

bunch of people are trying to do interesting things like an automatic labubu purchase agent (on popmart) :D lots of purchase related agents

pull requests welcome ! https://github.com/wootzapp/wootz-browser/pull/334

ipsum2•1h ago
I recently wrote some Javascript to automate clicking coupons. The website checks for non-human clicks using event.isTrusted. Firefox allowed me to bypass this by rewriting the JS to replace s/isTrusted/true, while Chrome Manifest V3 doesn't allow it. Anyway, Firefox might be the future of agents, due to its extensibility.
_pdp_•1h ago
Mildly interesting article - I mean, you can already run a ton of libraries that talk to an inference backend. The only difference here is that the client-side code is in Python, which by itself doesn't make creating agents any simpler - I would argue that it complicates things a tone.

Also, connecting a model to a bunch of tools and dropping it into some kind of workflow is maybe 5% of the actual work. The rest is spent on observability, background tasks, queueing systems, multi-channel support for agents, user experience, etc., etc., etc.

Nobody talks about that part, because most of the content out there is just chasing trends - without much real-world experience running these systems or putting them in front of actual customers with real needs.

meander_water•1h ago
When I saw the title, I thought this was running models in the browser. IMO that's way more interesting and you can do it with transformers.js and onnx runtime. You don't even need a gpu.

https://huggingface.co/spaces/webml-community/llama-3.2-webg...

salviati•54m ago
I think you _do_ need a GPU. But it can work with an integrated one, no need for a discrete one.

I can't run it on Linux since WebGPU is not working for me...

zoobab•49m ago
No mention of WebGPU...
ultrathinkeng•47m ago
hmm
asim•46m ago
The frustrating thing about this is the limitation of using a browser. Agents should be long-running processes that exist external to a browser. The idea of using wasm is clever, but it feels like the entire browser environment needs to evolve because we're no longer dealing with just web pages. I think we are looking at a true evolution of the web now if this is the way it's going to go
diggan•43m ago
> Agents should be long-running processes that exist external to a browser

Sure, but there are a ton of ways for doing that today. What this specific thing is addressing, is removing the requirement of "the dependency on extra tools and frameworks that need to be installed before the agents can be run".

boomskats•29m ago
That's what the Component Model[0] is all about.

WASIp3[1] is gonna be awesome. Hopefully releasing later this year.

[0]: https://component-model.bytecodealliance.org/

[1]: https://wasi.dev/roadmap

simonw•27m ago
When you say agents should be long running, which definition of "agent" are you talking about?
evacchi•24m ago
mcp.run is entirely based on wasm. Tools can run on our cloud or locally
_joel•29m ago
Having to disable CORS restrictions is a bit meh, I understand why, but still.
simonw•21m ago
In this case the "agent" definition they are using is the one from the https://github.com/openai/openai-agents-python Python library, which they are running in the browser via Pyodide and WASM.

That library defines an agent as a system prompt and optional tools - notable because many other common agent definitions have the tools as required, not optional.

That explains why their "hello world" demo just runs a single prompt: https://github.com/mozilla-ai/wasm-agents-blueprint/blob/mai...

thepoet•9m ago
We looked at Pyodide and WASM along with other options like firecracker for our need of multi-step tasks that require running LLM generated code locally via Ollama etc. with some form of isolation than running it directly on our dev machines and figured it would be too much work with the various external libraries we have to install. The idea was to get code generated by a powerful remote LLM for general purpose stuff like video editing via ffmpeg, beautiful graphs generation via JS + chromium and stuff and execute it locally with all dependencies being installed before execution.

We built CodeRunner (https://github.com/BandarLabs/coderunner) on top of Apple Containers recently and have been using it for sometime. This works fine but still needs some improvement to work across very arbitrary prompts.

indigodaddy•2m ago
For the Gemini-cli integration, is the only difference between code runner with Gemini-cli, and gemini-cli itself, is that you are just using Gemini-cli in a container?
om8•3m ago
I have a demo that runs llama3-{1,3,8}B in browser on cpu. It can be integrated with this thing in the future to be fully local

https://galqiwi.github.io/aqlm-rs

Profile CUDA kernels with one command, zero GPU setup

https://github.com/Herdora/chisel
1•technoabsurdist•1m ago•1 comments

Columbia University Applicants' Personal Data Stolen by Hacker

https://www.bloomberg.com/news/articles/2025-07-01/columbia-university-applicants-personal-data-stolen-by-hacker
1•perihelions•3m ago•0 comments

Get in losers, we're moving to Linux

https://world.hey.com/dhh/get-in-losers-we-re-moving-to-linux-5e1b93cd
1•dsego•4m ago•0 comments

Making of an Elixir Conference

https://underjord.io/making-of-an-elixir-conference.html
1•lawik•5m ago•0 comments

Our small team vs. millions of bots

https://www.fsf.org/blogs/sysadmin/our-small-team-vs-millions-of-bots
2•chmaynard•8m ago•0 comments

Quantum Cloud: From Lab to Live

https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-year-of-quantum-from-concept-to-reality-in-2025
1•karlperera•10m ago•1 comments

Larry (Cat)

https://en.wikipedia.org/wiki/Larry_(cat)
3•dcminter•11m ago•0 comments

What's new in Java 25 for us, developers?

https://www.loicmathieu.fr/wordpress/en/informatique/java-25-whats-new/
2•loicmathieu•14m ago•1 comments

Ask HN: What are the best resources to learn Rust in 2025?

2•_Crownwell•18m ago•0 comments

What's the fastest way you've built a useful AI agent?

https://www.lyzr.ai
3•niruwu•18m ago•1 comments

Writing a Game Boy Emulator in OCaml

https://linoscope.github.io/writing-a-game-boy-emulator-in-ocaml/
1•ibobev•19m ago•0 comments

Show HN: Juskim – Chrome ext to auto-TL;DR every article before u start reading

https://chromewebstore.google.com/detail/juskim/jpgdpmeemjcdieahhjljhljhmpiifmkn
2•gnurpreet•22m ago•0 comments

The Work of Building for Other Engineers

https://humansinsystems.com/blog/the-work-of-building-for-other-engineers
1•adrianhoward•26m ago•0 comments

An attempt at defining consciousness made based on information theory

https://docs.google.com/document/d/1RN56GMpE8D4tc8k9Bw9YfSeJ4FZOyixkxWKpN_H0A34/edit?usp=drivesdk
1•Trenthug•26m ago•0 comments

OSI provides feedback to the EU Cybersecurity Act

https://opensource.org/blog/keeping-europe-safe-and-advancing-open-source-osi-provides-feedback-to-the-eu-cybersecurity-act
1•doener•30m ago•0 comments

Malaria as neurosyphilis treatment: A historical case study in medical ethics

https://medicine.yale.edu/news-article/malarial-fever-as-neurosyphilis-treatment-a-historical-case-study-in-medical-ethics/
1•thunderbong•32m ago•0 comments

The Parrot in the Machine

https://www.nybooks.com/articles/2025/07/24/the-parrot-in-the-machine-the-ai-con-bender-hanna/
1•almost-exactly•33m ago•1 comments

Fictional K-pop bands zoom to top of US music charts

https://www.bbc.com/news/articles/clyl1zyv1y2o
2•ranit•35m ago•0 comments

Show HN: Refinery – stop hardcoding test cases

https://www.testrefinery.com
1•atultw•38m ago•0 comments

Show HN: I built a tool that reveals hidden APIs the browser hits (open source)

https://github.com/hyperbrowserai/hyperbrowser-app-examples/tree/main/deep-crawler-bot
1•aparupganguly•40m ago•0 comments

Headless, zero dep. modal stack manager for React

https://github.com/AlexDemzz/react-easy-modals
2•alexdemzz•43m ago•1 comments

Pud's Small Batch Headphones

https://pud.com
1•keyle•45m ago•1 comments

When Will Robots Go Mainstream?

https://joincolossus.com/article/when-will-robots-go-mainstream/
2•almost-exactly•46m ago•0 comments

Why most Kimchi Premium trackers are basically useless

https://www.coinapi.io/blog/how-coinapi-helps-with-tracking-the-kimchi-premium
1•adafromcoinapi•49m ago•0 comments

Surflow: Tab Tagging System in Tree

1•asam-0•52m ago•0 comments

Post-Scarcity Blues [video]

https://www.youtube.com/watch?v=q_t3h2AZ0KY
1•doener•54m ago•0 comments

Show HN: vGhost – Vue directive to unload offscreen elements and maintain layout

https://github.com/isaact/vue-infinity
1•tewolde•54m ago•0 comments

Sombrahq/sombra-CLI: a CLI to scaffold and update projects from production code

https://github.com/sombrahq/sombra-cli
1•yunier-rojas•56m ago•0 comments

Man goes viral after working for four startups at the same time

https://www.nbcnews.com/tech/internet/man-goes-viral-working-four-startups-time-rcna216890
12•gshakir•56m ago•17 comments

Baidu Researchers Propose AI Search Paradigm

1•msolujic•57m ago•0 comments