frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

WebGPU enables local LLM in the browser. Demo site with AI chat

https://andreinwald.github.io/browser-llm/
51•andreinwald•2h ago

Comments

andreinwald•2h ago
Browser LLM demo working on JavaScript and WebGPU. WebGPU is already supported in Chrome, Safari, Firefox, iOS (v26) and Android.

Demo, similar to ChatGPT https://andreinwald.github.io/browser-llm/

Code https://github.com/andreinwald/browser-llm

- No need to use your OPENAI_API_KEY - its local model that runs on your device

- No network requests to any API

- No need to install any program

- No need to download files on your device (model is cached in browser)

- Site will ask before downloading large files (llm model) to browser cache

- Hosted on Github Pages from this repo - secure, because you see what you are running

cgdl•55m ago
Which model does the demo use?
andreinwald•7m ago
Llama 3.2 1B
andsoitis•2h ago
very cool. improvement would be if the input text box is always on screen, rather than having to manually scroll down as the screen fills.
maxmcd•2h ago
Looks like this is a wrapper around: https://github.com/mlc-ai/web-llm

Which has a full web demo: https://chat.webllm.ai/

refulgentis•1h ago
I am to see it regardless - projects been very low activity for months. Just last night I was thinking about ripping it out before launch. No observable future.
swores•35m ago
Is this correct?

It doesn't seem so to me, either from the way it works or from what little of the code I've looked at...

But I don't have time to do more than the quick glance I just did at a few of the files of each and need to run, so hopefully someone cleverer than me who won't need as much time as me to answer the question could confirm while I'm afk

pjmlp•1h ago
Beware of opening this on mobile Internet.
andreinwald•1h ago
Demo site is asking before download
lukan•1h ago
Well, I am on a mobile right now, can someone maybe share anything about the performance?
pjmlp•30m ago
Not everyone enjoys unlimited data plans, and 500 MB is schon a lot.
andreinwald•1m ago
On my Android device works pretty fast.

But keep in mind that it's small Llama-3.2-1B model, specially for less powerfull GPU.

asim•1h ago
What's the performance of a model like vs an OpenAI API? What's the comparable here? Edit: I see it's same models locally that you'd run using Ollama or something else. So basically just constrained by the size of the model, GPU and perf of the machine.
andreinwald•4m ago
Yes, its very similar to Ollama app, and Llama-3.2-1B model used
petermcneeley•1h ago
This demo only works if you have the webgpu feature "f16". You can find out if you have this by checking for the feature list in https://webgpureport.org/ . The page itself can of course check for this but since f16 support is common they probably just didnt bother.
andreinwald•54m ago
Also here https://caniuse.com/webgpu
scottfr•52m ago
There is a Prompt API in development that's available in both Chrome and Edge to give access to a local LLM. Chrome extensions have access to it and I believe websites can request access as part of an origin trial.

The model is fully managed by the browser. It's currently the Gemini Nano model on Chrome, and they are testing a version of the Gemma 3n model in beta channels. Edge uses phi-4-mini.

More information is available here: https://github.com/webmachinelearning/prompt-api

om8•7m ago
To have a gpu inference, you need a gpu. I have a demo that runs 8B llama on any computer with 4 gigs of ram

https://galqiwi.github.io/aqlm-rs/about.html

echoangle•1m ago
Fun demo but the model that's used seems to be pretty stupid:

> What's the best way to get to space?

>> Unfortunately, it's not currently possible for humans to travel to space in the same way that astronauts do. While there have been several manned missions to space, such as those to the International Space Station, the technology and resources required to make interstellar travel feasible are still in the early stages of development.

>> Currently, the fastest spacecraft is the Apollo missions, which took around 77 hours to reach the Moon. From there, they traveled to other planets, but the distances between them are so great that even at high speeds, it would take many years to reach them.

>> As for traveling to space itself, there are no commercial or private companies that offer space travel services, and the cost of such a journey is estimated to be in the tens of millions of dollars per person.

>> However, there are some concepts and technologies being explored that might one day make space travel more accessible and affordable. For example, private companies like SpaceX and Blue Origin are working on developing reusable rockets that could potentially take people to space more quickly and reliably.

>> But for now, let's just say that space travel is still in the future, and we'll have to wait a bit longer for humans to venture into the cosmos!

The unreasonable likelihood of being: origin of life, terraforming, and AI

https://arxiv.org/abs/2507.18545
1•bookofjoe•50s ago•0 comments

Show HN: Accelerate AI agent development by grounding AI assistant in local docs

https://github.com/botingw/langgraph-dev-navigator
1•botingw_job•1m ago•0 comments

Linear Types for Programmers

https://twey.io/for-programmers/linear-types/
1•marvinborner•3m ago•0 comments

Invisible Symbols

https://symbol.so/invisible-characters
1•liquid99•11m ago•0 comments

US Army tests robot coyotes to prevent catastrophic bird strikes

https://cyberguy.com/robot-tech/army-tests-robot-coyotes-prevent-catastrophic-bird-strikes/
1•speckx•12m ago•0 comments

Telo MT1

https://www.telotrucks.com/
2•turtleyacht•15m ago•0 comments

The Creative Tension Between Developer and Language

https://krishna.github.io/posts/creative-tension-between-developer-and-language/
1•kenshi•18m ago•1 comments

Show HN: I've had success in SaaS and now I'm building tools for indie hackers

https://reallysimplesupport.com
1•gigamick•19m ago•0 comments

German police expands use of Palantir surveillance software

https://www.dw.com/en/german-police-expands-use-of-palantir-surveillance-software/a-73497117
4•LeftHandPath•19m ago•1 comments

California affordable housing programs on the chopping block after SCOTUS rules

https://calmatters.org/housing/2025/08/affordable-housing-developer-fees/
1•kqr2•20m ago•0 comments

Show HN: F1 COSMOS – Live timing and data dashboard for F1 fans

https://f1cosmos.com/
2•conradmk•27m ago•0 comments

Peter Thiel and the Antichrist

https://www.nytimes.com/2025/06/26/opinion/peter-thiel-antichrist-ross-douthat.html
2•dotcoma•28m ago•1 comments

Partisan hostility, not just policy, drives U.S. protests

https://phys.org/news/2025-08-partisan-hostility-policy-protests.html
2•bikenaga•29m ago•2 comments

Might Tariffs Get "Overturned"?

https://ritholtz.com/2025/07/tariffs-overturned/
1•throwaway81523•29m ago•0 comments

Getting Real with AI

https://doc.searls.com/2025/08/02/getting-real-with-ai/
1•speckx•30m ago•0 comments

Winners and Losers of the Bivalve Evolution

https://www.mdpi.com/1424-2818/17/7/500
1•PaulHoule•33m ago•0 comments

Architecture decision record (ADR) examples for software planning, IT leadership

https://github.com/joelparkerhenderson/architecture-decision-record
1•thunderbong•35m ago•0 comments

AI party game born from a drunk night with friends

https://taptrap.app
1•eliezerpujols•35m ago•1 comments

Choosing AI Tools for Work

https://www.augmentedswe.com/p/the-ultimate-overview-of-ai-tools
1•wordsaboutcode•39m ago•0 comments

Show HN: Let AI design your web project dev roadmap

https://www.archaltect.pro
1•jeremykip•41m ago•0 comments

Why Nobody Wants to Visit Las Vegas Right Now [video]

https://www.youtube.com/watch?v=W62Ie7dKXRY
2•pessimizer•43m ago•0 comments

New world record Weather satellites detect 515-mile-long lightning flash

https://www.space.com/astronomy/earth/new-world-record-weather-satellites-detect-515-mile-long-lightning-flash
2•Brajeshwar•45m ago•0 comments

Data Labeling Is the Hot New Thing in AI

https://spectrum.ieee.org/data-labeling-scale-ai-agents
2•Brajeshwar•45m ago•0 comments

Man Kept a Meticulous List of All 3,599 Books He'd Read Since 1962

https://www.smithsonianmag.com/smart-news/this-man-kept-a-meticulous-list-of-all-3599-books-hed-read-since-1962-when-he-died-his-family-published-it-online-180987074/
4•Brajeshwar•45m ago•0 comments

Google Sold This (Search Appliance) [video]

https://www.youtube.com/watch?v=YXoXJHn2vD0
2•thm•47m ago•1 comments

Creating realistic deepfakes is getting easier, motivating even more AI in reply

https://apnews.com/article/artificial-intelligence-deepfake-trump-espionage-hack-scammers-da90ad1e5298a9ce50c997458d6aa610
1•rntn•47m ago•0 comments

The X11 SECURITY extension from the 1990ies

https://www.uninformativ.de/blog/postings/2025-08-02/0/POSTING-en.html
1•zdw•49m ago•0 comments

How the US Weaponized Pakistan Against India

https://www.bloomberg.com/opinion/articles/2025-08-01/how-the-us-weaponized-pakistan-against-india
2•koolhead17•51m ago•0 comments

Build a Kanban Board in Minutes with GenosDB

https://medium.com/genosdb/build-a-kanban-board-in-minutes-with-genosdb-a4ae06a99ac9
1•estebanrfp•51m ago•1 comments

Can it be the best Bug bounty platform?

https://app.bugseekers.com/
2•Slowrodreguez•52m ago•0 comments