frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: WebPizza – AI/RAG pipeline running in the browser with WebGPU

https://github.com/stramanu/webpizza-ai-poc
2•stramanu•1h ago
I built a proof-of-concept for running RAG (Retrieval-Augmented Generation) entirely in the browser using WebGPU.

You can chat with PDF documents using models like Phi-3, Llama 3, or Mistral 7B - all running locally with zero backend. Documents never leave your device.

Tech stack: - WebLLM + WeInfer (optimized fork with ~3.76x speedup) - Transformers.js for embeddings (all-MiniLM-L6-v2) - IndexedDB as vector store - PDF.js for parsing

The main challenges were: 1. Getting esbuild to bundle without choking on onnxruntime-node 2. Managing COOP/COEP headers for SharedArrayBuffer 3. Keeping the bundle reasonable (Angular + models = ~11MB base)

Performance is surprisingly decent on modern hardware: - Phi-3 Mini: 3-6 tokens/sec (WebLLM) → 12-20 tokens/sec (WeInfer) - Llama 3.2 1B: 8-12 tokens/sec

Demo: https://webpizza-ai-poc.vercel.app/ Code: https://github.com/stramanu/webpizza-ai-poc

This is experimental - I'm sure there are better ways to do this. Would appreciate feedback, especially on: - Bundle optimization strategies - Better vector search algorithms for IndexedDB - Memory management for large documents

Happy to answer questions!

Fast SEO Fix

https://www.fastseofix.com
1•bellamoon544•2m ago•1 comments

Apps.apple.com leaked source code DMCA Takedown

https://github.com/github/dmca/blob/master/2025/11/2025-11-05-apple.md
1•LelouBil•4m ago•0 comments

A small game experiment for SEO

https://itcrawls.com
1•mmagicc•6m ago•0 comments

Dennis Ritchie's story of dabbling in the cryptographic world

https://web.archive.org/web/20250121041734/https://www.bell-labs.com/usr/dmr/www/crypt.html
1•fanf2•12m ago•0 comments

HackedGPT: Novel AI Vulnerabilities Open the Door for Private Data Leakage

https://www.tenable.com/blog/hackedgpt-novel-ai-vulnerabilities-open-the-door-for-private-data-le...
1•consumer451•13m ago•0 comments

How to Talk to Grandma About AI

https://jstrieb.github.io/posts/llm-thespians/
2•wofo•15m ago•1 comments

Show HN: I've built a story-to-video AI generator website

https://visimagine.com
1•gravitywp•19m ago•0 comments

The OpenHands Software Agent SDK: Composable and Extensible

https://arxiv.org/abs/2511.03690
1•timini•19m ago•1 comments

Review of the Other phone: a stylish, safety-first smartphone for children

https://www.mumsnet.com/reviews/the-other-phone-review
1•mner•20m ago•0 comments

Screeps: MMO RTS sandbox game for programmers

https://github.com/screeps/screeps
1•nateb2022•21m ago•0 comments

I want get a free HTTPS certificate without any library

https://awheelmaker.com/ACME
1•RockieYang•22m ago•0 comments

Colonial spider community in Sulfur Cave sustained by chemoautotrophy

https://subtbiol.pensoft.net/article/162344/
1•perihelions•29m ago•0 comments

Deep sequence models tend to memorize geometrically; it is unclear why

https://arxiv.org/abs/2510.26745
2•amichail•31m ago•0 comments

Stanford Graph Learning Workshop 2025 Video Recordings

https://snap.stanford.edu/graphlearning-workshop-2025/#schedule
1•Anon84•32m ago•0 comments

Show HN: A hand-held setup guide for Stremio (for non-tech friends and family)

https://www.own-your.stream/
1•anonbuddy•32m ago•0 comments

The Telegraph: BBC's bias 'pushed Hamas lies around the world'

https://www.telegraph.co.uk/news/2025/11/04/bbc-arabic-bias-pushed-hamas-lies/
1•wtcactus•32m ago•0 comments

Play 1.0 – The Future of Work, All in One App

https://3d7tech.com/play
1•richard3d7•33m ago•0 comments

Millisecond lifetimes and coherence times in 2D transmon qubits

https://www.nature.com/articles/s41586-025-09687-4
1•westurner•34m ago•0 comments

The Web Animation Performance Tier List

https://motion.dev/blog/web-animation-performance-tier-list
1•SirHound•35m ago•0 comments

Show HN: Placeholder Image Generator with color control

https://placeholderimage.io
1•Kristjan_Retter•35m ago•0 comments

Show HN: AI Coding Agents: Intent-Driven Development Guidelines

https://github.com/Exadra37/ai-intent-driven-development
1•Exadra37•36m ago•0 comments

You can't handle the truth – World Assessment Survey

https://worldview-assessment.vercel.app/
1•mrconter11•43m ago•0 comments

The Smith Manoeuvre – Is your mortgage tax deductible?

https://edrempel.com/smith-manoeuvre/
2•mooreds•43m ago•0 comments

Feature Extraction with KNN

https://davpinto.github.io/fastknn/articles/knn-extraction.html
2•RicoElectrico•45m ago•0 comments

Can Agentic AI workflows create good content?

https://medium.com/@mirshakirdah2/i-let-ai-write-my-blog-posts-for-6-months-heres-what-actually-h...
1•rovmut•46m ago•1 comments

Electronics device database of over 500k products

https://device.report/
1•Hackbraten•46m ago•0 comments

Show HN: CountdownShare – simple embeddable countdown timer, free and no signup

https://countdownshare.com
1•jatinlalit•46m ago•0 comments

Accumulating Context Changes the Beliefs of Language Models

https://arxiv.org/abs/2511.01805
2•Anon84•47m ago•0 comments

AI Slop vs. OSS Security

https://devansh.bearblog.dev/ai-slop/
31•mooreds•49m ago•4 comments

OIDC Workload Identity on AWS

https://www.latacora.com/blog/2025/11/04/aws-oidc-workload-identity/
2•mooreds•50m ago•0 comments