frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Postgres LISTEN/NOTIFY does not scale

https://www.recall.ai/blog/postgres-listen-notify-does-not-scale
295•davidgu•3d ago•110 comments

Show HN: Pangolin – Open source alternative to Cloudflare Tunnels

https://github.com/fosrl/pangolin
32•miloschwartz•4h ago•4 comments

What is Realtalk’s relationship to AI? (2024)

https://dynamicland.org/2024/FAQ/#What_is_Realtalks_relationship_to_AI
232•prathyvsh•11h ago•79 comments

Show HN: Open source alternative to Perplexity Comet

https://www.browseros.com/
161•felarof•9h ago•55 comments

Batch Mode in the Gemini API: Process More for Less

https://developers.googleblog.com/en/scale-your-ai-workloads-batch-mode-gemini-api/
21•xnx•3d ago•5 comments

FOKS: Federated Open Key Service

https://foks.pub/
177•ubj•13h ago•42 comments

Graphical Linear Algebra

https://graphicallinearalgebra.net/
180•hyperbrainer•10h ago•12 comments

Flix – A powerful effect-oriented programming language

https://flix.dev/
218•freilanzer•12h ago•89 comments

Measuring the impact of AI on experienced open-source developer productivity

https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/
514•dheerajvs•10h ago•331 comments

Belkin ending support for older Wemo products

https://www.belkin.com/support-article/?articleNum=335419
53•apparent•8h ago•47 comments

Red Hat Technical Writing Style Guide

https://stylepedia.net/style/
160•jumpocelot•11h ago•71 comments

Yamlfmt: An extensible command line tool or library to format YAML files

https://github.com/google/yamlfmt
25•zdw•3d ago•12 comments

Launch HN: Leaping (YC W25) – Self-Improving Voice AI

49•akyshnik•8h ago•25 comments

Turkey bans Grok over Erdoğan insults

https://www.politico.eu/article/turkey-ban-elon-musk-grok-recep-tayyip-erdogan-insult/
85•geox•3h ago•58 comments

How to prove false statements: Practical attacks on Fiat-Shamir

https://www.quantamagazine.org/computer-scientists-figure-out-how-to-prove-lies-20250709/
199•nsoonhui•16h ago•153 comments

eBPF: Connecting with Container Runtimes

https://h0x0er.github.io/blog/2025/06/29/ebpf-connecting-with-container-runtimes/
35•forxtrot•7h ago•0 comments

Regarding Prollyferation: Followup to "People Keep Inventing Prolly Trees"

https://www.dolthub.com/blog/2025-07-03-regarding-prollyferation/
40•ingve•3d ago•1 comments

Show HN: Cactus – Ollama for Smartphones

108•HenryNdubuaku•7h ago•45 comments

Grok 4

https://simonwillison.net/2025/Jul/10/grok-4/
178•coloneltcb•6h ago•148 comments

Analyzing database trends through 1.8M Hacker News headlines

https://camelai.com/blog/hn-database-hype/
117•vercantez•2d ago•61 comments

Not So Fast: AI Coding Tools Can Reduce Productivity

https://secondthoughts.ai/p/ai-coding-slowdown
57•gk1•2h ago•36 comments

Diffsitter – A Tree-sitter based AST difftool to get meaningful semantic diffs

https://github.com/afnanenayet/diffsitter
89•mihau•13h ago•26 comments

Matt Trout has died

https://www.shadowcat.co.uk/2025/07/09/ripples-they-cause-in-the-world/
142•todsacerdoti•19h ago•42 comments

Is Gemini 2.5 good at bounding boxes?

https://simedw.com/2025/07/10/gemini-bounding-boxes/
259•simedw•14h ago•58 comments

The ChompSaw: A Benchtop Power Tool That's Safe for Kids to Use

https://www.core77.com/posts/137602/The-ChompSaw-A-Benchtop-Power-Tool-Thats-Safe-for-Kids-to-Use
80•surprisetalk•3d ago•66 comments

Foundations of Search: A Perspective from Computer Science (2012) [pdf]

https://staffwww.dcs.shef.ac.uk/people/J.Marshall/publications/SFR09_16%20Marshall%20&%20Neumann_PP.pdf
5•mooreds•3d ago•0 comments

Show HN: Typeform was too expensive so I built my own forms

https://www.ikiform.com/
166•preetsuthar17•17h ago•86 comments

Final report on Alaska Airlines Flight 1282 in-flight exit door plug separation

https://www.ntsb.gov:443/investigations/Pages/DCA24MA063.aspx
131•starkparker•5h ago•143 comments

Radiocarbon dating reveals Rapa Nui not as isolated as previously thought

https://phys.org/news/2025-06-radiocarbon-dating-reveals-rapa-nui.html
17•pseudolus•3d ago•8 comments

Optimizing a Math Expression Parser in Rust

https://rpallas.xyz/math-parser/
127•serial_dev•17h ago•55 comments
Open in hackernews

Show HN: Open source alternative to Perplexity Comet

https://www.browseros.com/
160•felarof•9h ago
Hey HN, we're a YC startup building an open-source, privacy-first alternative to Perplexity Comet.

No invite system unlike bunch of others – you can download it today from our website or GitHub: https://github.com/browseros-ai/BrowserOS

--- Why bother building an alternative? We believe browsers will become the new operating systems, where we offload much bunch of our work to AI agents. But these agents will have access to all your sensitive data – emails, docs, on top of your browser history. Open-source, privacy-first alternatives need to exist.

We're not a search or ad company, so no weird incentives. Your data stays on your machine. You can use local LLMs with Ollama. We also support BYOK (bring your own keys), so no $200/month plans.

Another big difference vs Perplexity Comet: our agent runs locally in your browser (not on their server). You can actually watch it click around and do stuff, which is pretty cool! Short demo here: https://bit.ly/browserOS-demo

--- How we built? We patch Chromium's C++ source code with our changes, so we have the same security as Google Chrome. We also have an auto-updater for security patches and regular updates.

Working with Chromium's 15M lines of C++ has been another fun adventure that I'm writing a blog post on. Cursor/VSCode breaks at this scale, so we're back to using grep to find stuff and make changes. Claude code works surprisingly well too.

Building the binary takes ~3 hours on our M4 Max MacBook.

--- Next? We're just 2 people with a lot of work ahead (Firefox started with 3 hackers, history rhymes!). But we strongly believe that a privacy-first browser with local LLM support is more important than ever – since agents will have access to so much sensitive data.

Looking forward to any and all comments!

Comments

rpastuszak•8h ago
Congrats!

How are you planning to make the project sustainable (from a financial, and dev work/maintenance pov)?

patrickaljord•7h ago
my guess is it's just an electron app or chromium wrapper with an ollama wrapper to talk to it (there are plenty of free open source libs to control browsers).
janalsncm•7h ago
“Just” is a four-letter word :)

When someone in their infinite wisdom decides to refactor an api and deprecate the old one, it creates work for everyone downstream.

Maybe as an industry we can agree to do this every so often to keep the LLMs at bay for as long as possible. We can take a page out of the book of the maintainers of moviepy for shuffling their apis around, it definitely keeps everyone on their toes.

moscoe•6h ago
You don’t have to guess, it’s open source
felarof•6h ago
We are a chromium "wrapper"

But we are much more performant than other libs (like playwright) which are written in JS, as we implement bunch of changes at chromium source code level -- for example, we are currently implementing a way to build enriched DOMtree required for agent interactions (click, input text, find element) directly at C++ level.

We also plan to expose those APIs to devs.

lotyrin•1h ago
No wireless. Less space than a Nomad.
felarof•6h ago
Thank you!

plan is to sell licenses for Enterprise-version of browser, same as other open-source projects.

eGQjxkKF6fif•7h ago
Whats the roadmap looking like for Linux?

I don't have Mac or Windows.

felarof•6h ago
this is on our radar, we plan to have it ready by early next week!

still a team of 2 people, so bunch things on our plate.

layer8•7h ago
I would prefer this as a browser extension, not as its own browser application.
felarof•6h ago
We would've preferred to build this as browser extension too.

But we strongly believe that for building a good agent co-pilot we need bunch of changes at Chromium C++ code level. For example, chromium has a accessibility tree for every website, but doesn't expose it as an API to chrome extension. Having access to accessibility tree would greatly improve agent execution.

We are also building bunch of changes in C++ for agents to interact with websites -- functions like click, elements with indexes. You can inject JS for doing this but it is 20-40X slower.

esafak•5h ago
Could you upstream that change in order to make it an extension in the future? I think people would not value it any less.
felarof•5h ago
We don't mind upstreaming. But I don't think Google Chrome/Chromium wants to expose it as an API chrome extensions, if not they would've done this long time ago.

From Google's perspective, extension are meant to be lightweight applications, with restricted access.

jazzyjackson•3h ago
I'm not really interested in AI agents for my webbrowser, but it would be pretty cool to see a fork of chromium available that, aside from being de-googled, relaxes all the "restricted access" to make it more fun to modify and customize the way you guys are. Just a thought, may be more of a market for the framework more than the product :)

See Sciter. A very cool, super lightweight alternative to Electron, but unfortunately it seems like a single developer project and I could never get any of the examples to run.

https://sciter.com/

felarof•2h ago
Yes, we want to do this too! We'll expose much more richer APIs.

What use-cases do you have in mind? like scraping?

layer8•4h ago
Would this be possible for Firefox?
felarof•2h ago
IIRC, Firefox's web extension API does not provide access to accessibility tree as well.
dotancohen•1h ago
I'm not GP, but I agree that if your goal is to empower the end user and protect him from corporate overlords, then Firefox is a more logical choice to fork from.
axus•8m ago
I wonder if someone on the Chromium team will upstream all these BrowserOS changes, or "Not Invented Here" and re-implement it all for Gemini / Google Assistant.

Can't imagine Firefox acquiring a Firefox fork!

arjunchint•2h ago
I mean you could build the agent with a first principles understanding of the DOM instead of just hacking together with the accessibility tree
arjunchint•2h ago
We had this exact thought as well, you don't need a whole browser to implement the agentic capabilities, you can implement the whole thing with the limited permissions of a browser extension.

There are plenty of zero day exploit patches that Google immediately rolls out and not to mention all the other features that Google doesn't push to Chromium. I wouldn't trust a random open source project for my day-to-day browser.

Check out rtrvr.ai for a working implementation, we are an AI Web Agent browser extension that meets you where your workflows already are.

felarof•1h ago
Brave Browser (70M+ users) has validated that a chromium fork can be viable path. And it can in fact provide better privacy and security.

Chrome extensions is not a bad idea too. Just saying that owning the underlying source code has some strong advantages in the long term (being able to use C++ for a11y tree, DOM handling, etc -- which will be 20-40X faster than injecting JS using chrome extension).

daureg•5h ago
> our agent runs locally in your browser (not on their server)

That's definitely a nice feature. Did you measure the impact on laptop battery life in a typical scenario (assuming there is such a scenario at this early stage)

felarof•5h ago
The agent running by itself shouldn't impact battery life, it is similar to a lightweight chrome extension and if you think about it, it's an agent browsing the web like human would :)

If you run LLMs locally (using Ollama) and use that in our browser, that would impact battery life for sure.

zebomon•5h ago
This is very exciting given the rumor that OpenAI will be launching a (presumably not open source) browser of their own this summer. I've joined your Discord, so will try it soon and report back there. Congrats on launching!
felarof•5h ago
Thank you!

Browser wars have begun.

> that OpenAI will be launching a (presumably not open source) browser of their own this summer.

For sure, won't be open-source. I bet in some parallel world, openAI would be non-profit and actually open-source AI :)

closetkantian•5h ago
This looks like a great project.

What are the system requirements? And shouldn't they be listed on your website?

felarof•5h ago
we support Mac (apple silicon and intel) and Windows.

hardware requirements are minimal, same as Google Chrome, if you BYOK API keys for agents and are not running LLMs locally.

paul7986•4h ago
So would this or any AI browser go out and fetch a list of the best deals for my trip to Iceland? After Show me all the options it has found for flights, hotels, car rentals and show cheapest/best prices with all details (fly out of and into with times) to even allow me to pay for each item on same page I asked it to do so? As well it could group the overall best deal with details and then i can just click to pay instantly and or make some edits.
felarof•3h ago
We just started cooking, very soon you should be able to do this!
dotancohen•1h ago
It seems that the next evolution of SEO will be too skip the SE and simply O for the LLMs.
ivape•2h ago
What is the default BrowserOS model? Is it local, and if so, what inferencing server are you using?
felarof•2h ago
The default model is Gemini.

You can bring your own API keys and change the default to any model you local.

Or better run model locally using Ollama and use that!

ivape•2h ago
The default is a remote Gemini call?
felarof•1h ago
Yes, right now using Gemini API on our liteLLM server (we can't expose API key on client side).

We are working on smaller, fine-tuned model too, which will be the default soon! It should be much faster and precise at navigation tasks.

o_____________o•2h ago
Would love to see this show up on homebrew!
felarof•2h ago
Oooh, that's a nice idea! We'll look into doing that!
mh-•1h ago
Making a homebrew recipe is super easy, and you can definitely find an example to draw from that's "shaped" like your app. Highly recommend.
mdaniel•2h ago
> --- How we built? We patch Chromium's C++ source code with our changes, so we have the same security as Google Chrome. We also have an auto-updater for security patches and regular updates.

So you rebuild your browser on every Chromium release? Because that's the risk: often changes go into Chromium with very innocent looking commit messages than are released from embargo 90 days later in their CVE reference

felarof•2h ago
Good question, so far we have been building on top of chromium release that Google Chrome is based on.
arjunchint•2h ago
Do you have any benchmarks to share like Halluminate's Web Bench?
felarof•2h ago
We working on it! Should have pretty soon!
darkmuck•2h ago
This is similar to the chrome extension nanobrowser. https://github.com/nanobrowser/nanobrowser
DANmode•2h ago
Is BrowserOS-OS on the roadmap?

(Will you ever make a better FydeOS, or if you're laser-focused, perhaps be open to sharing some with them, so they could?)

felarof•57m ago
Yes! We are excited to build BrowserOS-OS! I think with agents the whole UX can be re-imagined.

I'll check out FydeOS!

azinman2•1h ago
The demo buying toothpaste shows the difficulty of these tasks. Toothpaste itself was very underspecified, and it essentially randomly chose from a huge list. Some tasks may have past actions that could help guide, others won't have any to inform. Failure cases abound -- maybe the toothpaste you previously bought is no longer available. Then what? Ultimately how much time did this particular example save since you need to double check the result anyway? This is what doomed Alexa for the purchasing experience that Amazon assumed it would enable in the first place.

I think it'd be better to show more non-trivial examples where the time savings is clear, and the failure cases are minimized... or even better how it's going to recover from those failure cases. Do I get a bespoke UI for the specific problem? Talk to it via chat?

This whole world is non-trivial. Good luck!

felarof•1h ago
Great points! For sure, the whole agentic browsers space is still super early.

We are also just getting started and trying to narrow down on a high-value niche use-case.

There are few repetitive, boring use-cases where time saving could be meaningful -- one example: Walmart 3rd-party sellers routinely (multiple times a day) keep checking prices of the competitor products to price their products appropriately. This could be easily automated with current agentic browsers.

byteknight•1h ago
But in reality, would much more consistently be automated by a single playwright script.
felarof•58m ago
True, there are plenty of libs already available to do such an automation if you are (or can hire a) dev.

But for non-technical folks, agentic browsers seems like a good UX to build such and many more automations.

bkyan•43m ago
Is there a way to hook BrowserOS up as a sub-agent for an orchestration agent/system?
felarof•40m ago
Yes! We want to do this.

We were thinking of implement MCP protocol into the browser, so the browser can be an MCP server (that exposes bunch of tools -- navigation, click, extract) and you can connect that to your agent, would that work?

What is your use-case? Happy to chat on discord! https://discord.gg/YKwjt5vuKr

bkyan•26m ago
Yes, that works for me! My primary use case is automated testing of updates to websites that I maintain.
bkyan•17m ago
Thanks for the discord invite! My username is Mindcast on discord.
felarof•6m ago
thanks for joining! DMed you.