Show HN: Nxtscape – an open-source agentic browser

100•felarof•2h ago

Hi HN - we're Nithin and Nikhil, twin brothers and founders of nxtscape.ai (YC S24). We're building Nxtscape ("next-scape") - an open-source, agentic browser for the AI era.

-- Why bother building a new browser? For the first time since Netscape was released in 1994, it feels like we can reimagine browsers from scratch for the age of AI agents. The web browser of tomorrow might not look like what we have today.

We saw how tools like Cursor gave developers a 10x productivity boost, yet the browser—where everyone else spends their entire workday—hasn't fundamentally changed.

And honestly, we feel like we're constantly fighting the browser we use every day. It's not one big thing, but a series of small, constant frustrations. I'll have 70+ tabs open from three different projects and completely lose my train of thought. And simple stuff like reordering tide pods from amazon or filling out forms shouldn't need our full attention anymore. AI can handle all of this, and that's exactly what we're building.

Here’s a demo of our early version https://dub.sh/nxtscape-demo

-- What makes us different We know others are exploring this space (Perplexity, Dia), but we want to build something open-source and community-driven. We're not a search or ads company, so we can focus on being privacy-first – Ollama integration, BYOK (Bring Your Own Keys), ad-blocker.

Btw we love what Brave started and stood for, but they've now spread themselves too thin across crypto, search, etc. We are laser-focused on one thing: making browsers work for YOU with AI. And unlike Arc (which we loved too but got abandoned), we're 100% open source. Fork us if you don't like our direction.

-- Our journey hacking a new browser To build this, we had to fork Chromium. Honestly, it feels like the only viable path today—we've seen others like Brave (started with electron) and Microsoft Edge learn this the hard way.

We also started with why not just build an extension. But realized we needed more control. Similar to the reason why Cursor forked VSCode. For example, Chrome has this thing called the Accessibility Tree - basically a cleaner, semantic version of the DOM that screen readers use. Perfect for AI agents to understand pages, but you can't use it through extension APIs.

That said, working with the 15M-line C++ chromium codebase has been an adventure. We've both worked on infra at Google and Meta, but Chromium is a different beast. Tools like Cursor's indexing completely break at this scale, so we've had to get really good with grep and vim. And the build times are brutal—even with our maxed-out M4 Max MacBook, a full build takes about 3 hours.

Full disclosure: we are still very early, but we have a working prototype on GitHub. It includes an early version of a "local Manus" style agent that can automate simple web tasks, plus an AI sidebar for questions, and other productivity features (grouping tabs, saving/resuming sessions, etc.).

Looking forward to any and all comments!

You can download the browser from our github page: https://github.com/nxtscape/nxtscape

Comments

anilgulecha•2h ago

Very interesting approach. Why a browser, and not a fantastic chrome extension? Grouping tabs, summarizing, even taking open ended actions, seem very doable with permissions extensions have..

edit: Just read about the accessibility thing, but that's thin. Is there any usecase in the future that a browser can, but an extension can't?

esafak•2h ago

It sounds like something that needs to be dealt with in Chromium rather than forked. I am sure lots of developers want such functionality, if it is missing. I found:

https://developer.chrome.com/docs/extensions/ai

Don't any of these fit the bill? Are they Gemini-locked and you want something else? I am not familiar with the Chrome API, so pardon my ignorance.

felarof•2h ago

Yeah accessibility is one such usecase, but in future we have few other ideaswhere having a fork makes it lot easier. Few ideas:

- Ship a small LLM along with browser - MCP store built in

dataviz1000•2h ago

> Is there any usecase in the future that a browser can, but an extension can't?

The only reason to use a browser over a chrome extension is to bypass security features, for example, trusted events. If a user wants the browser window to go to full screen or play a video, a physical mouse click or key press is required. Moreover, some websites do not want to be automated like ChatGPT web console and Chase.com which checks if the event was a trusted event before accepting a button click or key press. This means that a Chrome extension can not automate voice commands inferred with audio to text. However, to get a trusted event only requires the user to press a button, any button, so message or dialog prompt that says, "Press to go full screen," is all that is required. This can be down with a remote bluetooth keyboard also.

The way I see it, these limitations are in place for very, very good reasons and should not be bypassed. Moreover, there are much larger security issues using a agentic browser which is sending entire contents of a bank website or health records in a hospital patient portal to a third party server. It is possible to run OpenAI's whisper on webgpu on a Macbook Pro M3 but most text generation models over 300M will cause it to heat up enough to cook a steak. There are even bigger issues with potential prompt injection attacks from third party websites that know agentic browsers are visiting their sites.

The first step in mitigating these security vulnerabilities is preventing the automation from doing anything a Chrome extension can't already do. The second is blacklisting or opt in only allowing the agents to read and especially to write (fill in form is a write) any webpage without explicit permission. I've started to use VSCode's copilot for command line action and it works with permissions the same way such as only session only access.

I've already solved a lot of the problems associated with using a Chrome extension for agentic browser automation. I really would like to be having this conversation with people.

EDIT: I forgot the most important part. There are 3,500,000,000 Chrome users on Earth. Getting them to install a Chrome extension is much, much easier than getting them to install a new browser.

gtsop•2h ago

Are we still tossing around the 10x productivity boost? Please make this stop. I see first commit on April 28 so by 10x productivity its like you've been working on this for almost 2.5 years, and there is still a waiting list on the website.

Appreciate the agplv3 licence, kudos on that.

felarof•2h ago

Thanks for the feedback.

I get the general sentiment. But cursor for sure has improved productivity by a huge multiplicative factor, especially for simpler stuff (like building chrome extension).

lxe•2h ago

Before I dive into the source code... how do you pass the page content, and the locations of interactive components to the LLM? And how do you dispatch events to interact with the page? I just want to verify if it's ARIA tree like the others, or it's something else.

felarof•2h ago

Today, we connect to chrome using CDP and use Puppeteer to send clicks and other operations. Also, using browser use DOM tree highlighting, which works great.

To get the page content we parse accessibility tree.

Y_Y•2h ago

"Building a browser"?

You're just patching Chromium.

wongarsu•2h ago

But the recruiters don't know that /s

felarof•2h ago

Yes, we are building on top of chromium. haha noway two of us can build a new browser and feel it's not needed too.

jklinger410•2h ago

What is with Mac users forking Chromium and then only making releases for Mac?

felarof•2h ago

Haha, was easier to build and we were the first users :)

have linux next on our radar. What build do you want?

doublerabbit•2h ago

FreeBSD, Haiku, Amiga

felarof•1h ago

Sounds good! will look into getting linux build.

jtolly710•57m ago

.deb would be great to see next :)

zahirbmirza•2h ago

Does it support MP4 playback?

felarof•1h ago

Yes, it works.

b0a04gl•2h ago

so agents can control tabs, forms, clicks—like a real user would.so what about undo. if an agent clicks the wrong thing, how do you roll that back without reloading the world?

felarof•2h ago

There is a big red button to always stop the agent.

wongarsu•2h ago

Name derived from Netscape (Firefox's great-grandfather), icon is a red fox, but based on Chrome? Was this originally designed as a Firefox fork or what happened there

ilaksh•2h ago

Yeah. Regardless, it seems misleading to use that icon with a Chromium fork.

Also the fact that it's AGPL means this project is very copyleft and not compatible with business models.

I'm not saying that there is no place for copyleft open source anymore, but when it's in a clearly commercial project that makes me question the utility of it being open source.

dotancohen•2h ago

  > very copyleft and not compatible with business models.

Could you explain this for the rest of us? Thanks.

mattigames•1h ago

The short answer is that it means that businesses need to publicly share whatever change they do to the code, and that alone is enough deterrent to use it.

abirch•1h ago

"The GNU Affero General Public License is a modified version of the ordinary GNU GPL version 3. It has one added requirement: if you run a modified program on a server and let other users communicate with it there, your server must also allow them to download the source code corresponding to the modified version running there."

https://www.gnu.org/licenses/why-affero-gpl.html

This means that if this company is successful and sells me 1 license, in theory I can request the source code and spin up Dr Evil's voice 1 billion clones and not pay licenses for those.

With other forms of GPL you only have to release the source code if you release the software to the user.

psychoslave•45m ago

A business that maintain its customer base captive through any kind of designed technical defect and asymmetrical information distribution is not striving for excellence in customer experience.

Saying that such a behavior encompasses all possible business models, it's like saying directorship is the only form of governance.

monkeywork•35m ago

Name 3 succesful companies running under such restrictions?

bityard•1h ago

Being copyleft doesn't mean it's not compatible with business models, it means it's not compatible with exploitative business models.

josephcsible•26m ago

Huh? It's a good thing that it's AGPL. That license explicitly allows commercial use, and only bans proprietary forks/modifications.

mbreese•44m ago

I can’t see how this project lasts with the current name/logo. As mentioned elsewhere, Netscape is still a trademark, and this is quite confusing between Netscape and Firefox.

xena•2h ago

Do you respect robots.txt?

felarof•2h ago

No, not today.

But wonder if it matter if it the agent is mostly using it for "human" use cases and not scrapping?

xena•1h ago

You should, because universities are starting to get legal involved due to mass scraping taking down their systems.

dotancohen•1h ago

Yes it would matter. The AI might be I in your eyes, but it is still A.

mattigames•1h ago

What do you mean? This AI cannot scrape multiple links automatically? Like "make a summary of all the recipes linked in this page" kind of stuff? If it can it definitely meets the definition of scraping.

grepexdev•1h ago

I think what he means is it is not just generally crawling and scraping, and uses a more targeted approach. Equivalent to a user going to each of those sites, just more efficiently.

taylorius•2h ago

I'll get voted down, but I hate that cute AI fox, and hope I never see it again.

felarof•2h ago

Haha, used gpt4o to generate it. What change do you want to see in that fox appearance? Any change should be a prompt away :)

al_borland•2h ago

Why a fox? The browser is based on Chrome. Firefox basically owns having a fox as a mascot in the browser space. Why not pick something original? The fox confusing at best, but some may say misleading. Same goes for the name.

felarof•1h ago

Thanks for the feedback. Honestly—we just reused the icon we had gotten professionally designed for the last idea we were working on (https://felafax.ai/).

But not gonna lie, as a tiny startup, we don’t have marketing budget of Perplexity or Dia, so we picked a name and icon that at least hinted at “browser” right away. Definitely not trying to mislead anyone -- just needed something recognizable out of the gate.

sevg•1h ago

It doesn’t look like you reused that icon. It looks like you generated a new one with AI. So it could have been any animal (or not even an animal at all).

Sophira•41m ago

You stated in the parent comment that you used GPT4o to generate it, but now you're saying you had a professionally made icon? I don't understand.

ilaksh•2h ago

It makes me question your honesty. If you want a fox logo then build it as a Firefox fork. If you do that I will trust you again.

rafram•1h ago

The text is very off-center and the AI "vibe" is palpable. Hire a designer or at least take the time to add the text to a free SVG yourself.

esskay•1h ago

For starters, it shouldn't be using a fox. You know why.

doublerabbit•1h ago

I wish we could stop with the animal "furry" mascots for projects.

It was cute when the internet was cute but now it's just boring.

_fw•2h ago

I’m using Dia a lot for work at the moment and frankly it’s a gamechanger. granted I’m not a developer but being able to interact with an LLM that has access to the page I’m on is extremely useful:

Instead of manually hunting across half a dozen different elements, then copy/paste and retype to put something into a format I want…

I can just get Dia do it. In fact, I can create a shortcut to get it to do it the same way every single time. It’s the first time I’ve used something that actually feels like an extension of the web, instead of a new way to simply act on it at the surface level.

I think the obvious extension of that is agentic browsers. I can’t wait for this to get built to a standard where I can use it every day… But how well is it going to run on my 16GB M1 Pro?

felarof•2h ago

16GB M1 Pro is good enough to run our browser! You should give it a try!

Download form https://www.nxtscape.ai/ or our github page.

mattigames•1h ago

If this workflow starts getting any traction this will quickly turn into a cat and mouse game, where companies do their best to make sure those AIs don't work on their websites to make sure humans and humans only watch their websites' ads, their links, their banners and so on.

Google being a big one of those companies would soon side with those companies and not with the users, it's been their modus operandi, just recently some people got threats that if they don't stop using ad blockers in YouTube they will ban them from the platform.

johncole•2h ago

This has been the most fun Show HN to read in a long time. <grabs popcorn>

OsrsNeedsf2P•2h ago

Is this only for MacOS? If it's a Chromium fork, what's the reason for no Linux/Windows?

Also what's the business model?

felarof•1h ago

Yes MacOS for now, but looking into getting Linux binary next.

> what's the reason for no Linux/Windows?

Sorry, just lack of time. Also we use Sparkle for distributing updates, which is MacOS only.

> Also what's the business model?

We are considering an enterprise version of the browser for teams.

Lammy•1h ago

AOL still have active trademarks for “Netscape” which might trouble you here:

- https://tsdr.uspto.gov/#caseNumber=76017078&caseSearchType=U...

> PROVIDING MULTIPLE-USER ACCESS TO A GLOBAL COMPUTER INFORMATION NETWORK FOR THE TRANSFER AND DISSEMINATION OF A WIDE RANGE OF INFORMATION; ELECTRONIC TRANSMISSION OF DATA, IMAGES, AND DOCUMENTS VIA COMPUTER NETWORKS; [ELECTRONIC MAIL SERVICES; PROVIDING ON-LINE CHAT ROOMS FOR TRANSMISSION OF MESSAGES AMONG COMPUTER USERS CONCERNING A WIDE VARIETY OF FIELDS]

- https://tsdr.uspto.gov/#caseNumber=76017079&caseSearchType=U...

> PROVIDING INFORMATION IN THE FIELD OF COMPUTERS VIA A GLOBAL COMPUTER NETWORK; PROVIDING A WIDE RANGE OF GENERAL INTEREST INFORMATION VIA COMPUTER NETWORKS

- https://tsdr.uspto.gov/#caseNumber=74574057&caseSearchType=U...

> computer software for use in the transfer of information and the conduct of commercial transactions across local, national and world-wide information networks

xp84•15m ago

And in case someone is wondering if Yahoo can be said to have abandoned the mark by disuse, it seems like http://isp.netscape.com is still up, so they have their bases covered.

Babkock•1h ago

Yeah that's just what we need, more AI shit, more slop slapped on top of Chromium.

thisislife2•1h ago

I've upvoted to encourage your initiative, but I personally will not support any "AI" software unless it 100% runs locally and supports old platforms and hardware. Otherwise it is nothing but another conduit to get access to, and suck all my personal data for surveillance capitalism.

felarof•1h ago

> 100% runs locally

Thank you! We have ollama integration already, you can run models locally and use that for AI chat.

a2128•1h ago

What models are actually recommended, and how useful is the browser when using them? "We have Ollama integration" isn't very useful when there's no information about which models you should use, what works with them, what doesn't, and honestly it feels disingenuous when projects market themselves as 100% private and local and cloud-free and everything stays on your computer when the intended use case is clearly to put an OpenAI API key and send everything to OpenAI

hannob•1h ago

Okay, maybe this is a stupid question, but: what is an agentic browser? You seem to assume that everyone knows what that means.

Is this a common and well-defined term that people use? I've never heard it.

It would appear to me from the context that it means something like "web browser with AI stuff tackled on".

felarof•1h ago

Thanks for asking - not a stupid question at all! I should have probably explained it at the top of my post.

By "agentic browser" we basically mean a browser with AI agents that can do web navigation tasks for you. So instead of you manually clicking around to reorder something on Amazon or fill out forms, the AI agent can actually navigate the site and do those tasks.

wild_egg•1h ago

Not to pull a "why should I use Dropbox when I have rsync" but why should we use this over adding a Playwright MCP to Claude Desktop or similar?

Does having access to Chromium internals give you any super powers over connecting over the Chrome Devtools Protocol?

felarof•49m ago

Yes, eventually we think there is more value of owning the entire stack than just be a MCP connector.

Few ideas we were thinking of: integrating a small LLM, building MCP store into browser, building a more AI friendly DOM, etc.

Even today, we use chrome's accessibility tree (a better representation of DOM for LLMs) which is not exposed via chrome extension APIs.

tcdent•1h ago

get with the times, bro

al_borland•47m ago

I first heard the term agentic about a month ago. I went from never hearing it, to hearing it 3 or 4 times in 2 days... one of which was on an internal town hall where I work, where leadership was simply using it as if the whole world already knew what it meant, instead of literally being the first time it was ever mentioned.

The tl;dr is that it's AI that makes decisions on its own.

kordlessagain•42m ago

Agents are LLM responses that are feed with tools, like calculate(expression). When it encounters a thing it needs to do to meet desired output, it will run the tool. That is defining a simple agentic workflow.

A complicated workflow may involve other tools. For example, the input to the LLM may produce something that tells it to set the user-agent to such and such as string:

  set_user_agent("Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36");

Other tools could be clicking on things in the page, or even injecting custom JavaScript when a page loads.

finolex•1h ago

This is cool! Congrats on launch! How do you store user data? Do you write to device? Curious if there's a basic.tech x nxtscape collab possible here where you can store each user's info to their dedicated PDS

felarof•1h ago

Thank you! Yeah all user data is just stored locally on device.

Oh cool, will look into basic.tech to understand more.

mahoro•1h ago

This is great, I'd like to test! Is there any recommendations on which ollama models works best with this kind of tasks?

felarof•1h ago

Qwen3 8B works pretty well. But for complex planning and navigation tasks, big models (GPT4.1, claude 3.7) are the still the best bet. We also let you use your own API keys for the big models.

mahoro•1h ago

Thanks, eager to try :)

lofaszvanitt•1h ago

Who needs this and why? In its current form this is DOA.

revskill•1h ago

When windows ?

felarof•1h ago

Hopefully in a month or two. Sorry!

awongh•1h ago

I think LLMs could have a reasonable chance at solving tab-related workflows (keeping track of tabs or the idea/concept of tabs) - that is tracking and sorting lots of small related research ideas.

Sort of like a backwards perplexity search. (LLM context is from open tabs rather than the tool that brings you to those tabs)

I built a tab manager extension a long time ago that people used but ran into the same problem- the concept of tab management runs deeper than just the tabs themselves.

felarof•46m ago

Yeah, I feel LLMs can finally solve the tab overload issue. I suffer from this constantly.

I added few features which I felt would be useful - easy way to organise and group tabs - simple way to save and resume sessions with selective context.

What are your problems that you would like to see solved?

psychoslave•34m ago

I think a large part of it is us, as user, we lake the appropriate discipline.

Resist the call to open in a tab every link in this article, overcome the fear of losing something if all these tabs lagging behind are closed right now without further consideration.

awongh•25m ago

I don't like the idea of letting the LLM run wild and categorize things directly, but in a tab-organizing view it would be useful to add more semantic sorting of the tabs- maybe it would enable something like multiple tab-view control panel: Show all the AI tabs. Show all the image diffusion tabs. Show all the LLM tabs. (so overlapping views of sets of tabs)

This would of course apply to not just open tabs but tabs I used to have open, where the LLM knows about my browsing history.

But I think I would want a non-chat interface for this. (of course at any time I could chat/ask a question as well)

rodolphoarruda•48m ago

This is the missing piece from Karpathy's keynote: the browser.

afeigenbaum•43m ago

What is on your roadmap?

felarof•34m ago

Some ideas here - https://nxtscape.feedbear.com/roadmap

feel free to add new or upvote. Want to build what people want :)

Agentic Misalignment: How LLMs could be insider threats

Addressing fear, uncertainty and doubt thrown at Element and Matrix

It's True: The Jaws Shark Is Public Domain

Honda successfully launched and landed its own reusable rocket

Open Sourcing Any Distance

Making Sense of a Noisy World

Inside the 'Dragon Age' Debacle That Gutted EA's BioWare Studio

MIT student prints AI polymer masks to restore paintings in hours

Engineered Meta-Cognitive Workflow Architecture for Windsurf

Postgres Weekly 19th June 2020

EU Eyes Ditching Microsoft Azure for France's OVHcloud

Show HN: Onri, the Google Map for micro-learning

Cluely raises $15M led by a16z

Haveli Investments to buy Couchbase for $1.5B

How Astronomers Will Deal With 60 Million Billion Bytes of Imagery

Ask HN: X account hacked again – no email when attacker changed the email? How?

YC: Digital Superintelligence, Multiplanetary Life, How to Be Useful [video]

Show HN: wasque – Lightweight Cloudlare Warp Proxy Container for Linux

Uncut Paper Currency

But, my Postgres (+ lakehouse) is free

US patent office wants AI to scan for prior art, but doesn't want to pay for it

Firefox features that help you plan a trip (and take it)

How Is Modular Democratizing AI Compute?

12 Years of Postgres Weekly with Peter Cooper, Ep28 on Talking Postgres Podcast

My experience going through a startup incubator

Kenton Varda on Safe AI-Assisted Coding and the Power of Cloudflare Workers

Jürgen Schmidhuber：the Father of Generative AI Without Turing Award

Elon Musk's robotaxi will have a human driver for 'safety' reasons

I wanted a Steam Deck for the living room so I made one [video]

Listen to Forests from Around the World

Agentic Misalignment: How LLMs could be insider threats

Addressing fear, uncertainty and doubt thrown at Element and Matrix

It's True: The Jaws Shark Is Public Domain

Honda successfully launched and landed its own reusable rocket

Open Sourcing Any Distance

Making Sense of a Noisy World

Inside the 'Dragon Age' Debacle That Gutted EA's BioWare Studio

MIT student prints AI polymer masks to restore paintings in hours

Engineered Meta-Cognitive Workflow Architecture for Windsurf

Postgres Weekly 19th June 2020

EU Eyes Ditching Microsoft Azure for France's OVHcloud

Show HN: Onri, the Google Map for micro-learning

Cluely raises $15M led by a16z

Haveli Investments to buy Couchbase for $1.5B

How Astronomers Will Deal With 60 Million Billion Bytes of Imagery

Ask HN: X account hacked again – no email when attacker changed the email? How?

YC: Digital Superintelligence, Multiplanetary Life, How to Be Useful [video]

Show HN: wasque – Lightweight Cloudlare Warp Proxy Container for Linux

Uncut Paper Currency

But, my Postgres (+ lakehouse) is free

US patent office wants AI to scan for prior art, but doesn't want to pay for it

Firefox features that help you plan a trip (and take it)

How Is Modular Democratizing AI Compute?

12 Years of Postgres Weekly with Peter Cooper, Ep28 on Talking Postgres Podcast

My experience going through a startup incubator

Kenton Varda on Safe AI-Assisted Coding and the Power of Cloudflare Workers

Jürgen Schmidhuber：the Father of Generative AI Without Turing Award

Elon Musk's robotaxi will have a human driver for 'safety' reasons

I wanted a Steam Deck for the living room so I made one [video]

Listen to Forests from Around the World

Show HN: Nxtscape – an open-source agentic browser

Comments