Show HN: Erdos – open-source, AI data science IDE

https://www.lotas.ai/erdos

77•jorgeoguerra•19h ago

Hey HN! We’re Jorge and Will from Lotas (https://www.lotas.ai/), and we’ve built Erdos, a secure AI-powered data science IDE that’s fully open source (https://www.lotas.ai/erdos).

A few months ago, we shared Rao, an AI coding assistant for RStudio (https://news.ycombinator.com/item?id=44638510). We built Rao to bring the Cursor-like experience to RStudio users. Now we want to take the next step and deliver a tool for the entire data science community that handles Python, R, SQL, and Julia workflows.

Erdos is a fork of VS Code designed for data science. It includes:

- An AI that can search, read, and write across all file types for Python, R, SQL, and Julia. Also, for Jupyter notebooks, we’ve optimized a jupytext system to allow the AI to make faster edits.

- Built-in Python, R, and Julia consoles accessible to both the user and AI

- Plot pane that tracks and organizes plots by file and time

- Database pane for connecting to and manipulating SQL or FTP data sources

- Environment pane for viewing variables, packages, and environments

- Help pane for Python, R, and Julia documentation

- Remote development via SSH or containers

- AI assistant available through a single-click sign-in to our zero data retention backend, bring your own key, or a local model

- Open source AGPLv3 license

We built Erdos because data scientists are often second-class citizens in modern IDEs. Tools like VS Code, Cursor, and Claude Code are made for software developers, not for people working across Jupyter notebooks, scripts, and SQL. We wanted an IDE that feels native to data scientists, while offering the same AI productivity boosts.

You can try Erdos at https://www.lotas.ai/erdos, check out our source code on our GitHub (https://github.com/lotas-ai/erdos), and let us know what features would make it more useful for your work. We’d love your feedback below!

Comments

Centigonal•18h ago

This is a good idea, although IMO source control, compute, and MLOps integration are bigger but less flashy pain points for data scientists than AI in notebooks.

If you're going to market Erdos as open source, then IMO there should be a github link somewhere on your website.

WillNickols•18h ago

Thanks for the suggestions - we'll definitely add those to the dev list. Also, the GitHub is https://github.com/lotas-ai/erdos (and it's on the download page but a bit small).

SamTinnerholm•18h ago

I can't tell how this differs to Cursor from your website. How is it different?

WillNickols•18h ago

A bunch of specific things below, but the main point is that it integrates a bunch of features that data scientists use that don't come with Cursor.

Specifics (mostly reproduced from above):

1. R/Python/Julia consoles accessible by the user and AI

2. Optimized jupytext system for editing notebooks efficiently

3. Plots pane for viewing and tracking plots

4. Databases pane for managing SQL/FTP connections

5. Environment pane for managing Python/R/Julia packages and environments

6. Help pane for documentation

7. An AI that interacts with all of that.

8. Open source AGPLv3

For me, the biggest difference in the AI usage is that the AI doesn't need to write one-off python scripts for everything and run them from the terminal because it can just use the console directly.

shuwan•17h ago

I think Rao is more appealing to me since Positron already has that kind of integration, while RStudio doesn’t. Plus, Posit probably won’t ever add an AI Chat feature to RStudio anyway.

WillNickols•17h ago

FWIW there's a bunch of stuff Erdos has that Positron doesn't (including having solved Positron's top 5 open GitHub issues):

1. Remote development via SSH or containers

2. AI that can connect to ChatGPT, local models, or our backend

3. In-line code execution for Qmd/Rmd files

4. Julia as a first class citizen

5. Multi-agent chats: as many AI sessions as you want and they’ll all run in parallel

6. Windows ARM64 builds

7. Open source AGPLv3 license

8. A bunch of other misc items including read-write data explorer for CSVs and TSVs, plots history sorted by file and time, searchable help, a command history tab, etc

Maybe the biggest difference going forward is that Positron was a ~2 year dev project, whereas Erdos reached feature parity (plus or minus some features) in about ~2 months and is now adding substantial brand new functionality every week.

shuwan•14h ago

Will, thanks for the explanation. This changes my view a lot. Will give it a try.

harvey9•17h ago

Do you have the option to run on a local model? Lots of firms don't want data or prompts going outside the local network

jorgeoguerra•17h ago

Yep — if you have a local model with an OpenAI-compatible v1/chat/completions endpoint (most local models have this option), you can route Erdos to use it in the Erdos AI settings.

vednig•17h ago

I see Google acquiring Iotas in the future, that's how good it gets

mritchie712•17h ago

We started with a product like this at Definite (https://www.definite.app/), but it became clear there weren't enough people willing to spend real money on a product like it when Cursor / VS Code already have good coverage on data science.

rubenvanwyk•4h ago

Not sure if self-promoting on every single analytics- or data-related thread is in line with the ethos of HN: "Please don't use HN primarily for promotion."

johannesf•16h ago

Have you done any fine-tuning or prompt-customization for the R-specific work? I've found the models worse on R when compared to Python, especially for more complex tasks. This looks cool, thanks for sharing!

WillNickols•16h ago

Nothing R specific. In my experience, Claude is pretty good about using tidyverse for everything. What was is flopping on for you? Our thought on not fine tuning models is that whatever comes out in 6 months is just going to be better than whatever we fine tuned.

buppermint•16h ago

Very cool. Any plans to add support for local models? This has what has prevented us from adopting Positron so far. We have sensitive data and sending to third party APIs is not an option (regardless of their stated retention policies).

jorgeoguerra•16h ago

Yeah, we just added support for local models. As I mentioned in an earlier comment, if you have a local model with an OpenAI-compatible v1/chat/completions endpoint (most local models have this option), you can route Erdos to use it in the Erdos AI settings.

puppycodes•15h ago

Looks interesting but i'm unclear what makes it "more accurate"?

jorgeoguerra•15h ago

When models edit the raw JSON behind a Jupyter notebook, they often mess up the cell structure by adding extra cells, misaligning code, or making bad edits. We fix this by giving the model the notebook in Jupytext format instead, which tends to make its edits cleaner and more accurate.

mkl•15h ago

The choice of name seems pretty bizarre. The famous Erdos [1] was a mathematician, not data scientist, computer scientist, or statistician.

[1] https://en.wikipedia.org/wiki/Paul_Erd%C5%91s

bigmadshoe•15h ago

He did contribute to/utilize probability theory. He came up during my undergrad probability class because of this: https://en.wikipedia.org/wiki/Probabilistic_method

jorgeoguerra•15h ago

Erdos is also widely considered as the most prolific and productive mathematician of all time (in terms of publications and collaborations). Hopefully you can be as productive with Erdos :)

mkl•8h ago

But productive with it in a different field from the person it's named after? That's weird. It seems disrespectful to him to name a product after him when its purpose is pretty much unrelated to his work.

thom•15h ago

Give me this, but with a very efficient, opinionated path to put models into production. Give me accessible PM and customer friendly documentation about features and model choices at every stage. Make it reusable and easy to modify. Make it robust and scalable at inference time, with metrics and dashboards tracking performance over time. This seems like optimising the bit that's already fun, but I see a lot of value in hand-holding a department through all the stodgy boring bits and getting high quality analysis repeatably into customer hands.

sosodev•12h ago

Does it support OpenRouter? I tried configuring OpenRouter as a "local model" but it seems to silently fail.

WillNickols•11h ago

Not yet - we need to change the header configuration for that to work (versus connecting to local models), but we'll have it available soon.

anigbrowl•11h ago

Apple Silicon only, might be worth mentioning on the download link.

jorgeoguerra•10h ago

Thanks for pointing that out - will fix it asap

dartharva•9h ago

I'm seeing a Windows download link?

jorgeoguerra•9h ago

The download button on the erdos/ page is OS specific, but you can also find all the download links in the download-erdos/ page.

agnosticmantis•10h ago

This looks very cool, I’m gonna try it later today.

Out of curiosity, why the name Erdos? AFAIK Erdos was neither a statistician, data scientist nor AI researcher.

He sure solved many probability/combinatorics problems and famously had many many collaborators.

jorgeoguerra•9h ago

No specific reason. Mainly because he was one of the most productive and collaborative mathematicians of all time. We actually considered "Poisson" at some point but ended up going with Erdos.

Show HN: Bash Screensavers

Show HN: Ordered – A sorted collection library for Zig

Show HN: JSON Query

Show HN: I was tired of people dmming me just "hi", so I made this - NoGreeting

Show HN: Dlog – Journaling and AI coach that learns what drives wellbeing (Mac)

Show HN: Erdos – open-source, AI data science IDE

Show HN: Git Auto Commit (GAC) – LLM-powered Git commit command line tool

Show HN: Write Go code in JavaScript files

Show HN: MyraOS – My 32-bit operating system in C and ASM (Hack Club project)

Show HN:Interactive RISC-V CPU Visualizer (Sequential and Pipelined)

Show HN: Linux Smart Directories Navigation

Show HN: Helium Browser for Android with extensions support, based on Vanadium

Show HN: Shadcn/UI theme editor – Design and share Shadcn themes

Show HN: nblm - Rust CLI/Python SDK for NotebookLM Enterprise automation

Show HN: Diagram as code tool with draggable customizations

Show HN: LLM Rescuer – Fixing the billion dollar mistake in Ruby

Show HN: Easily visualize torch, Jax, tf, NumPy, etc. tensors

Show HN: Whatdidido – CLI to summarize your work from Jira/Linear

Show HN: Learn Basic Chess Movements

Show HN: Action Engine — An API/Agent Buildkit Putting Flexibility First

Show HN: TrueType Rasterizer

Show HN: Vetr.is – Privacy-First Cloud in Iceland

Show HN: LinkPatrol – Free merchant-agnostic tool to find broken affiliate links

Show HN: Relai-SDK – simulate → evaluate → optimize AI agents

Show HN: Omnia OS, the Most Efficient Email Client Without AI

Show HN: OpenSkills - Run Claude Skills Locally Using Any LLM

Show HN: ChatHawk – Stop Copy-Pasting the Same Question Across Every AI Model

Show HN: Chonky – a neural text semantic chunking goes multilingual

Show HN: Ubik - A new way to use AI in citation-based work and research

Show HN: Pinpam, TPM2-backed pin authentication for Linux

Show HN: Erdos – open-source, AI data science IDE

Comments

Show HN: Bash Screensavers

Show HN: Ordered – A sorted collection library for Zig

Show HN: JSON Query

Show HN: I was tired of people dmming me just "hi", so I made this - NoGreeting

Show HN: Dlog – Journaling and AI coach that learns what drives wellbeing (Mac)

Show HN: Erdos – open-source, AI data science IDE

Show HN: Git Auto Commit (GAC) – LLM-powered Git commit command line tool

Show HN: Write Go code in JavaScript files

Show HN: MyraOS – My 32-bit operating system in C and ASM (Hack Club project)

Show HN:Interactive RISC-V CPU Visualizer (Sequential and Pipelined)

Show HN: Linux Smart Directories Navigation

Show HN: Helium Browser for Android with extensions support, based on Vanadium

Show HN: Shadcn/UI theme editor – Design and share Shadcn themes

Show HN: nblm - Rust CLI/Python SDK for NotebookLM Enterprise automation

Show HN: Diagram as code tool with draggable customizations

Show HN: LLM Rescuer – Fixing the billion dollar mistake in Ruby

Show HN: Easily visualize torch, Jax, tf, NumPy, etc. tensors

Show HN: Whatdidido – CLI to summarize your work from Jira/Linear

Show HN: Learn Basic Chess Movements

Show HN: Action Engine — An API/Agent Buildkit Putting Flexibility First

Show HN: TrueType Rasterizer

Show HN: Vetr.is – Privacy-First Cloud in Iceland

Show HN: LinkPatrol – Free merchant-agnostic tool to find broken affiliate links

Show HN: Relai-SDK – simulate → evaluate → optimize AI agents

Show HN: Omnia OS, the Most Efficient Email Client Without AI

Show HN: OpenSkills - Run Claude Skills Locally Using Any LLM

Show HN: ChatHawk – Stop Copy-Pasting the Same Question Across Every AI Model

Show HN: Chonky – a neural text semantic chunking goes multilingual

Show HN: Ubik - A new way to use AI in citation-based work and research

Show HN: Pinpam, TPM2-backed pin authentication for Linux