frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Claude Desktop spins up a VM without no way of stopping it

https://github.com/anthropics/claude-code/issues/29045
141•tonyrice•2h ago•81 comments

How JPL keeps the 13-year-old Curiosity rover doing science

https://spectrum.ieee.org/curiosity-rover-jpl-mars-science
72•pseudolus•1h ago•2 comments

I'm Eric Ries, author of "The Lean Startup" and new book "Incorruptible" – AMA

344•eries•4h ago•259 comments

Why SpaceX 2040 Revenue FCST $4.3T in highly unlikely

https://www.matteast.io/spacex-escape-velocity.html
104•meast•1h ago•75 comments

PgDog is funded and coming to a database near you

https://pgdog.dev/blog/our-funding-announcement
271•levkk•5h ago•136 comments

Policy on the AI Exponential

https://darioamodei.com/post/policy-on-the-ai-exponential
18•yjp20•49m ago•4 comments

L'Affaire Siloxane

https://mceglowski.substack.com/p/laffaire-siloxane
32•idlewords•1d ago•4 comments

GitHub Authentication issues related to API requests

https://www.githubstatus.com/incidents/fcj3088jg1wx
114•Multicomp•3h ago•23 comments

Building an HTML-first site doubled our users overnight

https://mohkohn.co.uk/writing/html-first/
819•edent•6h ago•374 comments

Mercedes‑Benz starts large‑scale production of electric axial flux motor

https://media.mercedes-benz.com/en/article/bebac2af-acdc-465a-9538-adb0bf3d8ccf
451•raffael_de•11h ago•272 comments

Meta steals a tactic from Tesla and builds data centers in tents

https://techcrunch.com/2026/06/04/meta-steals-a-tactic-from-tesla-and-builds-data-centers-in-tents/
24•gnabgib•2h ago•14 comments

Show HN: Extend UI – open-source UI kit for modern document apps

https://www.extend.ai/ui
34•kbyatnal•3h ago•4 comments

Anthropic's Model Naming, Extrapolated

https://samwilkinson.io/posts/2026-06-09-anthropics-model-naming-extrapolated
17•sammycdubs•40m ago•3 comments

Show HN: HelixDB – A graph database built on object storage

https://github.com/HelixDB/helix-db/tree/main
38•GeorgeCurtis•3h ago•21 comments

Apache Burr: Build reliable AI agents and applications

https://burr.apache.org/
117•anhldbk•4h ago•74 comments

The Dynamo and the Computer: The Modern Productivity Paradox (1989) [pdf]

https://www.almendron.com/tribuna/wp-content/uploads/2018/03/the-dynamo-and-the-computer-an-histo...
17•simonpure•1h ago•3 comments

All 9,300 Japanese train station, animated by the year it opened (1872–2026)

https://jivx.com/eki
144•momentmaker•7h ago•53 comments

A €0.01 bank transfer could compromise a banking AI agent

https://blue41.com/blog/how-we-helped-bunq-secure-their-financial-ai-assistant/
115•tvissers•5h ago•91 comments

DiffusionGemma: 4x Faster Text Generation

https://blog.google/innovation-and-ai/technology/developers-tools/diffusion-gemma-faster-text-gen...
173•meetpateltech•3h ago•38 comments

Who's the Smartest Corvid?

https://thetyee.ca/Culture/2026/06/05/Whos-the-Smartest-Corvid/
26•NaOH•1d ago•16 comments

Buy a train, bridge or tracks from the Swiss Railway

https://sbbresale.ch/
148•kisamoto•2d ago•75 comments

GeoLibre 1.0

https://geolibre.app/
8•jonbaer•1h ago•0 comments

The iPad was on Tailscale: a WebRTC debugging story

https://p2claw.com/blog/2026-06-09-the-ipad-was-on-tailscale/
41•syllogistic•4h ago•20 comments

Who Runs Your Rust Future? Hands-On Intro to Async Rust

https://aibodh.com/posts/async-rust-chapter-1-hands-on-intro-to-async-rust/
78•febin•2d ago•15 comments

The Case for Free Online Books (2014)

http://from-a-to-remzi.blogspot.com/2014/01/the-case-for-free-online-books-fobs.html
73•jimsojim•2h ago•65 comments

'They take you out of life, out of time': a journey into Spain's cave paintings

https://www.theguardian.com/science/2026/jun/02/journey-into-spain-palaeolithic-cave-paintings-al...
45•NaOH•2d ago•19 comments

Reviving Papers with Code

https://paperswithcode.co/
172•nielz_r•2d ago•40 comments

The Last Evolution, by John W Campbell Jr. (1932)

https://www.gutenberg.org/files/27462/27462-h/27462-h.htm
15•cf100clunk•3h ago•0 comments

macOS Container Machines

https://github.com/apple/container/blob/main/docs/container-machine.md
1144•timsneath•18h ago•396 comments

Ask HN: Are most corporate SWE jobs performative?

158•hnthrow10282910•6h ago•178 comments
Open in hackernews

Show HN: HelixDB – A graph database built on object storage

https://github.com/HelixDB/helix-db/tree/main
38•GeorgeCurtis•3h ago
Hey HN, it’s been just over a year since we launched HelixDB (https://news.ycombinator.com/item?id=43975423), a project a friend and I started in college. It’s an OLTP graph database built on object-storage, with native vector search and full-text search (FTS).

Why graph, vector and FTS? Graph databases provide a natural cognitive model for data, vectors allow for a semantic understanding of the entities and relationships in the graph, and FTS provides more specific filtering. Many AI-driven applications attempt to combine all of these functionalities by stitching together multiple disconnected systems, but even then there’s no native way to perform joins or queries that span all systems. You still need to handle this logic at the application level.

Helix started as a graph DB, but we moved to a hybrid graph/vector approach after attempting to build an AI memory system, which led us down the GraphRAG and HybridRAG rabbit hole, where we would need separate graph and vector databases.

We knew scalability would be a challenge at each stage of our product's development, however our initial focus this past year was to prove out the product through local deployments and was only meant to be run on a single node. Scaling graph DBs remained a difficult and expensive problem we’d have to solve later. Some common ways other graph DBs solve scaling is by duplicating entire datasets across distributed machines (extremely expensive per node), or by sharding the data.

Sharding databases is effective and affordable, however, graph data doesn’t have explicit partitions like relational databases do. For example, sharding a relational DB involves splitting up tables. When it comes to graph DBs, the edges can span across any of the partitions, and hopping across multiple machines when traversing nodes is ineffective and computationally expensive.

Replicating graph DBs for high availability and better throughput drastically increases the operational cost of the db and still has a limit of how big you can vertically scale. The workload that we’re used for requires storing a huge amount of data for agents, where only a subset of that data is ever needed at any one time. So rather than having the whole thing in memory, we can store it all in object-storage and get the bits we need when they’re needed.

Agents benefit from better context, which is achieved from more and better data (more relationships etc). By using S3 as the persistence/data layer there is no limit to how big the graph can be or how many relationships you can have, and we can scale to serve throughput and requests by horizontally spinning up nodes and caching relevant subsets of the graph on each node. This way, you get extremely low latency for “hot” data and a p99 of ~100ms for writes and ~50ms for reads from cold storage (S3). Plus you get the benefit of dirt cheap storage.

Workloads that HelixDB is currently supporting: - Huge amounts of data (TBs) from which the agents need to search and traverse over - Offering affordable graph storage for companies where cost of graph data is a bottleneck - Consolidating multiple databases, enabling AI agents to have autonomy over companies, helping them become more autonomous. - AI memory - Company brains

We’re currently working on our own generalised AI memory layer which will use HelixDB under the hood and be completely open-source. Also, we’re finishing up on pre-filtering for vector search which will allow you to pre-filter based on relationships in the graph, metadata, and sub-graphs. And lastly, GA cloud will be available in the coming weeks.

If you want to run Helix locally (either on-disk or in-memory), you can find more info on our github (https://github.com/HelixDB/helix-db) or via our docs (https://docs.helix-db.com/database/local-development). If you’re interested in getting started with our distributed cloud, please email us founders@helix-db.com.

Many thanks! Comments and feedback welcome!

Comments

brene•3h ago
How does this compare vs. Turbopuffer?
GeorgeCurtis•3h ago
We see comparable results for vectors and FTS.

For vector search we have warm and cold p99s of approx 20ms and 400ms respectively. For FTS, warm and cold query p99s of approx 15ms and 250ms respectively.

Both of these benchmarks were run on 1m docs.

mentioum•3h ago
We've been having some issues with intermittent performance on multi hop queries.

What's your p99 like for multi hops?

GeorgeCurtis•3h ago
In prod we see p99’s of <10ms ms for warm queries and around 50ms per hop for cold queries.
mentioum•2h ago
Hmmm... I'll get in touch. Got an email i can reach out to, there doesn't seem to be one listed on your website?

I'm more concerned about if the p99s stay consistent when things get spikey.

dgraph is fine otherwise...

GeorgeCurtis•19m ago
Sure! You can email me personally at george@helix-db.com
zw17•2h ago
If your use case is OLAP based, please check it out PuppyGraph. It’s a graph query engine that sits on top of your Lakehouse (no ETL required). Our benchmark has shown consistently that 10-hop queries across billions of edges in <2 seconds. Our customers including some most data demanding companies like Coinbase, Datadog, Palo Alto Network, Netskope, AMD, etc.
GeorgeCurtis•2h ago
PuppyGraph is a good fit for OLAP for sure.

We’re just two young founders sharing what we’ve been building, so I’ll take the drive-by competitor plug as a compliment :)

Definitely a different focus though. Helix is OLTP, built for operational graph + vector workloads, especially apps/agent memory where low-latency traversals and writes are concerned.

jauntywundrkind•38m ago
And is open source.
mentioum•2h ago
It's not, its actually our prod db with direct user usage - we self host a large dgraph cluster. We have a very large number of people manage their car and car histories with us and host a full replica of the UK MOT Database.

We're fine with clickhouse and redshift for the OLAP work we do. I've been looking at ParaQuery lately if I really want to speed that up.

GeorgeCurtis•2h ago
This sounds like a perfect usecase. Would love to learn more and see if we can help!

email us: founders@helix-db.com

maxrumpf•3h ago
does it support fts/vector on edges of the graph?
GeorgeCurtis•3h ago
Yes you can put vectors, full text data, secondary and range indexes on both nodes and edges.
raufakdemir•2h ago
what language does this support? cypher/gremlin?
GeorgeCurtis•2h ago
We don't support cypher or gremlin. We can

You can query HelixDB using JSON or directly in your programming language of choice by using our Rust, TypeScript, Go or Python SDKs. We’ve found AI is very good at working with the SDKs and JSON itself to query, making the development experience much better than before: https://docs.helix-db.com/database/querying

Bnjoroge•1h ago
congrats! how does this compare to turbopuffer, surreal or other multi-model ones built on object storage or not
GeorgeCurtis•26m ago
tpuffer is a vector/fts database. Surreal is a bit of an "everything database".

We're a graph database with vector and FTS capabilities. Our vector and FTS benchmarks are comparable with tpuffer, but you would primarily use us for building whole applications, knowledge graphs, or AI memory/retrieval. Anything that is relationship intense.

Let me know if this properly answers your question

cjlm•42m ago
Currently #5 on gdb-engines.com - definitely worth a look.
GeorgeCurtis•38m ago
yooo this is awesome. Didn't even realise :)
rajit•37m ago
when will the graph memory layer be available?
GeorgeCurtis•21m ago
We plan on launching end of month.