Building SQLite with a small swarm

https://kiankyars.github.io/machine_learning/2026/02/12/sqlite.html
39•kyars•2h ago
Hope some find this post interesting on my experience with parallel coding agents.

Comments

scirob•1h ago
Did they pass all unit tests in the end?
mdavid626•1h ago
It doesn’t matter, just jump on the hype train!
shoo•38m ago
or jump off, and instead grab onto the (well-deserved) sqlite-test-suite hype train.
mdavid626•14m ago
(I'm being sarcastic.)
comex•1h ago
If it works, then it’s impressive. Does it work? Looking at test.sh, the oracle tests (the ones compared against SQLite) seem to consist in their entirety of three trivial SELECT statements. SQLite has tens of thousands of tests; it should be possible to port some of those over to get a better idea of how functional this codebase is.
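For readers unfamiliar with the term, an oracle test runs the same SQL against real SQLite and against the engine under test, then compares the rows. Below is a minimal sketch using Python's standard `sqlite3` module as the oracle. The `Engine` callable and the `diff_against_oracle` helper are hypothetical names for illustration; they are not taken from the project's test.sh.

```python
import sqlite3
from typing import Callable, Iterable, List, Tuple

# Hypothetical interface: SQL text in, list of result rows out.
Engine = Callable[[str], List[Tuple]]


def sqlite_oracle() -> Engine:
    """Return an engine backed by a fresh in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")

    def run(sql: str) -> List[Tuple]:
        return conn.execute(sql).fetchall()

    return run


def diff_against_oracle(candidate: Engine, statements: Iterable[str]) -> List[str]:
    """Run each statement on both engines; return the statements that disagree."""
    oracle = sqlite_oracle()
    mismatches = []
    for sql in statements:
        expected = oracle(sql)
        try:
            actual = candidate(sql)
        except Exception:
            # A crash in the candidate counts as a mismatch.
            mismatches.append(sql)
            continue
        # Exact comparison, so queries should use ORDER BY to fix row order.
        if actual != expected:
            mismatches.append(sql)
    return mismatches
```

With a harness of this shape, porting more of SQLite's own test cases mostly means feeding in more statements.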

Edit: I looked over some of the code.

It's not good. It's certainly not anywhere near SQLite's quality, performance, or codebase size. Many elements are the most basic thing that could possibly work, or else missing entirely. To name some examples:

- Absolutely no concurrency.

- The B-tree implementation has a line "// TODO: Free old overflow pages if any."

- When the pager adds a page to the free list, it does a linear search through the entire free list (which can get arbitrarily large) just to make sure the page isn't in the list already.

- "//! The current planner scope is intentionally small: - recognize single-table `WHERE` predicates that can use an index - choose between full table scan and index-driven lookup."

- The pager calls clone() on large buffers, which is needlessly inefficient, kind of a newbie Rust mistake.
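To illustrate the free-list point in the list above: a linear duplicate check makes each release cost time proportional to the length of the list, while keeping a set next to the list makes the check constant time on average. The sketch below is in Python rather than the project's Rust, covers only an in-memory structure, and uses a hypothetical `FreeList` class; it is not the project's pager code.

```python
from typing import List, Optional, Set


class FreeList:
    """Free page numbers in release order, with a set for membership checks."""

    def __init__(self) -> None:
        self._order: List[int] = []    # pages available for reuse
        self._members: Set[int] = set()  # same pages, for O(1) average lookup

    def release(self, page_no: int) -> bool:
        """Mark a page as free; return False if it was already free."""
        if page_no in self._members:   # no scan over the whole list
            return False
        self._members.add(page_no)
        self._order.append(page_no)
        return True

    def allocate(self) -> Optional[int]:
        """Hand out the most recently freed page, or None if none are free."""
        if not self._order:
            return None
        page_no = self._order.pop()
        self._members.remove(page_no)
        return page_no
```

The trade-off is extra memory for the set; in Rust the analogous structure would be a `HashSet` kept next to the `Vec`.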

However…

It does seem like a codebase that would basically work. At a large scale, it has the necessary components and the architecture isn't insane. I'm sure there are bugs, but I think the AI could iron out the bugs, given some more time spent working on testing. And at that point, I think it could be perfectly suitable as an embedded database for some application as long as you don't have complex needs.

In practice, there is little reason not to just reach for actual SQLite, which is much more sophisticated. But I can think of one possible reason: SQLite has been known to have memory safety vulnerabilities, whereas this codebase is written in Rust with no unsafe code. It might eat your data, but it won't corrupt memory.

That is impressive enough for now, I think.

wedog6•1h ago
SQLite is tested against failure to allocate at every step of its operation: running out of memory never causes it to fail in a serious way, e.g. data loss. It's far more robust than almost every other library.
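The general technique here is fault injection in a loop: fail the first allocation, then the second, and so on, checking after each failure that state is still consistent. A rough Python sketch of that idea follows. `FailingAllocator`, `sweep_oom`, and the callbacks are hypothetical names for illustration; this is not SQLite's actual test harness.

```python
from typing import Callable


class FailingAllocator:
    """Stand-in allocator that raises MemoryError on the Nth allocation."""

    def __init__(self, fail_at: int) -> None:
        self.fail_at = fail_at
        self.calls = 0

    def alloc(self, size: int) -> bytearray:
        self.calls += 1
        if self.calls == self.fail_at:
            raise MemoryError(f"injected failure at allocation {self.calls}")
        return bytearray(size)


def sweep_oom(
    operation: Callable[[FailingAllocator], None],
    check_invariants: Callable[[], bool],
    max_allocs: int = 1000,
) -> int:
    """Fail allocation 1, 2, 3, ... until the operation no longer reaches
    the failure point. Returns the number of failure points exercised."""
    for n in range(1, max_allocs + 1):
        allocator = FailingAllocator(fail_at=n)
        try:
            operation(allocator)
        except MemoryError:
            # The operation gave up; the state it left behind must be valid.
            assert check_invariants(), f"corrupt state after failure {n}"
            continue
        if allocator.calls < n:
            return n - 1  # finished before reaching the Nth allocation
        # Otherwise the operation recovered internally; keep sweeping.
    raise RuntimeError("operation never completed within max_allocs")
```

The invariant check is where the real work lies: it has to confirm that no partial write or half-updated structure survived the failed attempt.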
losteric•1h ago
This blog post doesn't say anything about your experience.

How well does the resulting code perform? What are the trade-offs/limitations/benefits compared to SQLite? What problems does it solve?

Why did you use this process? This mixture of models? Why is this a good setup?

mrmrcoleman•1h ago
Take a look at SQLite’s test coverage. It’s impressive: https://sqlite.org/testing.html

SQLite has 590x as much test code as application code.

echelon•14m ago
The fact that AI agents can even build something that purports to be a working database is also impressive.

A small, highly experienced team steering Claude might be able to replicate the architecture and test suite reasonably quickly.

1-shotting something that looks this good means that with a few helping hands, small teams can likely accomplish decades of work in mere months.

Small teams of senior engineers can probably begin to replicate entire companies' worth of product surface area.

samrus•1h ago
I can't quite tell if the tests that passed were SQLite's own famously thorough test suite, or your own.

If it's SQLite's suite, then it's great the models managed to get there, but one issue (without trying to be too pessimistic) is that the models had the test suite there to validate against. SQLite's devs famously spend more of their time writing the tests than building the functionality. If we can get AI that reliably defines the functionality of such programs by building the test suite over years of trial and error, then we'll have what people are claiming.

comrade1234•1h ago
What's the point of building something that already exists in open source? It's just going to use code that already exists. There are probably dozens of examples written by humans that it can pull from.
bob1029•53m ago
> 84 / 154 commits (54.5%) were lock/claim/stale-lock/release coordination.

Parallelism over one code base is clearly not very useful.

I don't understand why going as fast as possible is the goal. We should be trying to be as correct as possible. The whole point is that these agents can run while we sleep. Convergence is nonlinear. You want every step to be in the right direction. Think of it more as a series of crystalline database transactions that must unroll in perfect order than a big pile of rocks that needs to be moved from A to B.

k33n•9m ago
> There isn’t a great way to record token usage since each platform uses a different format, so I don’t have a grasp on which agent pulled the most weight

lol

gmerc•6m ago
Why do people fall for this? We're compressing knowledge, including the source code of SQLite, into storage, then retrieving it and shifting it along latents at tremendous cost.
