Build Your Own Database

https://www.nan.fyi/database

204•nansdotio•3h ago

Comments

4ndrewl•2h ago

> Databases were made to solve one problem:

> "How do we store data persistently and then efficiently look it up later?"

Isn't that two problems?

dayjaby•2h ago

Store data persistently so it can be looked up efficiently* sounds like a single problem.

SirFatty•2h ago

Definitely two.

cjbgkagh•2h ago

It’s not persistent if it can’t be recovered later

stvltvs•1h ago

Puts message in a bottle and tosses into the most convenient black hole.

BetaDeltaAlpha•1h ago

Doesn't the black hole compresses the bottle beyond recovery?

stvltvs•26m ago

Not necessarily, opinions vary.

https://www.sciencenewstoday.org/do-black-holes-destroy-or-s...

SahAssar•1h ago

"Store data persistently" implies "it can be looked up" since if you cannot look it up it is impossible to know if it is stored persistently.

The "efficiently" part can be considered a separate problem though.

prerok•1h ago

Well, if you just want to store data, you can use files. Lookup is a bit tedious and inefficient.

So, if we consider that persistent storage is a solved problem, then we can say that the reason for databases was how to look up data efficiently. In fact, that is why they were invented, even if persistent storage is a prerequisite.

nonethewiser•51m ago

How about "store data in certain way." That sounds more like 1 problem and encompasses an even larger problem space.

grokgrok•1h ago

How do we reconstruct past memory states? That's the fundamental problem.

Efficiency of storage or retrieval, reliability against loss or corruption, security against unwanted disclosure or modification are all common concerns, and the relative values assigned to these features and others motivate database design.

kiitos•1h ago

> How do we reconstruct past memory states? That's the fundamental problem.

reconstructing past memory states is rarely, if ever, a requirement that needs to be accommodated in the database layer

nonethewiser•53m ago

Can you elaborate? That certainly seems to be what happens in a typical crud app. You have some model for your data which you persist so that it can be loaded later. Perhaps partially at times.

In another context perhaps you're ingesting data to be used in analytics. Which seems to fit the "reconstruct past memory stat" less.

i_k_k•1h ago

I always wanted to ship a write-only database. Lightning fast.

elygre•1h ago

Back in the 80s a professor at our college got a presentation on the concept of «write-only memory» accepted for some symposium.

Good times.

pcdevils•1h ago

Pretty much how eventstoredb works. Deleting data fully only happens at scavenge which rewrites the data files.

hxtk•57m ago

I think it was a joke. It sounds like you read it as append-only, like most LSM tree databases (not rewriting files in the course of write operations), but I think GP meant it as write-only to the exclusion of reads, roughly equivalent to `echo $data > /dev/null`

datadrivenangel•30m ago

I've forgotten how to count that low. [0]

0 - https://www.youtube.com/watch?v=3t6L-FlfeaI

archerx•1h ago

That would be useful for logging.

warkdarrior•1h ago

If it's write-only, and no reads ever happen, one can write to /dev/null without loss of utility.

mewpmewp2•59m ago

It would be good for before going to sleep then.

Etheryte•19m ago

Also useful for backups, so long as you don't need to restore.

pratik661•1h ago

This is analogous to an elevator that’s unidirectional

rzzzt•1h ago

One that lets people enter. We will figure out exiting later, with exiting on a different floor as a stretch goal.

theideaofcoffee•10m ago

Or just a paternoster

nonethewiser•47m ago

It's amusing to me that this is really quite a pedantic observation yet it's driving very earnest engagement from hackernews. Myself included. Absolutely nothing in this article is riding on if its 1 or 2 problems - it's an aside at best. Yet I'm still trying to think through if it's 1 or 2. I mean, the "and" is right there - that clearly suggests two. It's almost comical even, to say "Here is one problem: X and Y." Yet in another way it seems like 2 sides of the same coin.

I guess there is a rather fine line between philosophy and pedantry.

Maybe we can think about it from another angle. If they are 2 problems databases were designed to solve, then that means this is a problem databases were designed to solve: storing data persistently.

Is that really a problem database were designed to solve? Not really. We had that long before databases. It was already solved. It's a pretty fundamental computer operation. Isn't it fair to say this is one thing? "Storing data so it can be retrieved efficiently."

gingersnap•30m ago

You're thinking of regex

mrighele•21m ago

It is a single problem that contains two smaller problems, but the actual hard part (a third problem, if you wish) is putting them together. If you limit yourself to solve those two problems independently you won't have a (useful) database.

cube2222•1h ago

I clicked through a couple of the articles in the OP, and I must say, the design and animations are extremely pretty!

Kudos for that!

235ylkj•1h ago

Here's a simple key-value store inspired by D.B. Cooper:

  ~/bin/cooper-db-set
  ===================
  #! /bin/bash

  key="$1"
  value="$2"

  echo "${key}:${value}" >> /dev/null


  ~/bin/cooper-db-get
  ===================
  #! /bin/bash

  key="$1"
  </dev/null awk -F: -v key="$key" '$1 == key {result = $2} END {print result}'

MathMonkeyMan•40m ago

/dev/null is persistent across restarts and cache friendly, so it's got you covered.

skeptrune•1h ago

I love the design and examples in this post. Easy to read for sure.

Exercises like this also seem fun in general. It's a real test of how much you know to start anything from scratch.

kevinqi•45m ago

my only minor critique is using lorem ipsum examples. It tends to make me want to gloss over instead of reading; I prefer seeing realistic data. other than that, it's a really cool post

ashleyn•36m ago

I was tempted to knee-jerk dismiss this as "don't write your own database, don't even use a KV database, just use SQL". And then I remembered the only reason I'd say this is because I went through designing my own DB or using KV databases just to avoid SQL...only to realise i was badly reinventing SQL. It could be worth the lesson.

FpUser•55m ago

>Problem. How do we store data persistently and then efficiently look it up later?"

I would say without transactions it is not a database yet from a practical standpoint.

dangoodmanUT•39m ago

I think a lot of databases would disagree

FpUser•31m ago

You might be on to something here ;)

alecco•23m ago

But they are web scale!

myth_drannon•27m ago

I also recommend this free online book to build a database https://build-your-own.org/database/

DiabloD3•25m ago

It looks like it got hugged to death already.

winrid•14m ago

Needs a faster database

keybored•24m ago

Part of the reason why I'm not a "maker" is because my mind gets ahead of me with all the things that I would need to do in order to do things properly. So the article starts out interesting and then gets more and more, well, not exactly stressful but I get a bit weary by it.

Not that I would aspire to implement a general-purpose database. But even smaller tasks can make my mind spin too much.

browningstreet•16m ago

I don't disagree with your take in general, but I do think it's different reading about minutiae than being invested in it. If you actually are curing these requirements it's probably quite engaging. If not, the eyes and mind start to gloss over them.

As a different example: I'm moving this week. I've known I'm moving for a while. Thinking about moving -- and all the little things I have to do -- is way more painful than doing them. Thinking about them keeps me up at night, getting through my list today is only fractionally painful.

I'm also leveling up a few aspects of my "way of living" in the midst of all this, and it'd be terribly boring to tell others about it, but when next Monday comes.. it'll be quite sweet indeed.

keybored•11m ago

> As a different example: I'm moving this week. I've known I'm moving for a while. Thinking about moving -- and all the little things I have to do -- is way more painful than doing them. Thinking about them keeps me up at night, getting through my list today is only fractionally painful.

this sounds familiar... :)

chrisallick•12m ago

if author is reading, can you add an rss feed to your site? i want to add to feedly.

constantcrying•7m ago

I absolutely love this "first principles" approach of explaining a topic. You can really go through this and at each time understand what problem needs to be solved and what other problems this introduces, until you get at a reasonably satisfying solution.

Build Your Own Database

LLMs can get "brain rot"

Neural audio codecs: how to get audio into LLMs

Doomsday Scoreboard

Do not accept terms and conditions

Foreign hackers breached a US nuclear weapons plant via SharePoint flaws

NASA chief suggests SpaceX may be booted from moon mission

Show HN: Katakate – Dozens of VMs per node for safe code exec

Mathematicians have found a hidden 'reset button' for undoing rotation

Our modular, high-performance Merkle Tree library for Rust

Wikipedia says traffic is falling due to AI search summaries and social video

Ilo – a Forth system running on UEFI

Flexport Is Hiring SDRs in Chicago

Getting DeepSeek-OCR working on an Nvidia Spark via brute force with Claude Code

Magit Is Amazing

Minds, brains, and programs (1980) [pdf]

Diamond Thermal Conductivity: A New Era in Chip Cooling

Time to build a GPU OS? Here is the first step

ChatGPT Atlas

Binary Retrieval-Augmented Reward Mitigates Hallucinations

The Programmer Identity Crisis

StarGrid: A new Palm OS strategy game

AWS multiple services outage in us-east-1

Apple alerts exploit developer that his iPhone was targeted with gov spyware

What do we do if SETI is successful?

Show HN: ASCII Automata

Show HN: bbcli – A TUI and CLI to browse BBC News like a hacker

60k kids have avoided peanut allergies due to 2015 advice, study finds

The death of thread per core

RF Shielding History: When the FCC Cracked Down on Computers

Build Your Own Database

LLMs can get "brain rot"

Neural audio codecs: how to get audio into LLMs

Doomsday Scoreboard

Do not accept terms and conditions

Foreign hackers breached a US nuclear weapons plant via SharePoint flaws

NASA chief suggests SpaceX may be booted from moon mission

Show HN: Katakate – Dozens of VMs per node for safe code exec

Mathematicians have found a hidden 'reset button' for undoing rotation

Our modular, high-performance Merkle Tree library for Rust

Wikipedia says traffic is falling due to AI search summaries and social video

Ilo – a Forth system running on UEFI

Flexport Is Hiring SDRs in Chicago

Getting DeepSeek-OCR working on an Nvidia Spark via brute force with Claude Code

Magit Is Amazing

Minds, brains, and programs (1980) [pdf]

Diamond Thermal Conductivity: A New Era in Chip Cooling

Time to build a GPU OS? Here is the first step

ChatGPT Atlas

Binary Retrieval-Augmented Reward Mitigates Hallucinations

The Programmer Identity Crisis

StarGrid: A new Palm OS strategy game

AWS multiple services outage in us-east-1

Apple alerts exploit developer that his iPhone was targeted with gov spyware

What do we do if SETI is successful?

Show HN: ASCII Automata

Show HN: bbcli – A TUI and CLI to browse BBC News like a hacker

60k kids have avoided peanut allergies due to 2015 advice, study finds

The death of thread per core

RF Shielding History: When the FCC Cracked Down on Computers

Build Your Own Database

Comments