frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Elasticsearch Was Never a Database

https://www.paradedb.com/blog/elasticsearch-was-never-a-database
29•jamesgresql•4d ago

Comments

jamesgresql•4d ago
I know it sounds obvious, but some people are pretty determined to us it that way!
unethical_ban•1h ago
I work in infosec and several popular platforms use elasticsearch for log storage and analysis.

I would never. Ever. Bet my savings on ES being stable enough to always be online to take in data, or predictable in retaining the data it took in.

It feels very best-effort and as a consultant, I recommend orgs use some other system for retaining their logs, even a raw filesystem with rolling zips, before relying on ES unless you have a dedicated team constantly monitoring it.

toenail•1h ago
Dunno, I've had three node clusters running very stable for years. Which issues did you have that require a full team?
unethical_ban•1h ago
To be fair, I think it is chronically underprovisioned clusters that get overwhelmed by log forwarding. I wasn't on the team that managed the ELK stack a decade ago, but I remember our SOC having two people whose full time job was curating the infrastructure to keep it afloat.

Now I work for a company whose log storage product has ES inside, and it seems to shit the bed more often than it should - again, could be bugs, could be running "clusters" of 1 or 2 instead of 3.

toenail•1h ago
I'm not even sure "get overwhelmed" is a problem, unless you need real time analytics. But yeah, sounds like a resources issue.
xeraa•24m ago
There are no 2-node clusters (it needs a quorum). If your setup has 2-node clusters, someone is doing this horribly wrong.
PedroBatista•1h ago
Even most toy databases "built in a weekend" can be very stable for years if:

- No edge-case is thrown at them

- No part of the system is stressed ( software modules, OS,firmware, hardware )

- No plug is pulled

Crank the requests to 11 or import a billion rows of data with another billion relations and watch what happens. The main problem isn't the system refusing to serve a request or throwing "No soup for you!" errors, it's data corruption and/or wrong responses.

toenail•1h ago
I'm talking about production loads, but thanks.
pixl97•18m ago
Production loads mean a lot of different things to a lot of different people.
kentm•1h ago
Do you happen to know if ES was the only storage? Its been almost 8 years, but if I was building a log storage and analysis system, then I'd push the logs to S3 or some other object store and build an ES index off of that S3 data. From the consumer's perspective, it may look like we're using ES to store the data, but we have a durable backup to regenerate ES if necessary.
cyberpunk•49m ago
Meh i run hundreds of es nodes, its gotten a lot more friendly these days, but yes it can be a bit unforgiving at times.

Turns out running complicated large distributed systems requires a bit more than a ./apply, who would have guessed it?

lvspiff•1h ago
Everything is a database if you believe hard enough

Feel like the christmas story kid --

>simplicity, and world-class performance, get started with XXXXXXXX.

A crummy commercial?

PedroBatista•1h ago
I really never understood how people could store very important information in ES like it was a database.

Even if they don't understand what ES is and what a "normal" database is, I'm sure some of those people run into issues where their "db" got either corrupted of lost data even when testing and building their system around it. This is and was general knowledge at the time, it was no secret that from time to time things got corrupted and indexes needed to be rebuilt.

Doesn't happen all the time, but way greater than zero times and it's understandable because Lucene is not a DB engine or "DB grade" storage engine, they had other more important things to solve in their domain.

So when I read stories of data loss and things going South, I don't have sympathy for anyone involved other than the unsuspecting final clients. These people knew or more or less knew and choose to ignore and be lazy.

kentm•1h ago
> I really never understood how people could store very important information in ES like it was a database.

I agree.

Its been a while since I touched it, but as far as I can remember ES has never pretended to be your primary store of information. It was mostly juniors that reached for it for transaction processing, and I had to disabuse them of the notion that it was fit for purpose there.

ES is for building a searchable replica of your data. Every ES deployment I made or consulted sourced its data from some other durable store, and the only thing that wrote to it were replication processes or backfills.

vjerancrnjak•39m ago
They market it as a general purpose store. Successfully, even though hc cs wizards wouldn’t touch it ever, c suite likes it

Best example is IoT marketing, as if it can handle the load without bazillion shards, and since when does a text engine want telemetry

toenail•1h ago
I think elastic always clearly documented to expect "eventual consistency", they never claimed to be a "database" in the sense that tfa defines.
xeraa•27m ago
First step of a marketing campaign: Claim something never said and then tell everyone why it's wrong ;)
roywiggins•1h ago
> Elastic has been working on this gap. The more recent ES|QL introduces a similar feature called lookup joins, and Elastic SQL provides a more familiar syntax (with no joins). But these are still bound by Lucene’s underlying index model. On top of that, developers now face a confusing sprawl of overlapping query syntaxes (currently: Query DSL, ES|QL, SQL, EQL, KQL), each suited to different use cases, and with different strengths and weaknesses.

I suppose we need a new rule, "Any sufficiently successful data store eventually sprouts at least one ad hoc, informally-specified, inconsistency-ridden, slow implementation of half of a relational database"

kayo_20211030•1h ago
... and then becomes an email client (https://en.wikipedia.org/wiki/Jamie_Zawinski#Zawinski%27s_La...). A two-fer. lol.
esafak•21m ago
ICYMI https://en.wikipedia.org/wiki/Greenspun's_tenth_rule
xeraa•18m ago
Funny argument on the query languages in hindsight, since the latest release (https://www.paradedb.com/blog/paradedb-0-20-0 but that was after this blog) just completely changed the API. To be seen how many different API versions you get if you make it to 15 years ;)

PS: I've worked at Elastic for a long time, so it is fun to see the arguments for a young product.

speedgoose•1h ago
Accenture managed to build a data platform for my company with Elasticsearch as the primary database. I raised concerns early during the process but their software architect told me they never had any issues. I assume he didn’t lie. I was only an user so I didn’t fight and decided to not make my work rely on their work.
CuriouslyC•33m ago
Elastic feels about as much like a primary data store as Mongo, FWIW.
victor106•23m ago
> Accenture

They messed up a $30 million dollar project big time at a previous company. My cto swore to never recommend them

cluckindan•1h ago
”That means a recently acknowledged write may not show up until the next refresh.”

Which is why you supply the parameter

  refresh: ”wait_for”
in your writes. This forces a refresh and waits for it to happen before completing the request.

”schema migrations require moving the entire system of record into a new structure, under load, with no safety net”

Use index aliases. Create new index using the new mapping, make a reindex request from old index to new one. When it finishes, change the alias to point to the new index.

The other criticisms are more valid, but not entirely: for example, no database ”just works” without carefully tuning the memory-related configuration for your workload, schema and data.

nkmnz•1h ago
It took me years before I started tuning the memory-related configuration of postgres for workload, schema and data, in any way. It "just works" for the first ten thousand concurrent users.
cluckindan•1h ago
Well, most people working on a car don’t have a car lift: it only makes sense when you need to safely work on a large volume of cars. If you only work on one or two, a jack and a pile of wood works just fine.
kamma4434•1h ago
Modern JVMs are pretty effective in most scenarios right out of the box.
_joel•34m ago
I just tend to use https://github.com/le0pard/pgtune
stefanon•1h ago
Yep!
this_user•53m ago
I mean, it is called "ElasticSEARCH", not "Elasticdatabase".
_joel•33m ago
MySQL isn't mine either, it's Larry Ellison's.

Cloudflare acquires Astro

https://astro.build/blog/joining-cloudflare/
492•todotask2•5h ago•265 comments

STFU

https://github.com/Pankajtanwarbanna/stfu
315•tanelpoder•2h ago•186 comments

6-Day and IP Address Certificates Are Generally Available

https://letsencrypt.org/2026/01/15/6day-and-ip-general-availability
200•jaas•4h ago•114 comments

Michelangelo's first painting, created when he was 12 or 13

https://www.openculture.com/2026/01/discover-michelangelos-first-painting.html
205•bookofjoe•5h ago•124 comments

Just the Browser

https://justthebrowser.com/
376•cl3misch•7h ago•201 comments

Lock-Picking Robot

https://github.com/etinaude/Lock-Picking-Robot
171•p44v9n•4d ago•76 comments

Cursor's latest "browser experiment" implied success without evidence

https://embedding-shapes.github.io/cursor-implied-success-without-evidence/
137•embedding-shape•5h ago•63 comments

Launch HN: Indy (YC S21) – A support app designed for ADHD brains

https://www.shimmer.care/indy-redirect
42•christalwang•3h ago•44 comments

Elasticsearch Was Never a Database

https://www.paradedb.com/blog/elasticsearch-was-never-a-database
29•jamesgresql•4d ago•32 comments

Read_once(), Write_once(), but Not for Rust

https://lwn.net/SubscriberLink/1053142/8ec93e58d5d3cc06/
77•todsacerdoti•4h ago•22 comments

Zep AI (Agent Context Engineering, YC W24) Is Hiring Forward Deployed Engineers

https://www.ycombinator.com/companies/zep-ai/jobs/
1•roseway4•2h ago

Dell UltraSharp 52 Thunderbolt Hub Monitor

https://www.dell.com/en-us/shop/dell-ultrasharp-52-thunderbolt-hub-monitor-u5226kw/apd/210-bthw/m...
67•cebert•2h ago•69 comments

Dev-owned testing: Why it fails in practice and succeeds in theory

https://dl.acm.org/doi/10.1145/3780063.3780066
63•rbanffy•6h ago•83 comments

Show HN: 1Code – Open-source Cursor-like UI for Claude Code

https://github.com/21st-dev/1code
22•Bunas•1d ago•15 comments

Earth from Space: The Fate of a Giant

https://www.esa.int/ESA_Multimedia/Images/2026/01/Earth_from_Space_The_fate_of_a_giant
6•geox•1h ago•2 comments

Why DuckDB is my first choice for data processing

https://www.robinlinacre.com/recommend_duckdb/
108•tosh•8h ago•44 comments

Can You Disable Spotlight and Siri in macOS Tahoe?

https://eclecticlight.co/2026/01/16/can-you-disable-spotlight-and-siri-in-macos-tahoe/
71•chmaynard•4h ago•55 comments

Training my smartwatch to track intelligence

https://dmvaldman.github.io/rooklift/
112•dmvaldman•1d ago•50 comments

psc: The ps utility, with an eBPF twist and container context

https://github.com/loresuso/psc
56•tanelpoder•6h ago•19 comments

The Alignment Game

https://dmvaldman.github.io/alignment-game/
11•dmvaldman•19h ago•1 comments

OpenBSD-current now runs as guest under Apple Hypervisor

https://www.undeadly.org/cgi?action=article;sid=20260115203619
375•gpi•16h ago•51 comments

Feature Selection: A Primer

https://ikromshi.com/2025/12/30/feature-selection-primer.html
4•ikromshi•4d ago•0 comments

List of individual trees

https://en.wikipedia.org/wiki/List_of_individual_trees
314•wilson090•19h ago•102 comments

The wealth of the top% reaches a record $52T

https://www.cnbc.com/2025/10/03/the-wealth-of-the-top-1percent-reaches-a-record-52-trillion.html
13•thelastgallon•22m ago•4 comments

Zorgdomein Integration: A Guide to Secure .NET and Azure Architecture

https://plakhlani.in/healthcare/bidirectional-patient-data-exchange-with-zorgdomein/
10•prashantl•4d ago•7 comments

Our approach to advertising and expanding access to ChatGPT

https://openai.com/index/our-approach-to-advertising-and-expanding-access/
75•rvz•1h ago•43 comments

Interactive eBPF

https://ebpf.party/
173•samuel246•11h ago•8 comments

Emoji Use in the Electronic Health Record is Increasing

https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2843883
5•giuliomagnifico•1h ago•1 comments

Pocket TTS: A high quality TTS that gives your CPU a voice

https://kyutai.org/blog/2026-01-13-pocket-tts
593•pain_perdu•1d ago•142 comments

Canada slashes 100% tariffs on Chinese EVs to 6%

https://electrek.co/2026/01/16/canada-breaks-with-us-slashes-100-tariffs-chinese-evs/
305•1970-01-01•2h ago•348 comments