
TernFS – An exabyte scale, multi-region distributed filesystem

https://www.xtxmarkets.com/tech/2025-ternfs/
78•rostayob•1h ago

Comments

mrbluecoat•1h ago
Cool project and kudos for open sourcing it. Noteworthy limitation:

> TernFS should not be used for tiny files — our median file size is 2MB.

heipei•1h ago
Yeah, that was the first thing I checked as well. Being suited for small / tiny files is a great property of the SeaweedFS system.
pandemic_region•1h ago
What happens if you put a tiny file on it then? Bad perf, possible file corruption, ... ?
redundantly•1h ago
Probably wasted space and lower performance.
jleahy•1h ago
It's just not optimised for tiny files. It absolutely would work with no problems at all, and you could definitely use it to store 100 billion 1kB files with zero problems (and that is 100 terabytes of data, probably on flash, so no joke). However you can't use it to store 1 exabyte of 1 kilobyte files (at least not yet).
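For readers who want to check the arithmetic in the comment above, here is a minimal sketch. It assumes decimal (SI) units and, purely for illustration, treats the 2MB median file size quoted earlier as if every file were that size, which a median does not actually guarantee. The helper names are hypothetical and have nothing to do with TernFS itself.

```python
# Decimal (SI) units, matching the "1kB" / "100 terabytes" wording above.
KB = 10**3
MB = 10**6
TB = 10**12
EB = 10**18


def total_bytes(file_count: int, file_size: int) -> int:
    """Raw data volume for file_count files of file_size bytes each."""
    return file_count * file_size


def files_needed(capacity: int, file_size: int) -> int:
    """Number of files of file_size bytes needed to fill capacity bytes."""
    return capacity // file_size


# 100 billion files of 1 kB each.
small_file_volume = total_bytes(100 * 10**9, 1 * KB)
print(small_file_volume // TB, "TB")

# 1 exabyte made up entirely of 1 kB files.
print(files_needed(1 * EB, 1 * KB), "files")

# 1 exabyte made up of 2 MB files (assumption: median used as a uniform size).
print(files_needed(1 * EB, 2 * MB), "files")
```

The first case comes to 100 TB, the second needs 10^15 files, and the third needs 5 × 10^11 files, so the gap between the two exabyte cases is a factor of 2,000 in object count.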
jandrewrogers•35m ago
I have worked on exabyte-scale storage engines, and there is a good engineering reason for this type of limitation.

If you had a 1 KiB average file size, you would have quadrillions of metadata objects to quickly search and manage at fine granularity. The kinds of operations and coordination you need to do with metadata are difficult to achieve reliably when the metadata structure itself is many PB in size. There are many interesting edge cases that show up when you have to do deep paging of this metadata off of storage. Making this not slow requires unorthodox design choices that introduce a lot of complexity. Almost none of the metadata fits in memory, including many parts of conventional architectures that we assume will easily fit in memory.

A mere trillion objects is right around the limit of where the allocators, metadata, etc. can be made to scale with heroic efforts before conventional architectures break down and things start to become deeply weird on the software design side. Storage engines need to be reliable, so staying away from that design frontier makes a lot of sense if you can.

It is possible to break this barrier but it introduces myriad interesting design and computer science problems for which there is little literature.
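To put a rough number on "many PB" of metadata, here is a back-of-the-envelope sketch. The 256 bytes per metadata entry is an assumed figure chosen only for illustration; neither the comment nor the TernFS article states a per-object metadata size, and the function name is hypothetical.

```python
# Binary units, matching the "1 KiB" wording in the comment above.
KIB = 2**10
PIB = 2**50
EIB = 2**60

# ASSUMPTION: illustrative per-object metadata cost, not a measured value.
ASSUMED_BYTES_PER_ENTRY = 256


def metadata_bytes(capacity: int, avg_file_size: int, bytes_per_entry: int) -> int:
    """Estimate metadata size as one fixed-size entry per stored object."""
    object_count = capacity // avg_file_size
    return object_count * bytes_per_entry


footprint = metadata_bytes(1 * EIB, 1 * KIB, ASSUMED_BYTES_PER_ENTRY)
print(footprint // PIB, "PiB of metadata")
```

Under that assumption, 1 EiB of 1 KiB files means 2^50 objects and 256 PiB of metadata, a quarter of the data volume itself. A different per-entry size scales the result linearly.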

ttfvjktesd•1h ago
How does TernFS compare to CephFS, and why not use CephFS, since it has also been tested in the multi-petabyte range?
rostayob•1h ago
(Disclaimer: I'm one of the authors of TernFS, and while we evaluated Ceph, I am not intimately familiar with it.)

Main factors:

* Ceph stores both metadata and file contents using the same object store (RADOS). TernFS uses a specialized database for metadata which takes advantage of various properties of our datasets (immutable files, few moves between directories, etc.).

* While Ceph is capable of storing PBs, we currently store ~600PBs on a single TernFS deployment. Last time we checked this would be an order of magnitude more than even very large Ceph deployments.

* More generally, we wanted a system that we knew we could easily adapt to our needs and more importantly quickly fix when something went wrong, and we estimated that building out something new rather than adapting Ceph (or some other open source solution) would be less costly overall.

mgrandl•41m ago
There are definitely insanely large Ceph deployments. I have seen hundreds of PBs in production myself. Also, your use case sounds like something that should be quite manageable for Ceph due to its limited metadata activity, which tends to be the main pain point with CephFS.
kachapopopow•28m ago
Ceph is more of a "here's a raw block of data, do whatever the hell you want with it" system; it's not really good for immutable data.
rostayob•17m ago
I'm not fully up to date, since we looked into this a few years ago. At the time, the CERN deployments of Ceph were cited as particularly large examples, and they topped out at ~30PB.

Also note that when I say "single deployment" I mean that the full storage capacity is not subdivided in any way (i.e. there are no "zones" or "realms" or similar concepts). We wanted this to be the case after experiencing situations where we had significant overhead due to having to rebalance different storage buckets (albeit with a different piece of software, not Ceph).

If there are EB-scale Ceph deployments I'd love to hear more about them.

sreekanth850•49m ago
Wow, great project.
nunobrito•48m ago
Thanks for sharing.
bananapub•30m ago
seems like a colossusly nice design.
VikingCoder•11m ago
I see what you did there.
eigenvalue•4m ago
This sounds like it would be a good underpinning for a decentralized blockchain file storage system with its focus on immutability and redundancy.
