I'm a self-taught developer and researcher who left school at 16, and I've spent some time exploring a first-principles approach to system design for various frontier problems. In this case it's AI that challenges the 'bigger is better' transformer paradigm.
Lingo is the first piece of that research, a high-performance linguistic database designed to run on-device.
The full technical overview and manifesto is here: https://medium.com/@robm.antunes/bcd1e9752af6
The paper has been archived on Zenodo with a DOI: https://doi.org/10.5281/zenodo.17196613
The code is open-source and can be found at https://github.com/RobAntunes/lingodb, it's currently broken and feature incomplete but I'm working on it - just wanted to start getting some feedback.
All benchmarks are reproducible from the repo and can also be found in the various texts.
As an independent without academic affiliation, I'd be incredibly grateful for your feedback! I'm here to answer any questions.
Cheers!
apavlo•4mo ago
Ugh, not another one...
0x264•4mo ago
nurettin•4mo ago
pclmulqdq•4mo ago
nurettin•4mo ago
pclmulqdq•4mo ago
I thought you said financial time series!
But yeah, this is a case where mmap works great - convenience, not super fast, single writer and not necessarily super durable.
nurettin•4mo ago
Yeah it is just your average normal financial time series.
madushan1000•4mo ago
0xdeafbeef•4mo ago
Traveling into kernel flushes branch predictor caches, tlb. So it's not free at all.
anonzzzies•4mo ago
porridgeraisin•4mo ago