frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Poddley.com – Follow people, not podcasts

https://poddley.com/guests/ana-kasparian/episodes
1•onesandofgrain•7m ago•0 comments

Layoffs Surge 118% in January – The Highest Since 2009

https://www.cnbc.com/2026/02/05/layoff-and-hiring-announcements-hit-their-worst-january-levels-si...
4•karakoram•7m ago•0 comments

Papyrus 114: Homer's Iliad

https://p114.homemade.systems/
1•mwenge•7m ago•1 comments

DicePit – Real-time multiplayer Knucklebones in the browser

https://dicepit.pages.dev/
1•r1z4•7m ago•1 comments

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

https://arxiv.org/abs/2601.14340
2•PaulHoule•9m ago•0 comments

Show HN: AI Agent Tool That Keeps You in the Loop

https://github.com/dshearer/misatay
2•dshearer•10m ago•0 comments

Why Every R Package Wrapping External Tools Needs a Sitrep() Function

https://drmowinckels.io/blog/2026/sitrep-functions/
1•todsacerdoti•11m ago•0 comments

Achieving Ultra-Fast AI Chat Widgets

https://www.cjroth.com/blog/2026-02-06-chat-widgets
1•thoughtfulchris•12m ago•0 comments

Show HN: Runtime Fence – Kill switch for AI agents

https://github.com/RunTimeAdmin/ai-agent-killswitch
1•ccie14019•15m ago•1 comments

Researchers surprised by the brain benefits of cannabis usage in adults over 40

https://nypost.com/2026/02/07/health/cannabis-may-benefit-aging-brains-study-finds/
1•SirLJ•17m ago•0 comments

Peter Thiel warns the Antichrist, apocalypse linked to the 'end of modernity'

https://fortune.com/2026/02/04/peter-thiel-antichrist-greta-thunberg-end-of-modernity-billionaires/
1•randycupertino•17m ago•2 comments

USS Preble Used Helios Laser to Zap Four Drones in Expanding Testing

https://www.twz.com/sea/uss-preble-used-helios-laser-to-zap-four-drones-in-expanding-testing
2•breve•23m ago•0 comments

Show HN: Animated beach scene, made with CSS

https://ahmed-machine.github.io/beach-scene/
1•ahmedoo•24m ago•0 comments

An update on unredacting select Epstein files – DBC12.pdf liberated

https://neosmart.net/blog/efta00400459-has-been-cracked-dbc12-pdf-liberated/
2•ks2048•24m ago•0 comments

Was going to share my work

1•hiddenarchitect•27m ago•0 comments

Pitchfork: A devilishly good process manager for developers

https://pitchfork.jdx.dev/
1•ahamez•27m ago•0 comments

You Are Here

https://brooker.co.za/blog/2026/02/07/you-are-here.html
3•mltvc•31m ago•1 comments

Why social apps need to become proactive, not reactive

https://www.heyflare.app/blog/from-reactive-to-proactive-how-ai-agents-will-reshape-social-apps
1•JoanMDuarte•32m ago•1 comments

How patient are AI scrapers, anyway? – Random Thoughts

https://lars.ingebrigtsen.no/2026/02/07/how-patient-are-ai-scrapers-anyway/
1•samtrack2019•32m ago•0 comments

Vouch: A contributor trust management system

https://github.com/mitchellh/vouch
3•SchwKatze•33m ago•0 comments

I built a terminal monitoring app and custom firmware for a clock with Claude

https://duggan.ie/posts/i-built-a-terminal-monitoring-app-and-custom-firmware-for-a-desktop-clock...
1•duggan•34m ago•0 comments

Tiny C Compiler

https://bellard.org/tcc/
3•guerrilla•35m ago•0 comments

Y Combinator Founder Organizes 'March for Billionaires'

https://mlq.ai/news/ai-startup-founder-organizes-march-for-billionaires-protest-against-californi...
2•hidden80•35m ago•3 comments

Ask HN: Need feedback on the idea I'm working on

1•Yogender78•36m ago•0 comments

OpenClaw Addresses Security Risks

https://thebiggish.com/news/openclaw-s-security-flaws-expose-enterprise-risk-22-of-deployments-un...
2•vedantnair•36m ago•0 comments

Apple finalizes Gemini / Siri deal

https://www.engadget.com/ai/apple-reportedly-plans-to-reveal-its-gemini-powered-siri-in-february-...
1•vedantnair•37m ago•0 comments

Italy Railways Sabotaged

https://www.bbc.co.uk/news/articles/czr4rx04xjpo
16•vedantnair•37m ago•8 comments

Emacs-tramp-RPC: high-performance TRAMP back end using MsgPack-RPC

https://github.com/ArthurHeymans/emacs-tramp-rpc
1•fanf2•39m ago•0 comments

Nintendo Wii Themed Portfolio

https://akiraux.vercel.app/
2•s4074433•43m ago•2 comments

"There must be something like the opposite of suicide "

https://post.substack.com/p/there-must-be-something-like-the
2•rbanffy•45m ago•1 comments
Open in hackernews

Show HN: Lingo – A linguistic database in Rust with nanosecond-level performance

42•peerlesscasual•4mo ago
Hi HN, I made Lingo - the SQLite of semantic search.

I'm a self-taught developer and researcher who left school at 16, and I've spent some time exploring a first-principles approach to system design for various frontier problems. In this case it's AI that challenges the 'bigger is better' transformer paradigm.

Lingo is the first piece of that research, a high-performance linguistic database designed to run on-device.

The full technical overview and manifesto is here: https://medium.com/@robm.antunes/bcd1e9752af6

The paper has been archived on Zenodo with a DOI: https://doi.org/10.5281/zenodo.17196613

The code is open-source and can be found at https://github.com/RobAntunes/lingodb, it's currently broken and feature incomplete but I'm working on it - just wanted to start getting some feedback.

All benchmarks are reproducible from the repo and can also be found in the various texts.

As an independent without academic affiliation, I'd be incredibly grateful for your feedback! I'm here to answer any questions.

Cheers!

Comments

apavlo•4mo ago
> • Memory-Mapping (mmap): We treat the database file as if it’s already in memory, eliminating the distinction between disk and RAM.

Ugh, not another one...

0x264•4mo ago
Yep, another developer enthusiastically proposing mmap as an "easy win" for database design, when in reality it often causes hard-to-debug correctness and performance problems.
nurettin•4mo ago
To be fair, I use it to share financial time series between multiple processes and as long as there is a single writer it works well. Been in production since several years.
pclmulqdq•4mo ago
Creating a shared memory buffer by mapping it as a file is not the same as mapping files on disk. The latter has weird and subtle problems, whereas the former just works.
nurettin•4mo ago
To be clear, I am indeed doing mmap to the same file on disk. Not using shmap. But there is only one thread in one process writing to it and the readers are tolerant to millisecond delays.
pclmulqdq•4mo ago
> millisecond delays

I thought you said financial time series!

But yeah, this is a case where mmap works great - convenience, not super fast, single writer and not necessarily super durable.

nurettin•4mo ago
> I thought you said financial time series!

Yeah it is just your average normal financial time series.

madushan1000•4mo ago
Why not though, from what I can see from the docs, these databases supposed to be static and read only. At least when you use it on a device.
0xdeafbeef•4mo ago
Page cache reclamation is mostly single threaded. It's much simpler, than you can create in a user space, it has no weight for specific pages etc.

Traveling into kernel flushes branch predictor caches, tlb. So it's not free at all.

anonzzzies•4mo ago
No issue if you know what you are doing. Not sure about the author but I know very high perf mmap systems for decades without corruption / issues (in hft/finance/payments).
porridgeraisin•4mo ago
Ctrl-Fd you here the moment i saw that in the article
rzz3•4mo ago
Really impressive work :)
vouwfietsman•4mo ago
Ok, since you're looking for sincere feedback.

Great vision, challenging the "scale" of current AI solutions is super valid, if only for the reason that humans don't learn like this.

Architecture: despite other comments, I am not so bothered with MMAP (if read only) but rather with the performance claims. If your total db is 13kb you should be answering queries at amazing speeds, because you're just running code on in-cache data at that point. The performance claim at this point means nothing, because what you're doing is not performance intensive.

Claims: A frontal attack on the current paradigm would at least have to include real semantic queries, which I think is not currently what you're doing, you're just doing language analytics like NLP. This is maybe how you intend to solve semantic queries later, but since this is not what you're doing, I think that should be clear from the get-go. Especially because the "scale" of the current AI paradigm has nothing to do with how the tokenization happens, but rather with how the statistical model is trained to answer semantic queries.

Finally, the example of "Find all Greek-origin technical terms" is a poor one because it is exactly the kind of "knowledge graph" question that was answerable before the current AI hype.

Nevertheless, love the effort, good luck!

(oh and btw: I'm not an expert, so if any of this is wrong, please correct me)

sigfubar•4mo ago
The repo is 100% AI slop.

Advice to OP: lay off the Claude Code if your goal is to become an “independent researcher”. Claude doesn’t know what it’s doing, but it’s happy to lead you into a false sense of achievement because it’ll never tell you when you’re wrong, or when it’s wrong.

mpeg•4mo ago
Bizarre because a quick look at the code and commit log shows it was likely 100% coded by AI, so the author is not trying too hard to hide it, but they also seemed to forget to mention it anywhere in the README or the blog post.
rahkiin•4mo ago
Out of interest: can you elaborate how you analyzed the repo to come to this conclusion?
jdiez17•4mo ago
All of the code is imported in 1 commit. The rest of the commits are deleting the specs that I guess were used to generate the code. There’s one commit adding code which explicitly says generated by Claude code. There’s basically no chance the whole codebase is not AI slop.
thunfischbrot•4mo ago
For those interested in the referenced spec:

https://github.com/RobAntunes/lingodb/blob/e8e56a2b2dfe19a27...

mpeg•4mo ago
The specs themselves seem generated with LLMs too, as in https://github.com/RobAntunes/lingodb/blob/5e3834de648debf08... – overuse of emojis, excitement, etc
teiferer•4mo ago
Already the title of your submission does not check out. Do you know how many clock cycles a 1 GHz CPU realizes in one nanosecond? One. Just reading the input argument of a function takes a "nanosecond-scale" amount of time.

> I'm a self-taught developer and researcher who left school at 16, and I've spent some time exploring a first-principles approach to system design for various frontier problems.

As much as I appreciate new ways of thinking, whenever I read "first-principles approach", my alarm bells go off. More often than not it just means "I chose to ignore (or am too impatient to learn about) all insights that generations of research in this field have made". The "left school at 16" and "self-taught" parts also indicate that. This may explain the hyperbole of the title as well, as it does not pass the smell test.

If you are looking for advice, here is mine: try to not ignore those that came before you. Giants' shoulders are very wide, very high up and pretty solid. There is no shame in standing on them, but it takes effort to climb up.

ozgrakkurt•4mo ago
What an amazing comment, criticism on the title without going into any content with a side of character judgement
nitishr•4mo ago
Not too sure, reading with mmap is ok but simultaneous read/write operations are a bit tricky.
bitmagier•4mo ago
Summary from my side:

Outstanding features:

- way better representation (very information-dense) of different basic language properties directly as a storage layout property (which seems absolutely possible to me to achieve)

- attention (signal) as resonance: analog wave signal processing methods can be used -> way less computation power needed

Analysis: It will have the same fundamental limitations in terms of "understanding" and "thinking" as traditional LLMs, as its "knowledge" is still based on language itself. I believe it would be implemented in combination with other models, which supply nuances of actual content – namely traditional LLMs, which are focussed on written text as it appears. Nevertheless, it should add a high-quality and high-efficient building block for language processing to the landscape of LLMs. Furthermore it may also be a nice starting point for a general development towards rethinking architecture patterns in favor of lower resource consumption and fine quality of any kind of information.