frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Tiny C Compiler

https://bellard.org/tcc/
102•guerrilla•3h ago•44 comments

SectorC: A C Compiler in 512 bytes

https://xorvoid.com/sectorc.html
186•valyala•7h ago•34 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode
110•surprisetalk•7h ago•116 comments

Brookhaven Lab's RHIC concludes 25-year run with final collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
43•gnufx•6h ago•45 comments

Software factories and the agentic moment

https://factory.strongdm.ai/
130•mellosouls•10h ago•279 comments

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
880•klaussilveira•1d ago•269 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
129•vinhnx•10h ago•15 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
166•AlexeyBrin•12h ago•29 comments

The F Word

http://muratbuffalo.blogspot.com/2026/02/friction.html
97•zdw•3d ago•46 comments

FDA intends to take action against non-FDA-approved GLP-1 drugs

https://www.fda.gov/news-events/press-announcements/fda-intends-take-action-against-non-fda-appro...
60•randycupertino•2h ago•90 comments

First Proof

https://arxiv.org/abs/2602.05192
96•samasblack•9h ago•63 comments

The silent death of Good Code

https://amit.prasad.me/blog/rip-good-code
48•amitprasad•1h ago•44 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/
265•jesperordrup•17h ago•86 comments

I write games in C (yes, C) (2016)

https://jonathanwhiting.com/writing/blog/games_in_c/
167•valyala•7h ago•148 comments

Eigen: Building a Workspace

https://reindernijhoff.net/2025/10/eigen-building-a-workspace/
4•todsacerdoti•4d ago•1 comments

Al Lowe on model trains, funny deaths and working with Disney

https://spillhistorie.no/2026/02/06/interview-with-sierra-veteran-al-lowe/
85•thelok•9h ago•18 comments

Start all of your commands with a comma (2009)

https://rhodesmill.org/brandon/2009/commands-with-comma/
549•theblazehen•3d ago•203 comments

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

https://github.com/Momciloo/fun-with-clip-path
49•momciloo•7h ago•9 comments

Show HN: A luma dependent chroma compression algorithm (image compression)

https://www.bitsnbites.eu/a-spatial-domain-variable-block-size-luma-dependent-chroma-compression-...
26•mbitsnbites•3d ago•2 comments

Selection rather than prediction

https://voratiq.com/blog/selection-rather-than-prediction/
24•languid-photic•4d ago•6 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
246•1vuio0pswjnm7•13h ago•388 comments

Microsoft account bugs locked me out of Notepad – Are thin clients ruining PCs?

https://www.windowscentral.com/microsoft/windows-11/windows-locked-me-out-of-notepad-is-the-thin-...
79•josephcsible•5h ago•105 comments

Reinforcement Learning from Human Feedback

https://rlhfbook.com/
108•onurkanbkrc•12h ago•5 comments

Unseen Footage of Atari Battlezone Arcade Cabinet Production

https://arcadeblogger.com/2026/02/02/unseen-footage-of-atari-battlezone-cabinet-production/
137•videotopia•4d ago•44 comments

A Fresh Look at IBM 3270 Information Display System

https://www.rs-online.com/designspark/a-fresh-look-at-ibm-3270-information-display-system
57•rbanffy•4d ago•17 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
215•limoce•4d ago•123 comments

Coding agents have replaced every framework I used

https://blog.alaindichiappari.dev/p/software-engineering-is-back
303•alainrk•12h ago•480 comments

72M Points of Interest

https://tech.marksblogg.com/overture-places-pois.html
48•marklit•5d ago•9 comments

Where did all the starships go?

https://www.datawrapper.de/blog/science-fiction-decline
121•speckx•4d ago•183 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
294•isitcontent•1d ago•39 comments
Open in hackernews

Show HN: Yet another memory system for LLMs

https://github.com/trvon/yams
165•blackmanta•5mo ago
Built this for my LLM workflows - needed searchable, persistent memory that wouldn't blow up storage costs. I also wanted to use it locally for my research. It's a content-addressed storage system with block-level deduplication (saves 30-40% on typical codebases). I have integrated the CLI tool into most of my workflows in Zed, Claude Code, and Cursor, and I provide the prompt I'm currently using in the repo.

The project is in C++ and the build system is rough around the edges but is tested on macOS and Ubuntu 24.04.

Comments

winterrx•5mo ago
The domain listed on the GitHub repo redirects too many times.
blackmanta•5mo ago
That should be fixed now. It was a misconfiguration of CloudFlare SSL with GitHub Pages.
mempko•5mo ago
Wicked cool. Useful for single users. Any plans to build support for multiple users? Would be useful for an LLM project that requires per user sandboxing.
marcofiocco•5mo ago
What about versioning of files?
blackmanta•5mo ago
The tool has built-in versioning. Each file gets a unique SHA-256 hash on storage (automatic versioning), you can update metadata to track version info, and use collections/snapshots to group versions together. I have been using the metadata to track progress and link code snippets.
yawerali•5mo ago
Hader
JSR_FDED•5mo ago
Thanks, I learned a lot from this.
sitkack•5mo ago
How would you use the built in functionality to enable graph functionality? Metadata or another document used as the link or collection of links?
blackmanta•5mo ago
The graph functionality is exposed through the retrieval functionality. I may improve this later but the idea was to maximize getting the best results when looking for stored data.
sitkack•5mo ago
There is no built in graph functionality correct? But one could use existing mechanisms like metadata or storing the link between documents as a document itself?
blackmanta•5mo ago
The graph functionality is stubbed but I will expose it in a future update. You can also use metadata and tags for similar things.
retreatguru•5mo ago
How do you use this in your workflow? Please give some examples because it’s not clear to me what this is for.
blackmanta•5mo ago
I have been using it for task tracking, research, and code search. When using CLI tools, I found that the LLM's were able to find code in less tool calls when I stored my codebase in the tool. I had to wrangle the LLMs to use the tool verse native rgrep or find.

I am also trying to stabilize PDF text extraction to improve knowledge retrieval when I want to revisit a paper I read but cannot remember which one it was. Most of these use cases come from my personal use and updates to the tool but I am trying to make it as general as possible.

3abiton•5mo ago
This is an interesting approach! Why not offload PDF extraction to other frameorks that apply OCR pdf -> .md
blackmanta•5mo ago
I may explore this when I implement the vectordb implementation I started.
ActorNightly•5mo ago
>MCP server (requires Boost)

I see stuff like this, and I really have to wonder if people just write software with bloat for the sake of using a particular library.

pessimizer•5mo ago
Boost is a nearly 30 year old open source library that provides stuff for C++ that most standard libraries for other languages already have out of the box. You seem to think that it is hipster bullshit rather than almost a dinosaur itself.
SJC_Hacker•5mo ago
Blame the committee for refusing to include basic functionality like regular expressions , networking and threads as part of the STL
ActorNightly•5mo ago
I feel like there are pretty standard C++ server implementations that are less bloated.
SJC_Hacker•5mo ago
There might be, but as of a few years ago they were not mature and may not have captured the mindshare yet. Company I worked for actually used websocketpp because Boost ASIO implementation had some bug they couldn't work around, but then it was fixed and we dropped websocketpp.

I can say one the the nice thing about Boost network implementation (ASIO) is fairly mature asychronous framework using a variety of techniques. Also if you need HTTP or Websockets you can use Beast which is built on top of ASIO.

And if you're using one thing from Boost, its easy to just use everything else you need and that Boost provides to minimize dependencies.

menaerus•5mo ago
The reason for depending on Boost in this repo is just few search characters away - he needs HTTP/WebSocket implementation and Boost.Beast provides it. The actual bloat here in this repo is conan.
ActorNightly•5mo ago
My experience with Boost has been template metaprogramming hell.
SJC_Hacker•5mo ago
For its credit though, it follows the C++ "philosophy" fairly faithfully. If you don't like Boost you probably don't like C++ either.

Although that download is a monster, I think its like 1.6 GB even compressed. Its not modular at all, some of the modules depend on others and its impossible to separate them out (they've tried in the past)

But last I check there is ALOT they could have removed, especially support for older compilers like MSVC 200x (!), pre C++ 11/older GNU compilers, etc. without compromising functionality. I'm not if they got around to doing that.

noodletheworld•5mo ago
? Are you complaining about MCP or boost?

It’s an optional component.

What do you want the OP to do?

MCP may not be strictly necessary but it’s straight in line with the intent of the library.

Are you going to take shots at llama.cpp for having an http server and a template library next?

Come on. This uses conan, it has a decent cmake file. The code is ok.

This is pretty good work. Dont be a dick. (Yeah, ill eat the down votes, it deserves to be said)

airstrike•5mo ago
This feels like a shallow dismissal, which is frowned upon per the HN guidelines
yard2010•5mo ago
I'm puzzled - where are the header files?
paffdragon•5mo ago
You mean these? https://github.com/trvon/yams/tree/main/include/yams
vira28•5mo ago
How does this compare to Letta?
rkunnamp•5mo ago
Thank you for sharing this. Sorry for a possible noob question. How are embedding generated? Does it use a hosted embedding model? (I was trying to understand how is semantic search implemented)
sync•5mo ago
It, uh... generates mock embeddings? https://github.com/trvon/yams/blob/c89798d6d2de89caacdbe50d2...

(seems like there's some vague future plans for models like all-MiniLM-L6-v2, all-mpnet-base-v2)

pbronez•5mo ago
Hmm I wonder how much that effects the compression benefits of block level duplication. The mock embeddings choose vector elements from a normal distribution, so it’s far from uniform
huqedato•5mo ago
In my RAG I use qdrant w/ Redis. Very successfully. I don't really see the use of "another memory system for LLM", perhaps I'm missing something.
jerpint•5mo ago
I also developed yet another memory system !

https://github.com/jerpint/context-llemur

Although I developed it explicitly without search, and catered it to the latest agents which are all really good at searching and reading files. Instead you and LLMs cater your context to be easily searchable (folders and files). It’s meant for dev workflows (i.e a projects context, a user context)

I made a video showing how easy it is to pull in context to whatever IDE/desktop app/CLI tool you use

https://m.youtube.com/watch?v=DgqlUpnC3uw

elpocko•5mo ago
>block-level deduplication (saves 30-40% on typical codebases)

How is savings of 40% on a typical codebase possible with block-level deduplication? What kind of blocks are you talking about? Blocks as in the filesystem?

blackmanta•5mo ago
I am working to improve the CLI tools to make getting this information easier but I have stored the yam repo in yams with multiple snapshots and metadata tags and I am seeing about 32% storage savings.
elpocko•5mo ago
Cool. I have no idea what "stored the yam repo in yams" means. What do you mean by "block-level deduplication"? What is a block?
blackmanta•5mo ago
I stored the codebase for yams in the tool. The "blocks" are content-defined blocks/chunks, not filesystem blocks. They're variable-size chunks (typically 4-64KB) created using Rabin fingerprinting to find natural content boundaries. This enables deduplication across files that share similar content.
A4ET8a8uTh0_v2•5mo ago
I like it and I will be perusing your code for what could be used in my 'not yet working' variant.
skyzouwdev•5mo ago
That sounds like a practical take on LLM memory — especially the block-level deduplication part.

Most “memory” layers I’ve seen for AI are either overly complex or end up ballooning storage costs over time, so a content-addressed approach makes a lot of sense.

Also curious — have you benchmarked retrieval speed compared to more traditional vector DB setups? That could be a big selling point for devs running local research workflow

blackmanta•5mo ago
I have not, but that is something I plan to do when I have time.
izabera•5mo ago
not trying to be a hater but how is 100mb/s high performance in 2025? that's as performant as a 20 years old hdd
blackmanta•5mo ago
The system is honestly tuned for storage efficiency not speed but these configurations are tunable and you can use the benchmarks as a reference for tuning. https://github.com/trvon/yams/blob/main/docs/benchmarks/perf...
threecheese•5mo ago
Reviewing the prompts, looks like you are using this CAS tool as a global context data manager, supporting primarily a code use case. There are a number of extant MCP-capable code understanding tools (Serena and others), but what I am lacking in my CLI toolchain is non-code memory. You even called this out in another thread, mentioning task management- I find that the type of memory I need is not scoped to a code module, but an agent session - specifically to the orchestration of many agent sessions. What we have today are techniques, using a bunch of hacked together context files for sessions (tasks.md, changes.md), for agents (roles.md), for tech (architecture.md), etc etc, hoping that our prompts guide the agent to use them, and this is IMO a natural place for some abstraction over memory that can provide rigor.

I am observing in my professional (non-Claude Max) life that context is a real limiter, from both the “too much is confusing the agent” and “I’m hitting limits doing basic shit” perspectives (looking at you, Bedrock and Github), and having a tool that will help me give an agent only what it needs would be really valuable. I could do more with the tools, spend less time trying to manually intervene, and spend less of my token budget.

blackmanta•5mo ago
While the examples and provided prompt lean toward code (since that's my personal use case), YAMS is fundamentally a generic content-addressed storage system.

I will attempt to run some small agents with custom prompts and report back.

yukukotani•5mo ago
Cool! Any plan to support shared storage like cloud RDBs or S3?
blackmanta•5mo ago
I will look into adding this in a future update.