frontpage.

Your File System Is Already A Graph Database

https://rumproarious.com/2026/04/04/your-file-system-is-already-a-graph-database/
39•alxndr•2d ago

Comments

alxndr•2d ago
> […] the knowledge base isn’t just for research. It’s a context engineering system. You’re building the exact input your LLM needs to do useful work.

> […] there’s a real difference between prompting “help me write a design doc for a rate limiting service” and prompting an LLM that has access to your project folder with six months of meeting notes, three prior design docs, the Slack thread where the team debated the approach, and your notes on the existing architecture.

WillAdams•1h ago
I've found a similar structure along with a naming convention useful at my day job --- the big thing is the names are such that when copied as a filepath, the filepath and extension deleted, and underscores replaced by tabs, the text may then be pasted into a spreadsheet and summed up or otherwise manipulated.

In somewhat of an inversion, I've been getting the initial naming done by an LLM (well, I was, until CoPilot imposed file upload limits and the new VPN blocked access to it) --- for want of that, I just name each scan by Invoice ID, then use a .bat file made by concatenating columns in a spreadsheet to rename them to the initial state ready for entry.
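A minimal sketch of the naming trick described above, assuming a hypothetical `Date_Vendor_InvoiceID_Amount` field order (the comment does not say which fields the names actually contain). The function names, the example path, and the `.pdf` extension are all illustrative assumptions.

```python
from pathlib import PureWindowsPath


def name_to_row(filepath: str) -> str:
    """Drop the directory and extension, then turn underscores into tabs.

    The result can be pasted into a spreadsheet as one row.
    """
    stem = PureWindowsPath(filepath).stem  # removes the path and the last extension
    return stem.replace("_", "\t")


def rename_command(old_name: str, fields: list[str], ext: str = ".pdf") -> str:
    """Build one `ren` line for a .bat file from spreadsheet columns.

    `fields` stands in for the concatenated spreadsheet cells (hypothetical layout).
    """
    new_name = "_".join(fields) + ext
    return f'ren "{old_name}" "{new_name}"'


# Hypothetical example data, not taken from the thread.
row = name_to_row(r"C:\scans\2026-04-01_AcmeCorp_INV1001_250.00.pdf")
command = rename_command("INV1001.pdf", ["2026-04-01", "AcmeCorp", "INV1001", "250.00"])
amount = float(row.split("\t")[-1])  # the last column can be summed across rows
```

Keeping every field in the filename means the folder listing doubles as the data source, so no separate index has to be kept in sync.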

embedding-shape•50m ago
I've been playing around with the same, but trying to use local models, as my Obsidian vault obviously contains a bunch of private things I'm not willing to share with for-profit companies. I have yet to find any model that comes close to working as well as just Codex or CC with the small models, even with 96GB of VRAM to play around with.

I've started to think that a fine-tuned model may be needed, specifically for "journal data retrieval" or something like that. Is anyone aware of any existing models for things like this? I'd do it myself, but since I'm unwilling to send larger parts of my data to third parties, I'm struggling to collect actual data I could use for fine-tuning, which leaves me in a bit of a catch-22.

For some client projects I've experimented with the same idea too, with fewer restrictions. One valuable lesson is that letting LLMs write docs and add them to a "knowledge repository" tends to end up in a mess. The best success we've had is limiting the LLMs' job to organizing and moving things around, never adding their own written text. Their output quality seems to slowly degrade as their context fills up with their own text, compared to when they rely only on human-written notes.

exossho•15m ago
I can't remember how many file structures I've already tried... LLMs seem to be a great help here. Also used CC to organize my messy harddrive.

Now just need to find a good way to maintain the order...

freedomben•12m ago
> Also used CC to organize my messy harddrive.

Do you still have your prompt by chance, and are you willing to share it? I took a stab at this and it didn't want to make many changes. I think I need to be more specific, but I'm not sure how to do that in a general way.

itake•5m ago
I wonder, though:

1. Why does AI need that folder structure? Why not a flat list of files, and let the AI agent explore with BM25 / grep, etc.?

2. Pre-computed compression vs. computing at query time.

Karpathy (and you) are recommending pre-compressing and sorting the data into human-friendly buckets and language, based on hard-coded human opinions about abstraction that may match how the data ends up being queried.

Why not just let the AI calculate this at run time? Many of these use cases have very few files, and for a low-traffic knowledge store it probably costs fewer tokens if you only tokenize the files you need.
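To make the "flat list of files plus BM25" option concrete, here is a small sketch of query-time ranking, assuming the notes are already loaded into a dict of filename to text. It uses the Okapi BM25 formula with the non-negative `log(1 + ...)` IDF variant; `k1=1.5` and `b=0.75` are common defaults rather than anything stated in the thread, and the file names and contents are invented for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_rank(docs: dict[str, str], query: str, k1: float = 1.5, b: float = 0.75):
    """Rank documents against a query with Okapi BM25.

    Returns a list of (name, score) pairs, highest score first.
    """
    tokens = {name: tokenize(text) for name, text in docs.items()}
    n_docs = len(tokens)
    if n_docs == 0:
        return []
    # Average document length; fall back to 1 if every document is empty.
    avg_len = sum(len(t) for t in tokens.values()) / n_docs or 1.0

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for toks in tokens.values():
        df.update(set(toks))

    query_terms = tokenize(query)
    scores = {}
    for name, toks in tokens.items():
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            length_norm = k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scores[name] = score
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical flat list of notes, invented for this example.
notes = {
    "meeting-notes.md": "Team debated the rate limiting approach: token bucket versus sliding window.",
    "design-doc.md": "Design doc for the billing service and its invoice pipeline.",
    "architecture.md": "Notes on the existing architecture: gateway, queue, workers.",
}
ranked = bm25_rank(notes, "rate limiting")
```

With a ranking like this, an agent would only need to read (and tokenize) the top few files per query, which is the cost trade-off the comment raises against pre-sorting everything into folders.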

The Git Commands I Run Before Reading Any Code

https://piechowski.io/post/git-commands-before-reading-code/
320•grepsedawk•3h ago•73 comments

Veracrypt Project Update

https://sourceforge.net/p/veracrypt/discussion/general/thread/9620d7a4b3/
439•super256•4h ago•123 comments

I've sold out

https://mariozechner.at/posts/2026-04-08-ive-sold-out/
110•doppp•2h ago•69 comments

Revision Demoparty 2026: Razor1911 [video]

https://www.youtube.com/watch?v=Lw4W9V57SKs&t=5716s
218•tetrisgm•6h ago•79 comments

Project Glasswing: Securing critical software for the AI era

https://www.anthropic.com/glasswing
1333•Ryan5453•18h ago•672 comments

Lunar Flyby

https://www.nasa.gov/gallery/lunar-flyby/
775•kipi•21h ago•185 comments

Your File System Is Already A Graph Database

https://rumproarious.com/2026/04/04/your-file-system-is-already-a-graph-database/
39•alxndr•2d ago•6 comments

Show HN: We built a camera only robot vacuum for less than 300$ (Well almost)

https://indraneelpatil.github.io/blog/2026/robot-vacuum/
45•indraneelpatil•2d ago•8 comments

Audio Reactive LED Strips Are Diabolically Hard

https://scottlawsonbc.com/post/audio-led
28•surprisetalk•22h ago•4 comments

Protect your shed

https://dylanbutler.dev/blog/protect-your-shed/
192•baely•9h ago•53 comments

System Card: Claude Mythos Preview [pdf]

https://www-cdn.anthropic.com/53566bf5440a10affd749724787c8913a2ae0841.pdf
735•be7a•17h ago•529 comments

GLM-5.1: Towards Long-Horizon Tasks

https://z.ai/blog/glm-5.1
553•zixuanlimit•19h ago•226 comments

Slightly safer vibecoding by adopting old hacker habits

http://addxorrol.blogspot.com/2026/03/slightly-safer-vibecoding-by-adopting.html
126•transpute•5d ago•67 comments

Native Americans had dice 12k years ago

https://www.nbcnews.com/science/science-news/native-americans-dice-games-probability-study-rcna26...
82•delichon•4d ago•32 comments

How to get better at guitar

https://www.jakeworth.com/posts/how-to-get-better-at-guitar/
366•jwworth•2d ago•179 comments

Cambodia unveils statue to honour famous landmine-sniffing rat

https://www.bbc.com/news/articles/c0rx7xzd10xo
418•speckx•18h ago•94 comments

Explore union types in C# 15

https://devblogs.microsoft.com/dotnet/csharp-15-union-types/
17•0x00C0FFEE•3d ago•1 comments

S3 Files

https://www.allthingsdistributed.com/2026/04/s3-files-and-the-changing-face-of-s3.html
319•werner•16h ago•92 comments

A truck driver spent 20 years making a scale model of every building in NYC

https://www.smithsonianmag.com/smart-news/a-truck-drive-spent-20-years-making-this-astonishing-sc...
349•1659447091•2d ago•60 comments

Mario and Earendil

https://lucumr.pocoo.org/2026/4/8/mario-and-earendil/
13•doppp•3h ago•2 comments

Show HN: An interactive map of Tolkien's Middle-earth

https://middle-earth-interactive-map.web.app/
222•frasermarlow•15h ago•41 comments

Binary obfuscation used in AAA Games

https://blog.farzon.org/2026/04/binary-obfuscation-that-doesnt-kill-lto.html
108•noztol•2d ago•48 comments

Cloudflare targets 2029 for full post-quantum security

https://blog.cloudflare.com/post-quantum-roadmap/
345•ilreb•22h ago•108 comments

The Clock

https://blog.senko.net/the-clock
83•senko•4d ago•29 comments

US and Iran agree to provisional ceasefire

https://www.theguardian.com/us-news/2026/apr/07/trump-iran-war-ceasefire
517•g-b-r•13h ago•1543 comments

A database of analog cameras that can be 3D printed

https://printed.analogcamera.space/
120•thomasjb•5d ago•16 comments

Xilem – An experimental Rust native UI framework

https://github.com/linebender/xilem
114•Levitating•12h ago•47 comments

Hobby CNC machining and resin casting (2015)

https://lcamtuf.coredump.cx/gcnc/
22•achierius•3d ago•7 comments

Rescuing old printers with an in-browser Linux VM bridged to WebUSB over USB/IP

https://printervention.app/details
209•gmac•19h ago•86 comments

JSIR: A High-Level IR for JavaScript

https://discourse.llvm.org/t/rfc-jsir-a-high-level-ir-for-javascript/90456
68•nnx•11h ago•20 comments