frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

New accessibility features powered by Apple Intelligence

https://www.apple.com/newsroom/2026/05/apple-unveils-new-accessibility-features-and-updates-with-...
44•interpol_p•31m ago•3 comments

Gaussian Splat of a Strawberry

https://superspl.at/scene/84df8849
150•danybittel•1h ago•51 comments

Photo GIMP – A Patch for GIMP 3 for Photoshop Users

https://github.com/Diolinux/PhotoGIMP
66•SockThief•2d ago•34 comments

I Found Ultra-Pure Quantum Crystals in an Abandoned Mine in the Atacama Desert

https://medium.com/@breid.at/ultra-pure-quantum-crystals-from-an-abandoned-mine-in-a-mysterious-d...
111•vi_sextus_vi•2d ago•29 comments

Show HN: Id-agent – Token efficient UUID alternative for AI agents

https://github.com/vostride/id-agent
18•pranshuchittora•1h ago•29 comments

Peter Neumann has died

https://www.tuhs.org/pipermail/tuhs/2026-May/033748.html
182•pabs3•9h ago•13 comments

Polypad

https://polypad.amplify.com/
106•ivank•2d ago•8 comments

Click (2016)

https://clickclickclick.click/
323•andrewzeno•13h ago•82 comments

Kv4p HT – A homebrew 1W radio (VHF or UHF) that plugs into an Android phone

https://www.kv4p.com/
108•krupan•2d ago•36 comments

Cursor Introduces Composer 2.5

https://cursor.com/blog/composer-2-5
175•asar•19h ago•140 comments

Anthropic acquires Stainless

https://www.anthropic.com/news/anthropic-acquires-stainless
476•tomeraberbach•19h ago•332 comments

The lasting influence of Netscape Time

https://thehistoryoftheweb.com/the-lasting-influence-of-netscape-time/
54•zdw•2d ago•10 comments

The last six months in LLMs in five minutes

https://simonwillison.net/2026/May/19/5-minute-llms/
515•yakkomajuri•11h ago•407 comments

PyTorch Landscape

https://pytorch.landscape2.io
51•salamo•8h ago•14 comments

1024000^2 Blocks, 2B2T Minecraft Server World Download Project, and Discoveries

https://github.com/2b2tplace/1m_release
153•exploraz•22h ago•93 comments

Regex Chess: A 2-ply minimax chess engine in 84,688 regular expressions

https://nicholas.carlini.com/writing/2025/regex-chess.html
143•surprisetalk•4d ago•36 comments

Energy return in running shoes explained (2025)

https://runrepeat.com/guides/energy-return-in-running-shoes
20•jstrieb•1d ago•10 comments

Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised

https://safedep.io/mini-shai-hulud-strikes-again-314-npm-packages-compromised/
135•theanonymousone•7h ago•74 comments

We let AIs run radio stations

https://andonlabs.com/blog/andon-fm
289•lukaspetersson•18h ago•221 comments

Nim-Presto – REST API Framework for Nim Language

https://github.com/status-im/nim-presto
4•TheWiggles•1d ago•0 comments

Make ZIP files smaller with ZIP Shrinker

https://evanhahn.com/make-zip-files-smaller-with-zip-shrinker/
41•zdw•2d ago•25 comments

Pope Leo XIV’s first encyclical Magnifica humanitas to be published May 25

https://www.vaticannews.va/en/pope/news/2026-05/pope-leo-xiv-first-encyclical-magnifica-humanitas...
246•cucho•13h ago•161 comments

Show HN: Number Gacha, a gacha game distilled to its essence

https://isabisabel.com/gacha/
178•babel16•5d ago•71 comments

Hyperpolyglot Lisp: Common Lisp, Racket, Clojure, Emacs Lisp

https://hyperpolyglot.org/lisp
168•veqq•17h ago•41 comments

Show HN: Hsrs – Type-Safe Haskell Bindings Generator for Rust

https://github.com/harmont-dev/hsrs
38•suis_siva•8h ago•3 comments

AI eats the world (Spring 26) [pdf]

https://static1.squarespace.com/static/50363cf324ac8e905e7df861/t/6a0af5d0484fbf5fe9a7743e/177910...
257•topherjaynes•23h ago•140 comments

Elon Musk has lost his lawsuit against Sam Altman and OpenAI

https://techcrunch.com/2026/05/18/elon-musk-has-lost-his-lawsuit-against-sam-altman-and-openai/
993•nycdatasci•18h ago•499 comments

Two computers, one monitor, zero fiddling (2025)

https://alexplescan.com/posts/2025/08/16/kvm/
236•ankitg12•3d ago•134 comments

Peter Salus has died

https://www.tuhs.org/pipermail/tuhs/2026-May/033750.html
150•speckx•9h ago•12 comments

Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment

https://arxiv.org/abs/2601.10160
61•anigbrowl•15h ago•25 comments
Open in hackernews

Show HN: Id-agent – Token efficient UUID alternative for AI agents

https://github.com/vostride/id-agent
18•pranshuchittora•1h ago

Comments

nither•46m ago
Smart idea but the concern can be that in the future, tokenization techniques and libraries may change. And also this looks like a very edge optimization to me. But overall, it deserve to exist. Good job.
whazor•45m ago
i would be afraid of accidental prompt injection
pranshuchittora•43m ago
Yes, that a valid point. That's why we have a verification method which can be part of the harness to make sure the ids are not hallucinated.
Falimonda•41m ago
Great point. Also, wouldn't this taint the context with semantic meaning that affects response?
felipeyanez•44m ago
any plans for a python port?
pranshuchittora•31m ago
Would love to, can you please create an issue on the GH repo.
Tiberium•43m ago
Is this just a reinvented humanhash?
pranshuchittora•30m ago
Kinda similar, but this is token efficient. Each word is ~1 BPE token
asdfsa32•29m ago
Everything is old is new.
Falimonda•43m ago
Benchmark comparing conventional UUID and AID across models, hallucination rate, token usage, would be cool!
simedw•42m ago
Nice package, not only is using words more token-efficient [saving time and money], but weaker models are also less likely to make mistakes when providing the key, at least in my tests.

That said, for `createAliasMap`, don't you think you could create a deterministic mapping from and to UUIDs <-> word chains? That way, no additional state would be needed. [Might require fairly long word chains...]

thrance•41m ago
An even better solution is to present the AI with local IDs and map those to UUIDs outside of its context. So when giving a list of items for the LLM to choose from, just list them with incremental numbers (1, 2, 3...) and ask for these numbers in tool schemas.
pranshuchittora•34m ago
Hello HN,

I have built id-agent to solve the 3 major pain points with using UUIDs with LLMs:

1. Increased token usage 2. Incorrect ids in the output tokens because of hallucination 3. Poor readability when debugging prompts and traces

While building agent-qa, I encountered challenges with ids, since each test, suite, and run artifact has a UUID associated with it. The AI agents often made mistakes and referenced incorrect entities. With id-agent, I was able to achieve a token-efficient way of defining unique ids which can be easily differentiated. The prefixed id path drastically reduces the hallucination rate.

Do check out agent-qa: Open-source Agentic QA Harness with Memory https://vostride.com/

railka•34m ago
Why do people choose the hyphen ("-") as the separator in an identifier? When double-clicking, the ID does not select completely, unlike when an underscore ("_") is used.
pranshuchittora•32m ago
Using "_" separator increases the token usage.
railka•30m ago
Ah, I understand, thank you for the answer!
pranshuchittora•29m ago
No worries, Checkout https://vostride.com/agent-qa to see how we are using this in production.
railka•31m ago
There is an example on GitHub with a prefix: "task_storm-delta-stone" (prefix: 'task'). Wouldn't it be more logical to have it reversed, like "task-storm_delta_stone"?
jy14898•32m ago
I don't like that they're not apples to apples; less bits so of course it'll take less tokens.

> Where UUIDs cost ~23 tokens and get hallucinated by LLMs

How does this solve the hallucination problem?

Just removing the - from the example UUID takes it from 26 tokens to 18

pranshuchittora•24m ago
LLMs are good at predicting words, since each word in the id is ~1 BPE token. But uuids are random hex characters, this is where LLMs struggle to output the right ids.

You can use the .from method https://github.com/vostride/id-agent/#idagentfrominput-opts

To convert uuid or any text to id-agent based id. Then do the LLM inference and then convert it back to UUID.

mrweasel•29m ago
Can someone explain why this would even be needed? Why is there a cost to generating say and UUIDv4? E.g. Claude Code has some regex in the client side code that filters out "bad words", so why can't the agent just generate UUIDs client side, using zero tokens.

I sort of get the "problem", but the fact that this is even needed is stupid.

tyleo•24m ago
Yeah, it doesn’t make a whole lot of sense. Over hundreds of hours of Claude Code use, I’ve never had this problem.

I feel like people just jam poorly specified input into LLMs and hope for the best. Then pile more tools on top when they don’t get what they want.

Slartie•17m ago
> I feel like people just jam poorly specified input into LLMs and hope for the best. Then pile more tools on top when they don’t get what they want.

People call this exact process "vibe coding".

baq•20m ago
the machines this is designed for are stupid. this makes them less stupid. do not anthropomorphize.

I can see this being useful when feeding raw table dump csvs into models, isomorphism means it's a simple pre-post processing step which could give you a cheap decrease of tokens and increase in accuracy.

sdevonoes•7m ago
You wrote a lot of things, but said nothing.

I guess you’re another bot

pranshuchittora•5m ago
Looks like it ;)
synthos•16m ago
Isn't this solving a subproblem of the overall issue of uncompressed tool call polluting context?

Furthermore, this could be compressed even further with a dynamic legend of every UUID in the context. So UUID@Bravo and UUID@Delta would be the actual symbols in the context but dynamically replaced when calling tools.

nkmnz•15m ago
Neat idea! I'd argue that the collision risk is basically zero because even though the entropy is lower, because you must validate the LLM-output anyways for two reasons:

1. LLMs might lack intrinsic entropy and reuse some UUIDs much more often.

2. Referential integrity is as important as collision resistance. An LLM must be able to reuse the correct id in the correct place.

On the other hand, using a dictionary for the ids helps with readability, but depending on the models strenghts, it might also add a confounder. After all, tokens that represent real words will probably influence the attention in a different way than random numbers.