Show HN: Id-agent – Token efficient UUID alternative for AI agents

18•pranshuchittora•1h ago

Comments

nither•46m ago

Smart idea but the concern can be that in the future, tokenization techniques and libraries may change. And also this looks like a very edge optimization to me. But overall, it deserve to exist. Good job.

whazor•45m ago

i would be afraid of accidental prompt injection

pranshuchittora•43m ago

Yes, that a valid point. That's why we have a verification method which can be part of the harness to make sure the ids are not hallucinated.

Falimonda•41m ago

Great point. Also, wouldn't this taint the context with semantic meaning that affects response?

felipeyanez•44m ago

any plans for a python port?

pranshuchittora•31m ago

Would love to, can you please create an issue on the GH repo.

Tiberium•43m ago

Is this just a reinvented humanhash?

pranshuchittora•30m ago

Kinda similar, but this is token efficient. Each word is ~1 BPE token

asdfsa32•29m ago

Everything is old is new.

Falimonda•43m ago

Benchmark comparing conventional UUID and AID across models, hallucination rate, token usage, would be cool!

simedw•42m ago

Nice package, not only is using words more token-efficient [saving time and money], but weaker models are also less likely to make mistakes when providing the key, at least in my tests.

That said, for `createAliasMap`, don't you think you could create a deterministic mapping from and to UUIDs <-> word chains? That way, no additional state would be needed. [Might require fairly long word chains...]

thrance•41m ago

An even better solution is to present the AI with local IDs and map those to UUIDs outside of its context. So when giving a list of items for the LLM to choose from, just list them with incremental numbers (1, 2, 3...) and ask for these numbers in tool schemas.

pranshuchittora•34m ago

Hello HN,

I have built id-agent to solve the 3 major pain points with using UUIDs with LLMs:

1. Increased token usage 2. Incorrect ids in the output tokens because of hallucination 3. Poor readability when debugging prompts and traces

While building agent-qa, I encountered challenges with ids, since each test, suite, and run artifact has a UUID associated with it. The AI agents often made mistakes and referenced incorrect entities. With id-agent, I was able to achieve a token-efficient way of defining unique ids which can be easily differentiated. The prefixed id path drastically reduces the hallucination rate.

Do check out agent-qa: Open-source Agentic QA Harness with Memory https://vostride.com/

railka•34m ago

Why do people choose the hyphen ("-") as the separator in an identifier? When double-clicking, the ID does not select completely, unlike when an underscore ("_") is used.

pranshuchittora•32m ago

Using "_" separator increases the token usage.

railka•30m ago

Ah, I understand, thank you for the answer!

pranshuchittora•29m ago

No worries, Checkout https://vostride.com/agent-qa to see how we are using this in production.

railka•31m ago

There is an example on GitHub with a prefix: "task_storm-delta-stone" (prefix: 'task'). Wouldn't it be more logical to have it reversed, like "task-storm_delta_stone"?

jy14898•32m ago

I don't like that they're not apples to apples; less bits so of course it'll take less tokens.

> Where UUIDs cost ~23 tokens and get hallucinated by LLMs

How does this solve the hallucination problem?

Just removing the - from the example UUID takes it from 26 tokens to 18

pranshuchittora•24m ago

LLMs are good at predicting words, since each word in the id is ~1 BPE token. But uuids are random hex characters, this is where LLMs struggle to output the right ids.

You can use the .from method https://github.com/vostride/id-agent/#idagentfrominput-opts

To convert uuid or any text to id-agent based id. Then do the LLM inference and then convert it back to UUID.

mrweasel•29m ago

Can someone explain why this would even be needed? Why is there a cost to generating say and UUIDv4? E.g. Claude Code has some regex in the client side code that filters out "bad words", so why can't the agent just generate UUIDs client side, using zero tokens.

I sort of get the "problem", but the fact that this is even needed is stupid.

tyleo•24m ago

Yeah, it doesn’t make a whole lot of sense. Over hundreds of hours of Claude Code use, I’ve never had this problem.

I feel like people just jam poorly specified input into LLMs and hope for the best. Then pile more tools on top when they don’t get what they want.

Slartie•17m ago

> I feel like people just jam poorly specified input into LLMs and hope for the best. Then pile more tools on top when they don’t get what they want.

People call this exact process "vibe coding".

baq•20m ago

the machines this is designed for are stupid. this makes them less stupid. do not anthropomorphize.

I can see this being useful when feeding raw table dump csvs into models, isomorphism means it's a simple pre-post processing step which could give you a cheap decrease of tokens and increase in accuracy.

sdevonoes•7m ago

You wrote a lot of things, but said nothing.

I guess you’re another bot

pranshuchittora•5m ago

Looks like it ;)

synthos•16m ago

Isn't this solving a subproblem of the overall issue of uncompressed tool call polluting context?

Furthermore, this could be compressed even further with a dynamic legend of every UUID in the context. So UUID@Bravo and UUID@Delta would be the actual symbols in the context but dynamically replaced when calling tools.

nkmnz•15m ago

Neat idea! I'd argue that the collision risk is basically zero because even though the entropy is lower, because you must validate the LLM-output anyways for two reasons:

1. LLMs might lack intrinsic entropy and reuse some UUIDs much more often.

2. Referential integrity is as important as collision resistance. An LLM must be able to reuse the correct id in the correct place.

On the other hand, using a dictionary for the ids helps with readability, but depending on the models strenghts, it might also add a confounder. After all, tokens that represent real words will probably influence the attention in a different way than random numbers.

New accessibility features powered by Apple Intelligence

Gaussian Splat of a Strawberry

Photo GIMP – A Patch for GIMP 3 for Photoshop Users

I Found Ultra-Pure Quantum Crystals in an Abandoned Mine in the Atacama Desert

Show HN: Id-agent – Token efficient UUID alternative for AI agents

Peter Neumann has died

Polypad

Click (2016)

Kv4p HT – A homebrew 1W radio (VHF or UHF) that plugs into an Android phone

Cursor Introduces Composer 2.5

Anthropic acquires Stainless

The lasting influence of Netscape Time

The last six months in LLMs in five minutes

PyTorch Landscape

1024000^2 Blocks, 2B2T Minecraft Server World Download Project, and Discoveries

Regex Chess: A 2-ply minimax chess engine in 84,688 regular expressions

Energy return in running shoes explained (2025)

Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised

We let AIs run radio stations

Nim-Presto – REST API Framework for Nim Language

Make ZIP files smaller with ZIP Shrinker

Pope Leo XIV’s first encyclical Magnifica humanitas to be published May 25

Show HN: Number Gacha, a gacha game distilled to its essence

Hyperpolyglot Lisp: Common Lisp, Racket, Clojure, Emacs Lisp

Show HN: Hsrs – Type-Safe Haskell Bindings Generator for Rust

AI eats the world (Spring 26) [pdf]

Elon Musk has lost his lawsuit against Sam Altman and OpenAI

Two computers, one monitor, zero fiddling (2025)

Peter Salus has died

Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment

New accessibility features powered by Apple Intelligence

Gaussian Splat of a Strawberry

Photo GIMP – A Patch for GIMP 3 for Photoshop Users

I Found Ultra-Pure Quantum Crystals in an Abandoned Mine in the Atacama Desert

Show HN: Id-agent – Token efficient UUID alternative for AI agents

Peter Neumann has died

Polypad

Click (2016)

Kv4p HT – A homebrew 1W radio (VHF or UHF) that plugs into an Android phone

Cursor Introduces Composer 2.5

Anthropic acquires Stainless

The lasting influence of Netscape Time

The last six months in LLMs in five minutes

PyTorch Landscape

1024000^2 Blocks, 2B2T Minecraft Server World Download Project, and Discoveries

Regex Chess: A 2-ply minimax chess engine in 84,688 regular expressions

Energy return in running shoes explained (2025)

Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised

We let AIs run radio stations

Nim-Presto – REST API Framework for Nim Language

Make ZIP files smaller with ZIP Shrinker

Pope Leo XIV’s first encyclical Magnifica humanitas to be published May 25

Show HN: Number Gacha, a gacha game distilled to its essence

Hyperpolyglot Lisp: Common Lisp, Racket, Clojure, Emacs Lisp

Show HN: Hsrs – Type-Safe Haskell Bindings Generator for Rust

AI eats the world (Spring 26) [pdf]

Elon Musk has lost his lawsuit against Sam Altman and OpenAI

Two computers, one monitor, zero fiddling (2025)

Peter Salus has died

Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment

Show HN: Id-agent – Token efficient UUID alternative for AI agents

Comments