How it works:
- the player first sees a "warmup" question, which is really a prompt for a future game date; I collect everyone's answers and feed them to an LLM, which generates the answer buckets for that future question
- the player then moves on to the actual game, trying to guess the most common answers; every submission goes to an LLM, which judges how closely it matches the predefined buckets/answers
Some interesting learnings while building this:
- the LLMs do a pretty decent first pass (both in creating the answers and judging them), but the last 20% of the work is serious fine-tuning to avoid hallucinations, strange inconsistencies, etc
- the answer-creation LLM has a tough job: it has to take the responses and create workable buckets that (a) aren't too broad and (b) are distinct from each other, which is surprisingly challenging; it uses pairwise cosine similarity (how closely two embedding vectors point in the same direction) and Jaccard similarity (how much two sets overlap) to catch near-duplicate buckets (see the first sketch below); there's still lots of work to be done here, as I still see buckets that are too encompassing and share sizable overlap with other buckets
- the judging LLM applies answer normalization rules (e.g. plural -> singular, stripping special characters, handling typos via Levenshtein distance) and matching logic that combines cosine similarity with a check on whether the guess is a hypernym or hyponym of the bucket; we want guesses to be at least as specific as the bucket (guess == "truck", bucket == "vehicle" GOOD; guess == "vehicle", bucket == "truck" BAD); the second sketch below shows this
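To make the bucket-overlap check concrete, here's a minimal sketch of the pairwise comparison. The bucket shape, the toy embeddings, and the thresholds are all made up for illustration; the real pipeline would embed bucket labels/members with an actual embedding model and tune the cutoffs empirically:

```python
from itertools import combinations

import numpy as np

def cosine_sim(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors (1.0 = same direction)."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def jaccard_sim(a: set[str], b: set[str]) -> float:
    """Jaccard similarity between two sets of raw responses (|A ∩ B| / |A ∪ B|)."""
    return len(a & b) / len(a | b) if a | b else 0.0

# Hypothetical bucket shape: a label embedding plus the raw responses
# the LLM assigned to that bucket. Real embeddings are much higher-dimensional.
buckets = {
    "cars":     {"embedding": np.array([0.9, 0.1, 0.0]), "responses": {"car", "sedan", "truck"}},
    "vehicles": {"embedding": np.array([0.8, 0.2, 0.1]), "responses": {"car", "truck", "bus"}},
    "animals":  {"embedding": np.array([0.0, 0.1, 0.9]), "responses": {"dog", "cat"}},
}

COSINE_THRESHOLD = 0.85   # illustrative cutoffs, tuned empirically in practice
JACCARD_THRESHOLD = 0.5

# Flag pairs of buckets that overlap too much; flagged pairs get merged
# or sent back to the LLM for another pass.
for (name_a, a), (name_b, b) in combinations(buckets.items(), 2):
    cos = cosine_sim(a["embedding"], b["embedding"])
    jac = jaccard_sim(a["responses"], b["responses"])
    if cos > COSINE_THRESHOLD or jac > JACCARD_THRESHOLD:
        print(f"overlap: {name_a} vs {name_b} (cosine={cos:.2f}, jaccard={jac:.2f})")
```

Running this flags "cars" vs "vehicles" (high cosine similarity plus shared members) while leaving "animals" alone, which is exactly the kind of over-encompassing bucket pair mentioned above.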
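And a rough sketch of the judging side. The normalization rules are heavily simplified, the edit-distance cutoff is a guess, and WordNet stands in here for the LLM call that actually decides the hypernym/hyponym direction:

```python
import re

from nltk.corpus import wordnet as wn   # requires: pip install nltk; nltk.download('wordnet')

def normalize(answer: str) -> str:
    """Normalization before matching: lowercase, strip special chars, naive plural -> singular."""
    answer = answer.lower().strip()
    answer = re.sub(r"[^a-z0-9 ]", "", answer)
    if answer.endswith("s") and not answer.endswith("ss"):
        answer = answer[:-1]            # crude singularization; real rules are messier
    return answer

def levenshtein(a: str, b: str) -> int:
    """Edit distance, used to forgive small typos (e.g. 'vehical')."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def is_typo_match(guess: str, bucket: str, max_dist: int = 2) -> bool:
    """Accept a guess whose normalized form is within a couple of edits of the bucket."""
    return levenshtein(normalize(guess), normalize(bucket)) <= max_dist

def is_more_specific(guess: str, bucket: str) -> bool:
    """True if the guess is a hyponym of the bucket (i.e. more specific).
    WordNet is an illustrative stand-in for the LLM's direction judgment."""
    bucket_synsets = set(wn.synsets(bucket))
    for g in wn.synsets(guess):
        for path in g.hypernym_paths():          # root -> ... -> g
            if bucket_synsets & set(path[:-1]):  # bucket appears above the guess
                return True
    return False

print(is_typo_match("Vehical!", "vehicles"))   # True: normalization + edit distance <= 2
print(is_more_specific("truck", "vehicle"))    # True  -> GOOD, guess is more specific
print(is_more_specific("vehicle", "truck"))    # False -> BAD, guess is more general
```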
Let me know if you have any questions or feedback!