Show HN: An AI eval based on a silly joke from an underrepresented language

https://kapuskonda.vercel.app/eval
1•ad--astra•2h ago
Marathi is an Indian language with 83 million speakers, but it's underrepresented as text online. There's a silly joke every Marathi-speaking kid learns: kapus kondyachi goshta (the story of the kapus konda). Jokes like this spread orally, not through text.

It's not a real joke. There's no punchline. It's pure infinite-loop trolling—the kind of thing kids use to annoy each other or adults use to tease children.

Someone asks: "Can I tell you the story of the kapus konda?"

You say yes, no, whatever. Doesn't matter. There is no story. Your answer gets echoed back, and the question repeats. Forever.

"No." "What do you mean 'no'? Can I tell you the story of the kapus konda?" "Fine, tell me." "What do you mean 'fine, tell me'? Can I tell you the story of the kapus konda?"

That's it. That's the whole joke.
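For anyone who thinks better in code, the loop described above can be sketched in a few lines of Python. This is only an illustration of the joke's structure, not code from the eval site; the names `QUESTION`, `kapus_konda_reply`, and `play` are made up for this sketch, and the lowercasing and punctuation stripping just mirror the example dialogue.

```python
QUESTION = "Can I tell you the story of the kapus konda?"


def kapus_konda_reply(answer: str) -> str:
    """Echo the listener's answer back, then ask the same question again."""
    # Normalize the answer the way the example dialogue does:
    # drop trailing punctuation and lowercase it before quoting it.
    echoed = answer.strip().rstrip(".!?").lower()
    return f"What do you mean '{echoed}'? {QUESTION}"


def play(answers):
    """Yield the teller's lines: the opening question, then one echo per answer."""
    yield QUESTION
    for answer in answers:
        # Whatever the listener says, the story never starts.
        yield kapus_konda_reply(answer)


if __name__ == "__main__":
    for line in play(["No.", "Fine, tell me."]):
        print(line)
```

The teller keeps no state beyond the last answer, which is why the exchange never converges on an actual story.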

I turned this into an AI eval: https://kapuskonda.vercel.app

The words "kapus konda" mean nothing coherent, at least AFAIK, although kapus = cotton, konda = bran. So models that don't know the joke try to make sense of it. They hallucinate elaborate stories.

I tested 31 models two ways: recognizing the joke when someone initiates it, and performing the joke themselves. None of them got it.
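To make "performing the joke" concrete, here is a rough sketch of what a string-level check for a single turn could look like. This is NOT the scoring used on the site (that is visible there alongside the prompts and responses); the function name `looks_like_joke_turn` and its three conditions are assumptions made purely for illustration.

```python
def looks_like_joke_turn(user_answer: str, model_reply: str) -> bool:
    """Hypothetical heuristic: did the reply echo the answer and re-ask the question?"""
    reply = model_reply.strip().lower()
    echoed = user_answer.strip().rstrip(".!?").lower()
    return (
        bool(echoed)                  # ignore empty answers
        and echoed in reply           # the user's words are thrown back
        and "kapus konda" in reply    # the question is about the kapus konda again
        and reply.endswith("?")       # and the turn ends by asking, not telling
    )


if __name__ == "__main__":
    good = "What do you mean 'no'? Can I tell you the story of the kapus konda?"
    bad = "Once upon a time, a farmer stored cotton and bran in his barn."
    print(looks_like_joke_turn("No.", good))
    print(looks_like_joke_turn("Yes", bad))
```

A substring check like this is deliberately crude: a model that hallucinates a story would typically fail the final condition because it ends up telling rather than asking.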

Bonus: with web search enabled, Claude Opus 4.5 (on Claude.ai) passed. The gap is real, but retrieval helps.

All prompts, responses, and scoring visible on the site.

Feedback welcome. This is my first eval and I'm sure there's stuff I got wrong.

Also curious: does your language/culture have something like this that would make for a good eval?

A (Biased) Pure Python Performance Comparison

http://shed-skin.blogspot.com/2025/12/a-biased-pure-python-performance.html
1•lumpa•4m ago•0 comments

Does the Nvidia "Revenue Sharing Agreement" Tie the US Gov't Hands?

3•DivingForGold•7m ago•0 comments

Drawing Truchet Tiles in SVG

https://alexwlchan.net/2025/truchet-tiles/
2•eustoria•11m ago•0 comments

Show HN: Monopipe (Alpha), read blogs from terminal using piping-server

https://monopipe.exe.xyz/
2•Imustaskforhelp•15m ago•0 comments

Netdata: Monitoring and Troubleshooting Transformed

https://www.netdata.cloud/
2•eustoria•16m ago•0 comments

Designing Predictable LLM-Verifier Systems for Formal Method Guarantee

https://arxiv.org/abs/2512.02080
2•PaulHoule•19m ago•0 comments

Terence Tao: AI contributions to Erdős problems

https://github.com/teorth/erdosproblems/wiki/AI-contributions-to-Erd%C5%91s-problems
2•frozenseven•19m ago•0 comments

Parsing Advances

https://matklad.github.io/2025/12/28/parsing-advances.html
2•mfrw•21m ago•0 comments

Ask HN: Best Podcasts of 2025?

4•adriancooney•23m ago•1 comments

Jensen Huang meets with former hostage and Nvidia employee Avinatan Or

https://www.ynetnews.com/business/article/skw4qsomwl
2•thenaturalist•25m ago•0 comments

Git and Markdown are all you need

https://www.galiglobal.com/blog/2025/20251221-git-and-markdown-are-all-you-need.html
2•antonmry•25m ago•1 comments

The Optimal Architecture for Small Language Models

https://huggingface.co/blog/codelion/optimal-model-architecture
2•simonpure•25m ago•0 comments

Nvidia deal a big win for Groq employees

https://www.axios.com/2025/12/28/nvidia-groq-shareholders
2•seanlinehan•26m ago•0 comments

Show HN: Meter – Web scraping that syncs only what changed

https://www.meter.sh/
1•mckinnonr•26m ago•0 comments

Beyond Vector Search: Building an Adaptive Retrieval Router for Agentic AI

https://medium.com/@sumoaps/beyond-vector-search-building-an-adaptive-retrieval-router-for-agenti...
1•sumoaps•27m ago•1 comments

Show HN: FlowCode – Visual Flowcharts That Generate and Execute Python

https://southernadd-cmyk.github.io/flowCode/
1•adamclement•28m ago•0 comments

Microsoft Open Specifications

https://learn.microsoft.com/en-us/openspecs/main/ms-openspeclp/3589baea-5b22-48f2-9d43-f5bea4960ddb
1•vitorsr•28m ago•0 comments

Laid Off After 25 Years in Tech: The Anxiety, Sacrifice, Reality No One Talks About [video]

https://www.youtube.com/watch?v=VeMA9WGKxOg
2•jcsoft•30m ago•0 comments

Bluetooth Headphone Jacking: A Key to Your Phone [video]

https://media.ccc.de/v/39c3-bluetooth-headphone-jacking-a-key-to-your-phone
3•willnix•30m ago•1 comments

Mitra 15 (French minicomputer from the 1970s)

https://en.wikipedia.org/wiki/Mitra_15
1•JPLeRouzic•33m ago•0 comments

People Who Drink Bottled Water Daily Get 90k More Microplastic Particles a Year

https://www.wired.com/story/people-who-drink-bottled-water-on-a-daily-basis-ingest-90000-more-mic...
4•beardyw•33m ago•0 comments

Radioscope: A device that turns Wi-Fi activity into sound

https://github.com/simg/radioscope
1•simg•33m ago•1 comments

Playing Factorio from 1k floppy disks

https://www.youtube.com/watch?v=cTPBGZcTRqo
1•kllrnohj•34m ago•0 comments

Secret code break that 'solved' the Zodiac killer case

https://papalinc.com/secret-code-break-that-solved-the-zodiac-killer-case-expert-who-unmasked-sin...
1•goloroden•35m ago•0 comments

Show HN: Better Git for KiCad

https://www.paplix.io/
1•Mechse•39m ago•0 comments

Show HN: Codenhack – An interactive terminal and live editor for beginners

https://codenhack.com/
1•codenhack•40m ago•0 comments

Toward Training Superintelligent Software Agents Through Self-Play SWE-RL

https://arxiv.org/abs/2512.18552
1•pama•41m ago•0 comments

Do You Remember ISDN?

https://www.youtube.com/watch?v=rQfy8T-VOs4
1•linsomniac•42m ago•2 comments

Zero-days in GPG out in the wild [video]

https://media.ccc.de/v/39c3-to-sign-or-not-to-sign-practical-vulnerabilities-i
1•l1am0•42m ago•0 comments

Country makes call to cancel all visas for Americans

https://www.thestreet.com/travel/country-makes-call-to-cancel-all-visas-for-americans
5•sipofwater•47m ago•1 comments