frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Language1 – Benchmarking LLM comprehension of vague prompts via Taboo

https://language1.app/
1•kaandemirel•1h ago
Hi HN,

I built Language1 (https://language1.app), a word game where you play "reverse Taboo" against an LLM.

How it works: You are given a target word (e.g., "Apple") and a list of forbidden "taboo" words (e.g., "fruit", "red", "tree"). Your goal is to write a prompt that guides the LLM to output the exact target word, without using any of the forbidden words.

The Benchmark Goal: I am developing this project with the plan of using the gameplay data to build a benchmark dataset. The goal is to test and evaluate LLM capabilities when processing unclear prompts, metaphors, analogies, and vague explanations under semantic constraints.

Game Modes:

Single Player: Play through a pool of challenges to test your prompt precision. You compete against other players globally across attempts, solve time, and token consumption (measured via standard cl100k_base encoding). You can play instantly without registering, or sign in (one-click Google login) to submit scores to the leaderboards. Multiplayer Races: Real-time lobbies of up to 10 players racing through 3 rounds. Note: Since the game is new, public lobbies might be empty at first, but you can create private lobbies to play with friends. Available Models:

Anonymous users play with the default Gemma 3 Instruct model. Free registered users can choose between multiple models to test and compare reasoning styles, including Llama 3 8B, Liquid LFM 24B, Amazon Nova Micro, and Ministral 8B.

The Tech & Guardrails: The app is built with a React frontend and a Node.js/AWS Lambda backend. To keep things fair, we built a validation guard that parses input clues to block easy bypasses like letter-spacing (e.g., "A-P-P-L-E"), translations, cyphers, and base64. You have to rely purely on semantic reasoning to guide the model.

The game is completely free, has no ads, and is playable instantly in the browser.

I'd love to hear your thoughts on the gameplay and see what creative semantic tricks you use to guide the LLM!

Ask HN: Is there a way to stop the animated Google Doodles?

1•arnejenssen•1m ago•0 comments

SRAM Wall Art

https://old.reddit.com/r/chipdesign/comments/1u99eio/256x8_sram_wall_art/
1•random__duck•2m ago•0 comments

LLMs Put Style over Substance, You Should Put Substance over Style

https://www.felixhaba.com/writing/llms-put-style-over-substance/
1•feliixh•2m ago•0 comments

The Harajuku Moment

https://tim.blog/2024/02/09/harajuku-moment/
1•abhaynayar•2m ago•0 comments

Show HN: We ran 74 popular MCP servers in microVMs to see what breaks

https://usethrone.dev/registry
1•imtaimoorkhan•3m ago•0 comments

What We Know About Billionaire Peter Thiel's 'Dialog' Society

https://www.forbes.com/sites/maryroeloffs/2026/06/18/what-we-know-about-billionaire-peter-thiels-...
1•sreekanth850•4m ago•3 comments

Show HN: Emacs log-mode (cheap copy of Logseq)

https://github.com/luqtas/log-mode
1•luqtas•4m ago•0 comments

Ellf: Virtual NLP Engineer

https://beta.ellf.ai/
1•paffdragon•6m ago•0 comments

Global Freedom and Democracy Indices

https://www.amos.design/the-civic-atlas
1•bookofjoe•7m ago•0 comments

LLM biased against accessible code (Claude Code issue #56079)

https://www.aaron-gustafson.com/notebook/2026-06-17-llm-biased-against-accessible-code/
1•robin_reala•7m ago•0 comments

JetBrains IDE Expertise, Now on LinkedIn

https://blog.jetbrains.com/blog/2026/06/17/your-jetbrains-ide-expertise-now-on-linkedin/
1•WhiteDawn•8m ago•0 comments

SQLite Cloud SQLite database with real-time synchronization

https://www.sqlite.ai
1•Asfand3099•9m ago•0 comments

GLM-5.2 is probably the most powerful text-only open weights LLM

https://simonwillison.net/2026/Jun/17/glm-52/
5•Brajeshwar•10m ago•0 comments

Double Entry Programming

https://www.0xsid.com/blog/double-entry-programming
2•ssiddharth•12m ago•0 comments

What is the best Duolingo for X thing to build?

2•Lil-Finance-Bro•12m ago•1 comments

Windows 93

http://windows93.net/
2•xg15•14m ago•0 comments

Show HN: StartupWiki, a free alternative to crunchbase/pitchbook

https://startupwiki.tech/
1•shpran•15m ago•0 comments

AI-Native Firms

https://twitter.com/orgRem/status/2067318661669372196
1•jeffreyrogers•15m ago•0 comments

Cultivating Interests in Undergrad

https://bcmullins.github.io/favorite-books-from-undergrad/
1•wannabebarista•16m ago•0 comments

EOL – Find an audience from public posts, talk to it as one

https://www.earthonlines.com/product
1•BonanKou•16m ago•0 comments

Trump administration Backs Off Plan to End Ocean Monitoring

https://www.nytimes.com/2026/06/18/climate/trump-ocean-observatories-initiative.html
5•burkaman•16m ago•1 comments

The Makings of a Good Bioweapon

https://www.owlposting.com/p/the-makings-of-a-good-bioweapon
1•crescit_eundo•17m ago•0 comments

Coinbase AI Adviser

https://www.coindesk.com/business/2026/06/16/coinbase-intoduces-ai-advisor-stock-options-and-pre-...
1•AnhTho_FR•17m ago•0 comments

Apple's Tim Cook Says Price Increases Are 'Unavoidable'

https://www.cnet.com/tech/mobile/apples-tim-cook-says-price-increases-are-unavoidable/
1•speckx•19m ago•0 comments

Prof. Sarah Paine – Round Up of Grand Strategy and Geopolitics

https://www.youtube.com/watch?v=OS1NZLgKM2c
1•lifeisstillgood•22m ago•0 comments

September 2025 NPM Attack Hit 2.6B Weekly Downloads. Most Found Out on Twitter

https://datanexusmcp.com/blog/npm-supply-chain-attack-2025/
2•jsmudda•22m ago•1 comments

Show HN: Motion-contact-sheet – give a coding agent eyes for motion

https://github.com/Kallin/motion-contact-sheet
1•kal9000•23m ago•0 comments

Show HN: Jsonl-tools – secure paste bin for agent run traces

https://jsonl-tools.dev/
2•vierliam•23m ago•0 comments

Show HN: One hundred LLMs Generating a HTML/CSS Solar System

https://aibenchy.com/showcase/solar-system-animation/
2•XCSme•24m ago•0 comments

Show HN: Asili – open-source, privacy-first in-browser DNA PGS scoring

https://github.com/techninja/asili
2•techninja42•26m ago•1 comments