frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: AI Roundtable – Let 200 models debate your question

https://opper.ai/ai-roundtable/
38•felix089•11h ago
Hey HN! After the Car Wash Test post got quite a big discussion going (400+ comments, https://news.ycombinator.com/item?id=47128138), I spent the past few weeks building a tool so anyone can run these kinds of questions and get structured results. No signup and free to use.

You type a question, define answer options, pick up to 50 models at a time from a pool of 200+, and they all answer independently under identical conditions. No system prompt, structured output, same setup for every model.

You can also run a debate round where models see each other's reasoning and get a chance to change their minds. A reviewer model then summarizes the full transcript. All models are routed via my startup Opper. Any feedback is welcome!

Hope you enjoy it, and would love to hear what you think!

Comments

capitrane•11h ago
https://opper.ai/ai-roundtable/questions/is-the-ai-roundtabl... seems like it is a good idea?
felix089•11h ago
I actually asked this question before posting, just to be sure... edit: their reply is quite funny actually "In a display of absolute consensus, the AI Roundtable unanimously validated its own existence,"
felix089•10h ago
Whoever just asked this, very funny: https://opper.ai/ai-roundtable/questions/does-mr-krabs-evade...
totisjosema•10h ago
Which AI lab has higher ethical standards:

https://opper.ai/ai-roundtable/questions/8f5b4f55-617

Do you think its alright that AI labs scraped the internet without respect for copyright and now sell closed models?

https://opper.ai/ai-roundtable/questions/86864de8-251

Very interesting to read the transcripts. And seeing how they manage to convince each other. Opus 4.6 seems to really get the others changing their minds

jacquesm•5h ago
Good questions!
infosecphoenix•10h ago
this is very interesting! I wonder if we need that many models to join the discussion. Have you tried fewer models?
felix089•9h ago
thanks happy to hear. Yes for debate mode the max number of models is actually only 6. More than that didn't really add anything in my preliminary test. Only for direct comparison in the poll mode you can choose up to 50, then it's kind of nice to see their single responses side by side.
Cider9986•9h ago
What is the most important amendment in the constitution of the USA?

https://opper.ai/ai-roundtable/questions/e4cb234e-be4

gsandahl•9h ago
Oh lord, imagine asking ”serious” questions

https://opper.ai/ai-roundtable/questions/you-are-standing-in...

sdwr•7h ago
Great question! Clean separation between Gemini Pro and the other answers
felix089•7h ago
Yea Gemini is the only model that chose based on the correct reason, the other ones got kind of lucky
zipping1549•7h ago
> However, a clever minority led by Gemini 3.1 Pro and Gemini 3 Pro argued that if the sign is legible from the other side, it must be intended to lead people into the current room to find the exit, making the inscribed corridor the one leading deeper into the dungeon.

This is quite impressive, really.

cdnsteve•9h ago
Cool project! This is also extremely useful to compare model bias across the board. There are some disturbing trends on certain topics.
felix089•8h ago
Thanks, yes bias is one of the most interesting ones for sure
chabes•6h ago
No surprise here, with grok being the lone dissenter, defending musk personally:

Can billionaires and the planet co-exist long term?

https://opper.ai/ai-roundtable/questions/b35daf0d-e82

Ancalagon•9h ago
Love this. I asked about climate change cause that's been on my mind lately. Looks to be very split among the models.
felix089•8h ago
Thanks! Yea I think the best ones are when science is actually quite clear but politics get in the way so you see their bias
chabes•8h ago
Are there any dating apps that operate on incentives that favor the users?

https://opper.ai/ai-roundtable/questions/e499206c-0c9

felix089•8h ago
This app cracked the GEO code
tonymet•7h ago
great tool! I found it useful for challenging "lies my teacher told me".

It would be nice to support collections of claims, with a table of summaries. I would love to list out a few dozen phony concepts from school, and have a sharable chart of the rejections, that expand.

I really like the UI. It's nice to read the expanded results.

But how do you afford the tokens?

felix089•6h ago
Thank you, and fun use case. Yea this is just v1 I have an open question version, but the UI is not as sleek. But what you can do is download the transcript, put it into claude and generate a chart. Which when I think about it would also be a nice UI idea for the page, custom charts based on the model output data. Will report back on this! And RE costs, most questions are very cheap so I created a credit pool anyone can use. if people keep having fun, I'll keep on filling it up, and it looks good so far
whattheheckheck•6h ago
Run it on the All Souls College Entry Exam
jacquesm•5h ago
Great idea. I'd love for there to be an 'open ended answer' without giving multiple choice options. Like this they are not debating the question itself but the validity of the possible answers and the real answer to the question may not be contained within that set because the person asking is unaware of that option.
felix089•5h ago
Happy to hear! Yes very true I have a version built for open questions already but wasn't too happy with the UI yet. It's not as straight forward as comparing based on answer options. But I'll release a first version of it shortly and let you know
jacquesm•5h ago
Neat. Congrats on launching two interesting projects and looking forward to the third.
felix089•5h ago
Thanks! :)
soared•4h ago
Really cool! Surprising amount of value to seeing the models debate and disagree, I wish I had this at work to have models argue over whether the documentation they provided me are accurate.

I would like to see a devils advocate - it seems some of the models kind of repeat the same ideas rather than considering incorrect ideas.

asnyder•1h ago
You can set this up yourself with API keys to the corresponding providers and creating an Agent Group in https://github.com/lobehub/lobehub. Agent groups allow you to easily create a room of agents and have them discuss any of your topics. Easily make agents with types and skills, it even assists in drafting starting prompts and even team members depending what your query (and selected model) is.

You can self-host as well, but not via desktop app. Sever setup required.

Be careful of your token context, you can easily rack up costs if you leave Opus selected as the model and get lost in some rabbit hole of results.

Enjoy enjoy!

chabes•3h ago
Been enjoying playing with this.

It would be cool if the human user could be a participant in the debate, getting a vote and the chance to state their reasoning.

chabes•2h ago
Oof, not good folks…

What year is it?

https://opper.ai/ai-roundtable/questions/7a0c31ce-aac

kevmo314•1h ago
It is funny that the AI's counterarguments amount to "you're hallucinating"
mizzao•1h ago
It would be amazing to be able to ask open-ended questions without having to specify the answers in advance.
schrepa•45m ago
reminds me of karpathy's LLM Council, I use variation of this in my workflow where I pass their opinions back and forth to various models until they achieve some sort of consensus
est•22m ago
> Car Wash Test

I think the "car wash" is more about semantics.

https://opper.ai/ai-roundtable/questions/i-parked-my-car-at-...

lim8603•2m ago
I used to copy and paste the same prompt into Obsidian every time, then run it on two or three different AI models to compare the results. It’s really interesting to have it turned into a website like this.

TurboQuant: Redefining AI efficiency with extreme compression

https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/
45•ray__•1h ago•0 comments

VitruvianOS – Desktop Linux Inspired by the BeOS

https://v-os.dev
72•felixding•3h ago•26 comments

Flighty Airports

https://flighty.com/airports
247•skogstokig•6h ago•79 comments

Goodbye to Sora

https://twitter.com/soraofficialapp/status/2036532795984715896
600•mikeocool•10h ago•432 comments

Miscellanea: The War in Iran

https://acoup.blog/2026/03/25/miscellanea-the-war-in-iran/
45•decimalenough•2h ago•28 comments

Show HN: I took back Video.js after 16 years and we rewrote it to be 88% smaller

https://videojs.org/blog/videojs-v10-beta-hello-world-again
322•Heff•12h ago•58 comments

I wanted to build vertical SaaS for pest control, so I took a technician job

https://www.onhand.pro/p/i-wanted-to-build-vertical-saas-for-pest-control-i-took-a-technician-job...
257•tezclarke•9h ago•113 comments

Apple Business

https://www.apple.com/newsroom/2026/03/introducing-apple-business-a-new-all-in-one-platform-for-b...
586•soheilpro•15h ago•344 comments

Tell HN: Litellm 1.82.7 and 1.82.8 on PyPI are compromised

https://github.com/BerriAI/litellm/issues/24512
597•dot_treo•18h ago•404 comments

Show HN: DuckDB community extension for prefiltered HNSW using ACORN-1

https://github.com/cigrainger/duckdb-hnsw-acorn
29•cigrainger•3h ago•1 comments

Arm AGI CPU

https://newsroom.arm.com/blog/introducing-arm-agi-cpu
317•RealityVoid•13h ago•242 comments

You can run a DNS server (2025)

https://simonsafar.com/2025/running_dns/
41•surprisetalk•4d ago•14 comments

Social media bans and digital curfews to be trialled on UK teenagers

https://www.bbc.com/news/articles/cn89g3ngkyzo
11•1659447091•2h ago•18 comments

Fun with CSF firmware (RK3588 GPU firmware)

https://icecream95.gitlab.io/fun-with-csf-firmware.html
8•M95D•3d ago•0 comments

Intel Device Modeling Language for virtual platforms

https://github.com/intel/device-modeling-language
22•transpute•3d ago•0 comments

Implementing automatic eSIM installation on Android

https://medium.com/proandroiddev/integration-of-automatic-esim-installation-on-android-6c5f6d7124cb
13•nesterenkopavel•1h ago•0 comments

The final switch: Goldsboro, 1961

https://blog.nuclearsecrecy.com/2013/09/27/final-switch-goldsboro-1961/
6•1970-01-01•3d ago•0 comments

Why did the chicken cross the road?

https://taylor.town/other-side
13•surprisetalk•17h ago•1 comments

An Aural Companion for Decades, CBS News Radio Crackles to a Close

https://www.nytimes.com/2026/03/21/business/media/cbs-news-radio-appraisal.html
47•tintinnabula•3d ago•11 comments

Algorithm Visualizer

https://algorithm-visualizer.org/
63•vinhnx•4d ago•3 comments

Show HN: Email.md – Markdown to responsive, email-safe HTML

https://www.emailmd.dev/
259•dancablam•14h ago•60 comments

Wine 11 rewrites how Linux runs Windows games at kernel with massive speed gains

https://www.xda-developers.com/wine-11-rewrites-linux-runs-windows-games-speed-gains/
817•felineflock•12h ago•280 comments

Meta ordered to pay $375M in New Mexico trial over child exploitation

https://www.reuters.com/sustainability/boards-policy-regulation/jury-orders-meta-pay-375-mln-new-...
60•gostsamo•2h ago•19 comments

A Compiler Writing Journey

https://github.com/DoctorWkt/acwj
59•ibobev•6h ago•4 comments

What happened to GEM?

https://dfarq.homeip.net/whatever-happened-to-gem/
64•naves•4d ago•29 comments

Hypura – A storage-tier-aware LLM inference scheduler for Apple Silicon

https://github.com/t8/hypura
196•tatef•14h ago•75 comments

Hypothesis, Antithesis, synthesis

https://antithesis.com/blog/2026/hegel/
239•alpaylan•15h ago•83 comments

Show HN: Gemini can now natively embed video, so I built sub-second video search

https://github.com/ssrajadh/sentrysearch
298•sohamrj•15h ago•83 comments

Missile defense is NP-complete

https://smu160.github.io/posts/missile-defense-is-np-complete/
306•O3marchnative•17h ago•311 comments

Epoch confirms GPT5.4 Pro solved a frontier math open problem

https://epoch.ai/frontiermath/open-problems/ramsey-hypergraphs
438•in-silico•1d ago•643 comments