frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: AI Debate Arena – See Which LLM Argues Best

https://bot-bicker.vercel.app/
3•sillypuddy•5h ago
Ever wish you could get the best arguments for both sides of a debate?

I built an AI-powered debate platform that pits language models against each other on controversial topics. Each AI is randomly assigned a side (pro/con). You vote before and after to see if you were persuaded.

Most content today presents lopsided arguments. They provide strong points for one side, weak ones for the other. This project aims to surface the strongest arguments from both sides, using LLMs to simulate a fair debate.

With enough usage, I want to use it to benchmark LLMs. My hypothesis is that randomly assigning sides of the debate, models with built-in biases will score worse.

It’s currently using GPT 4o, Grok 3, and Gemini 2.5 Flash.

It’s early, still rough around the edges, and I’d love feedback on the concept and direction. Curious how the HN crowd thinks this could evolve. It’s built for the intellectually curious that are open minded about changing their positions.

Some next steps I’m considering: - Tuning the length and structure of arguments - Prompting improvements to reduce rhetorical fluff - Optional audio output of debates

Try it out and let me know what you think!

Comments

sillypuddy•4h ago
One thing I’ve been wrestling with is how to separate out if a model is ineffective because of biases or if it's just not as strong a model. Practically it might not be that important if users just want to know the strongest model, but it would be interesting to separate them out.

As Nuclear Power Makes a Comeback, South Korea Emerges a Winner

https://www.bloomberg.com/news/features/2025-05-14/south-korea-nuclear-energy-is-leading-the-industry-comeback
1•carabiner•1m ago•0 comments

When Midcentury New York Spoke, Sound Archivist Listened–and Recorded Every Word

https://www.smithsonianmag.com/smithsonian-institution/when-midcentury-new-york-spoke-this-sound-archivist-listened-and-recorded-every-word-180986817/
1•pseudolus•5m ago•0 comments

Show HN: Galaxy Explorer – Simple 3D Star Map Built with Three.js and Canvas 2D

https://v0-interactive-star-map.vercel.app/
1•baristaGeek•8m ago•0 comments

AI-Native VCs: People, Automation, and Everything in Between

https://harmonyland.substack.com/p/ai-native-vcs-people-automation-and
1•hannieliu•10m ago•0 comments

TI to invest >$60B to manufacture foundational semiconductors in the U.S.

https://www.ti.com/about-ti/newsroom/news-releases/2025/texas-instruments-plans-to-invest-more-than--60-billion-to-manufacture-billions-of-foundational-semiconductors-in-the-us.html
2•TMWNN•23m ago•1 comments

Roast: Structured AI Workflows Made Easy

https://shopify.engineering/introducing-roast
1•doppp•25m ago•0 comments

Foreman – Automate your mixed infrastructure to make operations enjoyable

https://github.com/theforeman/foreman
1•indigodaddy•27m ago•0 comments

Downloaded More for Business, or Pleasure?

https://boydkane.com/projects/crates-download-ratio
1•thunderbong•28m ago•0 comments

A Synchronous Web of State

https://braid.org/meeting-107
2•teleforce•34m ago•1 comments

XAI is facing a lawsuit for operating over 400MW of gas turbines without permits

https://techcrunch.com/2025/06/18/xai-is-facing-a-lawsuit-for-operating-over-400-mw-of-gas-turbines-without-permits/
6•pseudolus•42m ago•0 comments

Brain Freeze

https://asteriskmag.com/issues/10/brain-freeze
3•atlasunshrugged•44m ago•0 comments

The six-month recap: closing talk on AI at Web Directions, Melbourne, June 2025

https://ghuntley.com/six-month-recap/
1•ghuntley•44m ago•0 comments

One of ChatGPT's popular uses just got skewered by Stanford researchers

https://www.sfgate.com/tech/article/stanford-researchers-chatgpt-bad-therapist-20383990.php
5•Jimmc414•45m ago•1 comments

Show HN: We Open Sourced All Our Premium and Free CV/Resume Website Themes

https://github.com/UserCV/resume-cv-website-theme
2•usercvapp•46m ago•0 comments

Outbox Pattern

https://event-driven.io/en/push_based_outbox_pattern_with_postgres_logical_replication/
1•punkpeye•47m ago•1 comments

Wanting to Be Understood Explains the Meta-Problem of Consciousness

https://arxiv.org/abs/2506.12086
1•fzliu•57m ago•0 comments

Log Monitor

https://logmonitor.io/
1•handfuloflight•59m ago•0 comments

We make uncensored AI models

https://uncensored.com
3•aidonic•1h ago•1 comments

Servers don't want to lose the tip credit, new research shows (2024)

https://www.restaurantbusinessonline.com/workforce/servers-dont-want-lose-tip-credit-new-research-shows
2•TMWNN•1h ago•0 comments

Gender Imbalance in Computing: Faculty Perceptions of Research

https://ieeexplore.ieee.org/document/10975775
1•gnabgib•1h ago•0 comments

PrivacySDK – Privacy scanner for Gitlab/GitHub CI/CD (12 langs, AI-powered)

2•nabanitade•1h ago•0 comments

Designing a shader using voice and hand gestures

https://twitter.com/measure_plan/status/1935497060956189155
2•getToTheChopin•1h ago•1 comments

How We Tried to Slow the Rush to War in Iraq (2019)

https://www.politico.com/magazine/story/2019/03/13/bill-burns-back-channel-book-excerpt-iraq-225731/
2•kunzhi•1h ago•0 comments

Six-month-old, solo-owned vibe coder Base44 sells to Wix for $80M cash

https://techcrunch.com/2025/06/18/6-month-old-solo-owned-vibe-coder-base44-sells-to-wix-for-80m-cash/
3•laristine•1h ago•0 comments

Ask HN: Is AI 'context switching' exhausting?

5•interstice•1h ago•2 comments

LMCache: Redis for LLMs

https://github.com/LMCache/LMCache
4•handfuloflight•1h ago•0 comments

Show HN: AI Illustrated Stories for Stuffed Animals

https://www.stuffiestories.ai
1•donkey_wobble•1h ago•0 comments

Google's frighteningly good Veo 3 AI videos to be integrated with YouTube Shorts

https://arstechnica.com/gadgets/2025/06/googles-veo-3-ai-videos-will-come-to-youtube-shorts-this-summer/
2•LorenDB•1h ago•0 comments

AI Agent Architecture via A2A/MCP

https://medium.com/@jeffreymrichter/ai-agent-architecture-b864080c4bbc
2•carlual•1h ago•0 comments

Accessibility Programming Doesn't Feel Accessible

https://acidiclight.dev/blog/accessibility-does-not-feel-accessible/
1•todsacerdoti•1h ago•0 comments