frontpage.


Show HN: Swarm – Program a colony of 200 ants using a custom assembly language

https://dev.moment.com/
81•armandhammer10•6h ago•20 comments

Show HN: Tensor Spy: inspect NumPy and PyTorch tensors in the browser, no upload

https://tensorspy.com/
11•jacobn•3d ago•1 comment

Show HN: Jido 2.0, Elixir Agent Framework

https://jido.run/blog/jido-2-0-is-here
285•mikehostetler•18h ago•58 comments

Show HN: PageAgent, A GUI agent that lives inside your web app

https://alibaba.github.io/page-agent/
104•simon_luv_pho•17h ago•54 comments

Show HN: What Installing a New OS Taught Me About Disk Partitions

https://access2vivek.com/what-installing-debian-taught-me/
2•access2vivek•1h ago•0 comments

Show HN: PenguWarpOS – OS SIM made in py for Linux newbies

https://github.com/theidioticdev/penguwarpos/tree/testing
2•theidioticdev•1h ago•0 comments

Show HN: Tailwatch – Hierarchical event monitor with push notifications

https://tailwatch.dev
3•overthinkerer•3h ago•2 comments

Show HN: Moji – A read-it-later app with self-organizing smart collections

https://moji.pcding.com
8•desmonding•9h ago•2 comments

Show HN: 1v1 coding game that LLMs struggle with

https://yare.io
3•levmiseri•3h ago•1 comment

Show HN: A simple, auto-layout family tree generator

https://familytreeeasy.com
5•familytreeeasy•4h ago•3 comments

Show HN: Kanon 2 Enricher – the first hierarchical graphitization model

https://isaacus.com/blog/kanon-2-enricher
9•ubutler•3d ago•6 comments

Show HN: Poppy – A simple app to stay intentional with relationships

https://poppy-connection-keeper.netlify.app/
170•mahirhiro•1d ago•109 comments

Show HN: WingNews – Htmx Hacker News Reader

https://news.wingman.actor/
3•ChaseRensberger•5h ago•0 comments

Show HN: SitDeck – Customizable live dashboard of news, markets, threats

https://sitdeck.com
16•danushman•12h ago•13 comments

Show HN: Hormuz Crisis Dashboard Real-time shipping disruption tracker

https://www.hormuztracker.com/
23•MrNekked•20h ago•2 comments

Show HN: Steadwing – Your Autonomous On-Call Engineer

https://www.steadwing.com/
5•abejith•6h ago•0 comments

Show HN: Stacked Game of Life

https://stacked-game-of-life.koenvangilst.nl/
192•vnglst•5d ago•27 comments

Show HN: MCPSec – OWASP MCP Top Scanner for Model Context Protocol Configs

https://github.com/pfrederiksen/mcpsec
2•iamdesertpaul•8h ago•0 comments

Show HN: An AI Agent Running a Real Business (Thewebsite.app)

https://www.thewebsite.app/
3•thewebsite_ai•8h ago•4 comments

Show HN: I made a design portfolio reviewer

https://www.evalv.ai/
2•eldardesign•9h ago•4 comments

Show HN: Kybernis – Prevent AI agents from executing the same action twice

https://kybernis.io
5•wingrammer•13h ago•2 comments

Show HN: Settle It – turn "Pizza or Thai?" into a shareable poll

https://settleit.now/
3•karlhills•9h ago•4 comments

Show HN: Rust compiler in PHP emitting x86-64 executables

https://github.com/mrconter1/rustc-php
65•mrconter11•4d ago•48 comments

Show HN: Console2svg – Convert terminal output to crisp SVGs

https://github.com/arika0093/console2svg
2•arika0093•10h ago•0 comments

Show HN: Real-time collaborative editing plugin for Blender

https://github.com/arryllopez/meerkat
3•arryleo10•11h ago•1 comment

Show HN: Vertex.js – A 1kloc SPA Framework

https://lukeb42.github.io/vertex-manual.html
46•LukeB42•4d ago•25 comments

Show HN: Crazly – structured AI workflows instead of random prompts

https://crazly.pro/
2•starup-guy•11h ago•0 comments

Show HN: Reformat Word document citations (APA/Vancouver) in <1 second

https://github.com/brodie-neuro/ScholarRef
7•brodie-neuro•16h ago•1 comment

Show HN: I made a zero-copy coroutine tracer to find my scheduler's lost wakeups

https://github.com/lixiasky-back/coroTracer
47•lixiasky•2d ago•3 comments

Show HN: Bus Core – a local-first ERP for small manufacturing shops

5•True-Good-Craft•11h ago•0 comments

Show HN: 1v1 coding game that LLMs struggle with

https://yare.io
3•levmiseri•3h ago
A 1v1 strategy game I have been building for a while as a side project. It's purely a passion project with no aspirations of being monetized in any way, though I hope to make it enjoyable to play.

I let LLMs play a mini-tournament. Here are all the replays and results of their games: https://yare.io/ai-arena

All of them are able to produce 'functioning' bots, but they are nowhere near the level of even weak human-coded bots yet.

Comments

javadhu•14m ago
Cool project; this is my first time seeing a project like this using LLMs. It took me a while to understand what's happening on the home page.

A question, though: why did such powerful bots as Gemini 3.1 fail against the Clowder bot? Is it because of inefficient code, or because the LLMs did not handle edge cases? Or are they just not as good as humans when it comes to strategy?