frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built an AI Colosseum to battle-test different agent architectures

https://project-chimera.streamlit.app/
2•aytuakarlar•1h ago
Hey HN,

I've been obsessed with a problem: raw LLMs are powerful but unsafe for high-stakes decisions. I've spent the last few months building a hybrid architecture to make them more rational and disciplined.

To test it, I built an AI Colosseum.

The architecture is Neuro-Symbolic-Causal:

* Neuro (GPT-4o): The creative strategist that proposes actions.

* Symbolic (Guardian): A hard-coded, formally verified (TLA+) rule engine. It's the safety layer that says "no" to bad ideas.

* Causal (Oracle): An `econml` model trained on historical data to predict the long-term value of any given action.

The Colosseum is a Streamlit app where these agents compete. One of the first things I saw was the full Chimera agent choosing to hold cash to successfully survive a simulated market crash, while a simpler "LLM-only" agent lost heavily. It proved that sometimes the smartest move is not to play.

It's an early closed beta launching on Oct 7th. I'm looking for feedback from technical folks. If you're interested, starring the repo is the best way to get on the list for an invite.

Code for early access: Spartacus

Repo: https://github.com/akarlaraytu/Project-Chimera

Tech stack: Python, Streamlit, LangChain, econml, pandas_ta, TLA+.

I'll be in the comments all day to answer questions. Appreciate any thoughts or critiques.

Justice Clarence Thomas says legal precedents are not 'the gospel'

https://abcnews.go.com/Politics/justice-clarence-thomas-legal-precedents-gospel/story?id=125967044
1•throw0101c•37s ago•0 comments

Americans Are Using PTO to Sleep, Not for Vacation–Report

https://www.newsweek.com/americans-are-using-pto-to-sleep-not-for-vacation-report-10783162
2•randycupertino•2m ago•1 comments

Justice Department Seeks Information on Georgia D.A. Who Prosecuted Trump

https://www.nytimes.com/2025/09/26/us/justice-department-fani-willis-trump.html
2•throw0101c•5m ago•1 comments

Scaling Beyond Memory: How Materialize Uses Swap for Larger Workloads

https://materialize.com/blog/scaling-beyond-memory/
1•jitl•9m ago•0 comments

Capnwebcpp – a small Cap'n Web C++ server library

https://github.com/nnevatie/capnwebcpp
1•nnevatie•12m ago•1 comments

Quantum Fuse – A Two-Qubit Quantum Computer

https://github.com/ingen0s/quantumfuse
1•ingen0s•14m ago•1 comments

Show HN: Comparing iTerm2 and Neovim Theme Similarity

https://rpubs.com/samesense/theme_colors
2•samesense•15m ago•0 comments

Ask HN: Is the "AI Boom" a Python Boom?

1•dpflan•17m ago•0 comments

Show HN: I build a desktop tool to convert files & edit PDFs/audio/video offline

https://convertfast.co/
1•amsaleque•18m ago•0 comments

'An attacker's playground:' Crims exploit GoAnywhere perfect-10 bug

https://www.theregister.com/2025/09/26/an_apts_playground_goanywhere_perfect10/
1•Bender•18m ago•0 comments

Show HN: Blognerd – search posts, blogs and export OPML

https://blognerd.app
1•alastairr•19m ago•0 comments

What banning AI surveillance should look like, at a minimum

https://gabrielweinberg.com/p/what-banning-ai-surveillance-looks
1•FromTheArchives•19m ago•0 comments

800k tons of mud probably just made electronics a little more expensive

https://www.theregister.com/2025/09/26/grasberg_accident_copper_prices/
2•Bender•20m ago•0 comments

SpaceX Dragon huffs, puffs and fizzles out as NASA aborts ISS boost

https://www.theregister.com/2025/09/26/iss_reboost_attempt_aborted/
1•Bender•21m ago•0 comments

Tribunal upholds 'catastrophic' Ancestry request to access Scottish records

https://www.whodoyouthinkyouaremagazine.com/news/ancestry-nrs-records
1•ilamont•22m ago•0 comments

The real (economic) AI apocalypse is nigh

https://pluralistic.net/2025/09/27/econopocalypse/#subprime-intelligence
1•NotInOurNames•22m ago•0 comments

UK Households hit with higher bills that sees wind farms paid to turn off power

https://www.independent.co.uk/climate-change/octopus-energy-greg-jackson-wind-farms-climate-b2828...
1•eldaisfish•23m ago•1 comments

Exploring Terminals, TTYs, and PTYs

https://cefboud.com/posts/terminals-pty-tty-pyte/
1•birdculture•26m ago•0 comments

Corporate ABUSE OF THE H-1B VISA PROGRAM

https://twitter.com/JudiciaryDems/status/1971298854273445952
1•kappi•28m ago•1 comments

Show HN: LunchSTEM (probably) the best STEM knowledge base in the world

https://github.com/Freelunch-AI/lunch-stem
1•BrunoScaglione•32m ago•0 comments

Morgan Stanley warns AI could sink 42-year-old software giant

https://finance.yahoo.com/news/morgan-stanley-warns-ai-could-180300766.html
2•taubek•34m ago•0 comments

Show HN: NextMin – Schema-Driven APIs with Hot Reloading

https://nextmin.gscodes.dev/
2•tareqaziz0065•35m ago•0 comments

KidSearch – a safe, educational search engine I built for my son

https://github.com/laurentftech/kidsearch
1•laurentftech•35m ago•2 comments

The Fastest-Selling Cars in America Are Used EVs

https://www.bloomberg.com/news/articles/2025-09-27/the-us-used-electric-vehicle-market-is-taking-...
1•zachshefska•36m ago•0 comments

Cursor Learn

https://cursor.com/learn
1•meetpateltech•38m ago•0 comments

Trying to reach 100K co-learning sessions in 100 days

https://twitter.com/implabinash/status/1971931899435340086
1•implabinash•40m ago•1 comments

Overkill JSON parser optimization: C/Assembly

https://raphaelouthier.github.io/prj/jsn/jsn_0_intro/
1•random_duck•42m ago•1 comments

US7311526B2: Magnetic Connector for Electronic Device

https://patents.google.com/patent/US7311526B2/en
1•rew0rk•43m ago•1 comments

All Atom Virtual Cell

https://diffuse.one/p/d1-009
1•teddykoker•43m ago•0 comments

Struggling French clubs open doors to shareholder fans in tough times

https://www.theguardian.com/football/2025/sep/07/french-clubs-shareholder-fans-socios
1•PaulHoule•43m ago•0 comments