Realworld benchmark between Codex 5.3 and Opus 4.6

https://swe-agi.com/
3•hongbo_zhang•1h ago

Comments

hongbo_zhang•1h ago
This is a benchmark comparing the latest models on a new programming language, chosen to avoid overfitting. The latest models generalize quite well to new languages: they can write tens of thousands of lines of working code from a single prompt.
alontorres•1h ago
I do feel like the latest Codex 5.2 and 5.3 have been really excellent at coding and have been giving Opus a good fight. I still prefer Opus 4.6 as my daily driver, but specifically for coding tasks I think Codex 5.3 is the best, especially when considering value for money.
hongbo_zhang•1h ago
Another thing I like about Codex 5.3 is that its CLI supports queueing messages directly, without third-party plugins. It can also run for weeks without any issues, whereas CC used to have memory issues and stack overflows.

Google may be cracking down on self-promotional 'best of' listicles

https://searchengineland.com/google-cracking-down-self-promotional-best-of-listicles-468227
1•gnabgib•1m ago•0 comments

Show HN: Sovereign Suite – A Recursive Logic Framework for AI Governance

https://github.com/holland202/Sovereign-Suite-Manifest
1•badatchess•5m ago•0 comments

Show HN: New Open Source Agent with 62 Stars on GitHub

https://github.com/dakotalock/holygrailopensource
1•Moriarty2027•8m ago•0 comments

Mitchell Hashimoto Launches 'Vouch' to Fight AI Slop in Open Source Ecosystem

https://itsfoss.com/news/mitchell-hashimoto-vouch/
2•WaitWaitWha•8m ago•1 comment

Ethnic minorities are driving America's startup boom

https://www.economist.com/finance-and-economics/2026/02/12/ethnic-minorities-are-driving-americas...
1•andsoitis•10m ago•0 comments

Authoring, simulating, and testing dynamic human-AI group conversations

https://research.google/blog/beyond-one-on-one-authoring-simulating-and-testing-dynamic-human-ai-...
1•gmays•11m ago•0 comments

PostgreSQL v19: Password expiration warnings

https://hexacluster.ai/blog/postgresql-v19-password-expiration-warnings
1•avivallssa•14m ago•0 comments

Show HN: Khaos – Every AI agent I tested broke in under 30 seconds

1•exordex•16m ago•0 comments

How Are Amps Modeled? [video]

https://www.youtube.com/watch?v=9YL8pwF7Mnc
2•dsego•19m ago•0 comments

What 1.4M emails reveal about America's most notorious sex offender

https://www.economist.com/interactive/international/2026/02/12/inside-epsteins-network
1•doener•20m ago•0 comments

Simile: The Simulation Company

https://twitter.com/joon_s_pk/status/2022023097017421874
1•jaehong747•21m ago•0 comments

Elide is an all-in-one, AI-native, open source software runtime

https://elide.dev/
2•shirian•23m ago•0 comments

The March Cliff: Why the 2026 Economic Collapse Is Different

https://ramakanth-d.medium.com/the-march-cliff-why-the-2026-economic-collapse-is-different-e1c619...
1•playhard•25m ago•1 comment

Welcome to the Great Regression

https://www.bloomberg.com/opinion/newsletters/2026-02-12/the-us-risks-a-great-regression
1•petethomas•26m ago•0 comments

Judge rules that LLM provided legal advice is open to discovery [pdf]

https://storage.courtlistener.com/recap/gov.uscourts.nysd.652138/gov.uscourts.nysd.652138.22.0.pdf
2•stingrae•27m ago•0 comments

My hot take on vibe coding for PMs

https://www.ddmckinnon.com/2026/02/11/my-%f0%9f%8c%b6-take-on-vibe-coding-for-pms/
1•awaxman11•30m ago•0 comments

AI: Brainrot Inducer or Cognitive Multiplier?

https://www.cjroth.com/blog/2026-02-12-brainrot
1•thoughtfulchris•31m ago•0 comments

Deft – a class and interface system for Clojure [video]

https://www.youtube.com/watch?v=dlW6YzwUZ-M
1•sammy0910•31m ago•0 comments

AI and consciousness: from objective descriptions to 'level zero'

https://randomseed.io/txt/ai-and-consciousness/
1•siefca•32m ago•1 comment

Cloudflare adds real-time Markdown rendering for AI agents

https://blog.cloudflare.com/markdown-for-agents/
5•thestackfox•34m ago•2 comments

A Read-Only Philosophical Archive on Restraint and AI Ethics

https://coexilia.io/coexilian-documents/
1•aegissolis•34m ago•1 comment

RFK Jr. food pyramid site links to Grok, which says you shouldn't trust RFK Jr

https://arstechnica.com/health/2026/02/rfk-jr-food-pyramid-site-links-to-grok-which-says-you-shou...
3•doener•34m ago•2 comments

Skip the Tips: A game to select "No Tip" but dark patterns try to stop you

https://skipthe.tips/
4•randycupertino•34m ago•2 comments

Amazon's Ring cancels Flock partnership amid Super Bowl ad backlash

https://www.cnbc.com/2026/02/12/amazons-ring-cancels-flock-partnership-amid-super-bowl-ad-backlas...
2•zzzeek•38m ago•0 comments

Z-Image Implemented in NCNN Vulkan

https://github.com/nihui/zimage-ncnn-vulkan
2•luyu_wu•40m ago•0 comments

Show HN: I taught AI to remember. Then it warned me

https://github.com/Relic-Studios/ISSA-Repository
1•relicstudios•41m ago•0 comments

What happens when capability decouples from credentials?

2•falsework•41m ago•2 comments

Bryan Johnson's Immortals program costs $1M. How to DIY it <1% of the price

https://www.empirical.health/blog/bryan-johnson-immortals-program-diy/
1•brandonb•42m ago•0 comments

True, Relevant, and Wrong: The Applicability Problem in RAG

https://www.pinecone.io/learn/series/beyond-retrieval/rag-applicability-problem/
2•gk1•43m ago•0 comments

Coinbase Posts $667M Net Loss, Revenue Declines 20%

https://www.bloomberg.com/news/articles/2026-02-12/coinbase-posts-667-million-loss-sees-revenue-t...
5•petethomas•45m ago•0 comments