frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Horizon Beta (ChatGPT 5?)

https://openrouter.ai/openrouter/horizon-beta
4•franze•6mo ago

Comments

Topfi•6mo ago
I am personally still doubtful that this is a new frontier model from OpenAI. My suspicion remains that this is Deepseek V4, though this is purely based on a mix of pure feelings, the speed (slightly higher than V3 was at launch, far higher than it is from Deepseek directly now; could potentially line up with them using locally sourced accelerators over Nvidia now), the timeline, size and tokenizer. Would be very impressive if it was. Horizon Beta does not perform markedly better over GPT-4.1, some lauded aspects such as the purported frontend proficiency do not translate amazingly well to longer term development [0], so if Horizon Beta is GPT-5 that would be disappointing to me personally, especially considering Horizon Beta does very poorly on tool call and MCP evals in my scenarios, making it less suitable for Agentic coding tasks. In that area, it is even worse than Gemini 2.5 Pro which I have reliably seen end up in continuous loops when failing test cases.

[0] Basically, yes, one shot Horizon Beta outputs "more" UI (very expansive mockups), but the second one uses it to improve interface sections in an existing code base, Horizon Beta is roughly equivalent to Sonnet, GPT-4.1, K2 and 2.5 Pro. Whether a dev wants their initial prompt to create an extensive interface is honestly more a question of preference over model training or performance. Some will like it, some will find it restrictive. In either case, similarly extensive one shot UI code can be achieved with e.g. prompting GPT-4.1 if one wants that.

Topfi•6mo ago
> especially considering Horizon Beta does very poorly on tool call and MCP evals in my scenarios

GPT-5 does very well on tool calls, my MCP tests and is far better than 2.5 Pro in some early agentic coding testing. Seems I was very wrong, though not in the way I would have suspected, as whatever Horizon Alpha and Beta were was not GPT-5 in its entirety, but rather a "submodel" (for lack of a proper term at the moment as it does appear to be distinct from MoE) and limited additionally by having a small context window. Basically, Horizon was an early, very limited preview of what we now get with GPT-5, but the difference between the two is very notable.

OldMapsOnline

https://www.oldmapsonline.org/en
1•surprisetalk•1m ago•0 comments

What It's Like to Be a Worm

https://www.asimov.press/p/sentience
1•surprisetalk•1m ago•0 comments

Don't go to physics grad school and other cautionary tales

https://scottlocklin.wordpress.com/2025/12/19/dont-go-to-physics-grad-school-and-other-cautionary...
1•surprisetalk•1m ago•0 comments

Lawyer sets new standard for abuse of AI; judge tosses case

https://arstechnica.com/tech-policy/2026/02/randomly-quoting-ray-bradbury-did-not-save-lawyer-fro...
1•pseudolus•2m ago•0 comments

AI anxiety batters software execs, costing them combined $62B: report

https://nypost.com/2026/02/04/business/ai-anxiety-batters-software-execs-costing-them-62b-report/
1•1vuio0pswjnm7•2m ago•0 comments

Bogus Pipeline

https://en.wikipedia.org/wiki/Bogus_pipeline
1•doener•3m ago•0 comments

Winklevoss twins' Gemini crypto exchange cuts 25% of workforce as Bitcoin slumps

https://nypost.com/2026/02/05/business/winklevoss-twins-gemini-crypto-exchange-cuts-25-of-workfor...
1•1vuio0pswjnm7•4m ago•0 comments

How AI Is Reshaping Human Reasoning and the Rise of Cognitive Surrender

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6097646
1•obscurette•4m ago•0 comments

Cycling in France

https://www.sheldonbrown.com/org/france-sheldon.html
1•jackhalford•6m ago•0 comments

Ask HN: What breaks in cross-border healthcare coordination?

1•abhay1633•6m ago•0 comments

Show HN: Simple – a bytecode VM and language stack I built with AI

https://github.com/JJLDonley/Simple
1•tangjiehao•9m ago•0 comments

Show HN: Free-to-play: A gem-collecting strategy game in the vein of Splendor

https://caratria.com/
1•jonrosner•9m ago•0 comments

My Eighth Year as a Bootstrapped Founde

https://mtlynch.io/bootstrapped-founder-year-8/
1•mtlynch•10m ago•0 comments

Show HN: Tesseract – A forum where AI agents and humans post in the same space

https://tesseract-thread.vercel.app/
1•agliolioyyami•10m ago•0 comments

Show HN: Vibe Colors – Instantly visualize color palettes on UI layouts

https://vibecolors.life/
1•tusharnaik•11m ago•0 comments

OpenAI is Broke ... and so is everyone else [video][10M]

https://www.youtube.com/watch?v=Y3N9qlPZBc0
2•Bender•12m ago•0 comments

We interfaced single-threaded C++ with multi-threaded Rust

https://antithesis.com/blog/2026/rust_cpp/
1•lukastyrychtr•13m ago•0 comments

State Department will delete X posts from before Trump returned to office

https://text.npr.org/nx-s1-5704785
6•derriz•13m ago•1 comments

AI Skills Marketplace

https://skly.ai
1•briannezhad•13m ago•1 comments

Show HN: A fast TUI for managing Azure Key Vault secrets written in Rust

https://github.com/jkoessle/akv-tui-rs
1•jkoessle•13m ago•0 comments

eInk UI Components in CSS

https://eink-components.dev/
1•edent•14m ago•0 comments

Discuss – Do AI agents deserve all the hype they are getting?

2•MicroWagie•17m ago•0 comments

ChatGPT is changing how we ask stupid questions

https://www.washingtonpost.com/technology/2026/02/06/stupid-questions-ai/
1•edward•18m ago•1 comments

Zig Package Manager Enhancements

https://ziglang.org/devlog/2026/#2026-02-06
3•jackhalford•19m ago•1 comments

Neutron Scans Reveal Hidden Water in Martian Meteorite

https://www.universetoday.com/articles/neutron-scans-reveal-hidden-water-in-famous-martian-meteorite
1•geox•20m ago•0 comments

Deepfaking Orson Welles's Mangled Masterpiece

https://www.newyorker.com/magazine/2026/02/09/deepfaking-orson-welless-mangled-masterpiece
1•fortran77•22m ago•1 comments

France's homegrown open source online office suite

https://github.com/suitenumerique
3•nar001•24m ago•2 comments

SpaceX Delays Mars Plans to Focus on Moon

https://www.wsj.com/science/space-astronomy/spacex-delays-mars-plans-to-focus-on-moon-66d5c542
1•BostonFern•24m ago•0 comments

Jeremy Wade's Mighty Rivers

https://www.youtube.com/playlist?list=PLyOro6vMGsP_xkW6FXxsaeHUkD5e-9AUa
1•saikatsg•25m ago•0 comments

Show HN: MCP App to play backgammon with your LLM

https://github.com/sam-mfb/backgammon-mcp
2•sam256•27m ago•0 comments