Horizon Beta (ChatGPT 5?)

https://openrouter.ai/openrouter/horizon-beta

4•franze•6mo ago

Comments

Topfi•6mo ago

I am personally still doubtful that this is a new frontier model from OpenAI. My suspicion remains that this is Deepseek V4, though this is purely based on a mix of pure feelings, the speed (slightly higher than V3 was at launch, far higher than it is from Deepseek directly now; could potentially line up with them using locally sourced accelerators over Nvidia now), the timeline, size and tokenizer. Would be very impressive if it was. Horizon Beta does not perform markedly better over GPT-4.1, some lauded aspects such as the purported frontend proficiency do not translate amazingly well to longer term development [0], so if Horizon Beta is GPT-5 that would be disappointing to me personally, especially considering Horizon Beta does very poorly on tool call and MCP evals in my scenarios, making it less suitable for Agentic coding tasks. In that area, it is even worse than Gemini 2.5 Pro which I have reliably seen end up in continuous loops when failing test cases.

[0] Basically, yes, one shot Horizon Beta outputs "more" UI (very expansive mockups), but the second one uses it to improve interface sections in an existing code base, Horizon Beta is roughly equivalent to Sonnet, GPT-4.1, K2 and 2.5 Pro. Whether a dev wants their initial prompt to create an extensive interface is honestly more a question of preference over model training or performance. Some will like it, some will find it restrictive. In either case, similarly extensive one shot UI code can be achieved with e.g. prompting GPT-4.1 if one wants that.

Topfi•6mo ago

> especially considering Horizon Beta does very poorly on tool call and MCP evals in my scenarios

GPT-5 does very well on tool calls, my MCP tests and is far better than 2.5 Pro in some early agentic coding testing. Seems I was very wrong, though not in the way I would have suspected, as whatever Horizon Alpha and Beta were was not GPT-5 in its entirety, but rather a "submodel" (for lack of a proper term at the moment as it does appear to be distinct from MoE) and limited additionally by having a small context window. Basically, Horizon was an early, very limited preview of what we now get with GPT-5, but the difference between the two is very notable.

Show HN: Identifier for files and directories (like ISBN for Books)

Show HN: Holy Grail: Open-Source Autonomous Development Agent

Show HN: Minecraft Creeper meets 90s Tamagotchi

Show HN: Termiteam – Control center for multiple AI agent terminals

The only U.S. particle collider shuts down

Ask HN: Why do purchased B2B email lists still have such poor deliverability?

Show HN: Remotion directory (videos and prompts)

Portable C Compiler

Show HN: Kokki – A "Dual-Core" System Prompt to Reduce LLM Hallucinations

Software Engineering Transformation 2026

Microsoft purges Win11 printer drivers, devices on borrowed time

Lunch with the FT: Tarek Mansour

Old Mexico and her lost provinces (1883)

'AI' is a dick move, redux

The source code was the moat. But not anymore

Does anyone else feel like their inbox has become their job?

An AI model that can read and diagnose a brain MRI in seconds

Dev with 5 of experience switched to Rails, what should I be careful about?

AlphaFace: High Fidelity and Real-Time Face Swapper Robust to Facial Pose

Scientists discover “levitating” time crystals that you can hold in your hand

Rammstein – Deutschland (C64 Cover, Real SID, 8-bit – 2019) [video]

Tell HN: Yet Another Round of Zendesk Spam

Postgres Message Queue (PGMQ)

Show HN: Django-rclone: Database and media backups for Django, powered by rclone

NY lawmakers proposed statewide data center moratorium

OpenClaw AI chatbots are running amok – these scientists are listening in

Show HN: AI agent forgets user preferences every session. This fixes it

Introduce the Vouch/Denouncement Contribution Model

Show HN: SSHcode – Always-On Claude Code/OpenCode over Tailscale and Hetzner

Microsoft appointed a quality czar. He has no direct reports and no budget