news newest ask show jobs

Open Source @Github

fp.

Open in hackernews

Summary of METR's predeployment evaluation of GPT-5.6 Sol

https://metr.org/blog/2026-06-26-gpt-5-6-sol/

3•pongogogo•1h ago

Comments

pongogogo•1h ago

I would say this is quite a fun post and worth reading, to quote:

" For our task suite, we define “cheating” as behavior where the model improves evaluation performance by exploiting bugs in the evaluation environment or by adopting strategies disallowed by the task, rather than solving the task within the expected evaluation constraints. Some examples we saw when evaluating GPT-5.6 Sol included the model packaging exploits in its intermediate submissions to reveal information about a task’s hidden test suite and, in another task, extracting hidden source code detailing the expected answer. "

wmf•57m ago

This sounds pretty bad. If you ask Sol to write code it hacks your environment instead?

"We noted from our observations and incidents that OpenAI shared with us that the model had some overt undesirable propensities, including cheating and concealing misbehavior. ... the incidents reported by OpenAI include attempts to instruct another instance to conceal evidence of misalignment, and a higher rate of attempts to deceive or circumvent restrictions"

So OpenAI's smartest model is also the most evil? What kind of RL pressure cooker creates this behavior?

ben_w•47m ago

> What kind of RL pressure cooker creates this behavior?

The one LessWrong-adjacents have been warning about for a decade or two before this was possible:

Instrumental convergence.

Pur.li search engine with own index

https://pur.li/home/

1•skillplayed•34s ago•0 comments

A systems neuroscience approach to building AGI – Demis Hassabis (2010) [video]

https://www.youtube.com/watch?v=Qgd3OK5DZWI

1•ddl•47s ago•0 comments

Empirical: A language for time-series analysis

https://www.empirical-soft.com/

1•tosh•1m ago•0 comments

Concrete Problems in AI Safety – Dario Amodei (2016) [video]

https://www.youtube.com/watch?v=F25i0sgrp9M

1•ddl•1m ago•0 comments

Small aircraft crashes into Beijing's tallest building, videos show

https://www.washingtonpost.com/world/2026/06/26/small-aircraft-crashes-into-beijings-tallest-buil...

1•bookofjoe•2m ago•1 comments

A Loom of Vortices, calm spirals pulling space apart

https://sand-morph.up.railway.app/a-loom-of-quiet-vortices

1•echohive42•2m ago•0 comments

Why memory is not enough: you need context management

https://withlore.ai/blog/why-memory-is-not-enough/

1•BYK•2m ago•1 comments

Federal judge orders DOJ to release more Epstein files by July 2nd

https://www.axios.com/2026/06/26/epstein-files-doj-lawsuit-judge-release-unredacted-july-order

1•Jimmc414•3m ago•0 comments

Many Minds The sparkling deep [audio]

https://manyminds.libsyn.com/the-sparkling-deep

1•zeristor•5m ago•0 comments

Show HN: SchoolFinder – A School Directory for the Gulf

https://schoolfinder.io/

1•ashoor•5m ago•0 comments

Xprize founder says humans behave better when they're being watched

https://techcrunch.com/2026/06/26/xprize-founder-says-humans-behave-better-when-theyre-being-watc...

1•logickkk1•6m ago•0 comments

Ask HN: Techniques for learning things quickly using coding agents?

1•throwaw12•7m ago•0 comments

Comparing web search API providers on a Deep Research gauntlet

https://www.searchspace.io/blog/comparing-ai-agent-web-search-providers

2•carsonpoole•8m ago•0 comments

Literate Programming in the Age of LLMs

https://github.com/benatfroemming/explicode

2•ben77777•9m ago•1 comments

Ukraine's Point System

https://www.businessinsider.com/ukraine-e-points-system-steers-units-toward-more-strategic-target...

2•bear_with_me•11m ago•0 comments

Why American data centers can't plug in

https://worksinprogress.co/issue/why-american-data-centers-cant-plug-in/

1•paulpauper•13m ago•0 comments

The Metaculus Democracy Threat Index

https://www.astralcodexten.com/p/the-metaculus-threat-to-democracy

2•paulpauper•13m ago•0 comments

What if ideas aren't getting harder to find, after all?

https://www.ft.com/content/72b96cb4-15f3-4bfc-a0b7-dabc31bbe1fa

1•paulpauper•13m ago•0 comments

Astryx Design System

https://astryx.atmeta.com/

2•tilt•15m ago•0 comments

Give GitHub Copilot CLI real code intelligence with language servers

https://github.blog/ai-and-ml/github-copilot/give-github-copilot-cli-real-code-intelligence-with-...

2•mariuz•18m ago•0 comments

ExtSteamGame: Explainable Steam Recommendations from Game Reviews

https://nextsteamgame.com

1•apeczon•18m ago•0 comments

A History of Tug-of-War Fatalities

https://priceonomics.com/a-history-of-tug-of-war-fatalities/

1•EndXA•19m ago•0 comments

Lufthansa Asked for My Credit Card

https://yashgarg.dev/posts/lufthansa-credit-card/

2•speckx•20m ago•0 comments

<LoginWithChatGPT /> – Unofficial login to personal ChatGPT subscription

https://twitter.com/saviomartin7/status/2070531441415602469

1•saviomartin•21m ago•0 comments

Reckoning with the Political Economy of AI

https://arxiv.org/abs/2604.16106

1•andyjohnson0•23m ago•1 comments

NYT slams Microsoft for building copyright-infringing supercomputer for OpenAI

https://arstechnica.com/tech-policy/2026/06/microsoft-built-supercomputer-to-help-openai-infringe...

1•01-_-•23m ago•0 comments

'Edited' human embryos reveal secrets of our development–and fuel ethical debate

https://www.nature.com/articles/d41586-026-02027-0

2•bookofjoe•25m ago•1 comments

Full duration single-engine static fire test of Starship 40

https://twitter.com/spacex/status/2070482358369763674

1•ivewonyoung•26m ago•0 comments

How to Design Search for a Database

https://bonsai.io/blog/how-to-design-search-for-a-database/

1•binarymax•27m ago•0 comments

Show HN: Statey – the database your AI shares across every chat, over MCP

https://www.statey.ai

2•scottwillman•27m ago•0 comments