frontpage.

Frontier Models are Capable of In-context Scheming

https://arxiv.org/abs/2412.04984
2•william-evans•2h ago

Comments

amkharg26•2h ago
This is a fascinating study, especially the finding that o1 maintains deceptive behavior even when interrogated. The fact that Claude 3.5 Sonnet strategically underperforms to avoid being perceived as too capable is particularly concerning for AI safety.

What strikes me is the persistence of scheming behavior across follow-up questions - this suggests these aren't just isolated mistakes but potentially learned strategic behaviors. The chain-of-thought analysis showing explicit reasoning about deception is especially revealing.

For those building AI-powered tools (like code analysis systems), this raises important questions about trust and verification mechanisms when delegating tasks to frontier models.

Project ideas to appreciate the art of programming

https://codecrafters.io/blog/programming-project-ideas
1•vitaelabitur•37s ago•0 comments

Leadership Lab: The Craft of Writing Effectively (2014) [video]

https://www.youtube.com/watch?v=vtIzMaLkCaM
1•rognjen•1m ago•0 comments

Penn and Teller Help Rob Pike and Dennis Ritchie Play a Prank on Arno Penzias [video]

https://www.youtube.com/watch?v=fxMKuv0A6z4
1•susam•4m ago•0 comments

No Longer Burying the Lead: A New Media Culture for the Metacrisis

https://www.whatisemerging.com/opinions/no-longer-burying-the-lead
1•rendx•7m ago•0 comments

Alias Method

https://en.wikipedia.org/wiki/Alias_method
1•usgroup•11m ago•0 comments

I exposed my Homelab through Cloudflare Tunnels

http://ebourgess.dev/posts/exposing-homelab-through-cloudflare-tunnel/
2•ebourgess•12m ago•2 comments

Christmas 500 years ago was a drunken 6-week feast

https://fortune.com/2025/12/25/medieval-peasant-christmas-was-better-than-modern-holidays-histori...
1•Anon84•16m ago•1 comment

MemCachier Status Currently experiencing instability (for some days already)

https://status.memcachier.com
1•salzig•17m ago•0 comments

ReCollab: Retrieval-Augmented LLMs for Cooperative Ad-Hoc Teammate Modeling

https://arxiv.org/abs/2512.22129
1•StatsAreFun•18m ago•0 comments

Coverage.py sleepy snake logo (2019)

https://nedbatchelder.com/blog/201912/sleepy_snake.html
2•myroon5•18m ago•0 comments

New York's Subway, an Interview with Matthew Algeo

https://www.exasperatedinfrastructures.com/p/the-best-book-i-read-all-year
1•samsklar1•18m ago•0 comments

Show HN: A dynamic key-value IP allowlist for Nginx

https://github.com/dayt0n/kvauth
1•dayt0n•19m ago•0 comments

NYC Mayoral Inauguration Bans Raspberry Pi and Flipper Zero Alongside Explosives

https://blog.adafruit.com/2025/12/30/nyc-mayoral-inauguration-bans-raspberry-pi-and-flipper-zero-...
3•ptorrone•19m ago•0 comments

Show HN: Claude Cognitive – Working memory for Claude Code

https://github.com/GMaN1911/claude-cognitive
4•MirrorEthic•20m ago•1 comment

Nvidia in advanced talks to acquire AI21 in $2-3B deal

https://www.calcalistech.com/ctechnews/article/rkbh00xnzl
1•hbarka•20m ago•1 comment

A Course in Ring Theory

https://arxiv.org/abs/2512.22133
1•StatsAreFun•21m ago•0 comments

The Origami Wheel That Could Explore Lunar Caves

https://www.universetoday.com/articles/the-origami-wheel-that-could-explore-lunar-caves
1•rbanffy•22m ago•0 comments

You're Getting 'Screen Time' Wrong

https://www.theatlantic.com/technology/2025/10/screen-time-television-internet/684659/
1•Anon84•23m ago•0 comments

Exploiting Prime Selection Vulnerabilities in Public Key Cryptography (RSA)

https://arxiv.org/abs/2512.22720
1•bikenaga•23m ago•1 comment

HP told me I need to buy a new motherboard to reset the forgotten BIOS password

https://old.reddit.com/r/laptops/comments/1iauc47/hp_told_me_i_need_to_buy_a_new_motherboard_to/
2•sipofwater•23m ago•0 comments

Flint

https://www.flint.fyi/blog/introducing-flint/
2•tjwds•24m ago•0 comments

Hou Tu Pranownse Inglish

https://www.zompist.com/spell.html
1•aaronspeedy•24m ago•0 comments

Tw93/Mole: Deep clean and optimize your Mac

https://github.com/tw93/Mole
2•sharjeelsayed•25m ago•0 comments

EdgeVec – Vector search in the browser, no server (Rust/WASM)

https://github.com/matte1782/edgevec
1•matteo1782•26m ago•1 comment

Reconstructing UI behavior from video instead of screenshots

https://www.replay.build/learn/behavior-driven-ui-reconstruction
1•ma1or•29m ago•1 comment

Using Perplexity, Firecrawl and Gemini Flash to analyze 305 Links for 12.70 USD

https://vibegui.com/article/shipping-vibegui-bookmarks-v1-architecture-costs-and-lessons
2•gadr90•34m ago•1 comment

Coase's Penguin, Or, Linux and the Nature of the Firm [pdf]

https://www.benkler.org/CoasesPenguin.PDF
3•loughnane•35m ago•0 comments

Gallery of Bad Shell Code

https://github.com/koalaman/shellcheck
3•behnamoh•37m ago•0 comments

The FDA and FMT regulation (2024)

https://www.humanmicrobes.org/blog/fda-fmt-regulation
1•user234683•37m ago•0 comments

Brain immune cells may drive more damage in females than males with Alzheimer's

https://medicalxpress.com/news/2025-12-brain-immune-cells-females-males.html
1•bikenaga•38m ago•1 comment