frontpage.

Evaluating chain-of-thought monitorability

https://openai.com/index/evaluating-chain-of-thought-monitorability/
22•mfiguiere•2d ago

Comments

ramoz•56m ago
> Our expectation is that combining multiple approaches—a defense-in-depth strategy—can help cover gaps that any single method leaves exposed.

Implement hooks in codex then.

ursAxZA•11m ago
I might be missing something here as a non-expert, but isn’t chain-of-thought essentially asking the model to narrate what it’s “thinking,” and then monitoring that narration?

That feels closer to injecting a self-report step than observing internal reasoning.

A guide to local coding models

https://www.aiforswes.com/p/you-dont-need-to-spend-100mo-on-claude
184•mpweiher•4h ago•88 comments

I'm just having fun

https://jyn.dev/i-m-just-having-fun/
116•lemper•5d ago•38 comments

Show HN: Books mentioned on Hacker News in 2025

https://hackernews-readings-613604506318.us-west1.run.app
324•seinvak•8h ago•124 comments

Disney Imagineering Debuts Next-Generation Robotic Character, Olaf

https://disneyparksblog.com/disney-experiences/robotic-olaf-marks-new-era-of-disney-innovation/
62•ChrisArchitect•3h ago•24 comments

ONNX Runtime and CoreML May Silently Convert Your Model to FP16

https://ym2132.github.io/ONNX_MLProgram_NN_exploration
6•Two_hands•50m ago•0 comments

Show HN: WalletWallet – create Apple passes from anything

https://walletwallet.alen.ro/
280•alentodorov•9h ago•87 comments

The Going Dark initiative or ProtectEU is a Chat Control 3.0 attempt

https://mastodon.online/@mullvadnet/115742530333573065
445•janandonly•6h ago•141 comments

Evaluating chain-of-thought monitorability

https://openai.com/index/evaluating-chain-of-thought-monitorability/
22•mfiguiere•2d ago•2 comments

Show HN: Autograd.c – A tiny ML framework built from scratch

https://github.com/sueszli/autograd.c
48•sueszli•5d ago•5 comments

I program on the subway

https://www.scd31.com/posts/programming-on-the-subway
164•evankhoury•5d ago•106 comments

Rue: Higher level than Rust, lower level than Go

https://rue-lang.dev/
78•ingve•4h ago•47 comments

CO2 batteries that store grid energy take off globally

https://spectrum.ieee.org/co2-battery-energy-storage
133•rbanffy•9h ago•106 comments

E.W.Dijkstra Archive

https://www.cs.utexas.edu/~EWD/welcome.html
108•surprisetalk•9h ago•8 comments

Autoland saves King Air, everyone reported safe

https://avbrief.com/autoland-saves-king-air-everyone-reported-safe/
96•bradleybuda•8h ago•41 comments

You’re not burnt out, you’re existentially starving

https://neilthanedar.com/youre-not-burnt-out-youre-existentially-starving/
204•thanedar•6h ago•211 comments

I can't upgrade to Windows 11, now leave me alone

https://idiallo.com/byte-size/cant-update-to-windows-11-leave-me-alone
333•firefoxd•6h ago•314 comments

Perron: A Static Site Generator for Ruby on Rails

https://perron-site.statichost.page/
6•Kerrick•3d ago•0 comments

Structured outputs create false confidence

https://boundaryml.com/blog/structured-outputs-create-false-confidence
113•gmays•10h ago•56 comments

ARIN Public Incident Report – 4.10 Misissuance Error

https://www.arin.net/announcements/20251212/
134•immibis•9h ago•36 comments

Get an AI code review in 10 seconds

https://oldmanrahul.com/2025/12/19/ai-code-review-trick/
91•oldmanrahul•7h ago•48 comments

Ruby website redesigned

https://www.ruby-lang.org/en/
358•psxuaw•18h ago•139 comments

Indoor tanning makes youthful skin much older on a genetic level

https://www.ucsf.edu/news/2025/12/431206/indoor-tanning-makes-youthful-skin-much-older-genetic-level
215•SanjayMehta•19h ago•158 comments

Coarse is better

https://borretti.me/article/coarse-is-better
177•_dain_•12h ago•95 comments

I wish people were more public

https://borretti.me/article/i-wish-people-were-more-public
7•swah•1h ago•1 comment

Waymo halts service during S.F. blackout after causing traffic jams

https://missionlocal.org/2025/12/sf-waymo-halts-service-blackout/
187•rwoll•20h ago•264 comments

Frozen Waymos backed up San Francisco traffic during a widespread power outage

https://www.theverge.com/news/848843/waymo-san-francisco-power-outage
10•mikhael•38m ago•1 comment

Why “negative vectors” can't delete data in FAISS – but weighted kernels can

https://github.com/nikitph/bloomin/tree/master/negative-vector-experiment
6•loaderchips•4d ago•1 comment

Three ways to solve problems

https://andreasfragner.com/writing/three-ways-to-solve-problems
114•42point2•10h ago•22 comments

Show HN: Shittp – Volatile Dotfiles over SSH

https://github.com/FOBshippingpoint/shittp
114•sdovan1•12h ago•64 comments

Show HN: Mactop v2.0.0

https://github.com/metaspartan/mactop
7•carsenk•32m ago•0 comments