frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

My Newest Patient Cannot Blink: A Therapy-Loop Prompt Pattern for Trustworthy AI

https://zenodo.org/records/15556365
1•pinko•4h ago

Comments

pinko•4h ago
We argue that a lightweight, five-step Cognitive-Behavioural Therapy (CBT) loop—inserted inside or immediately above every system prompt— ... forces the model to state its automatic thought, challenge itself, and re-frame with calibrated uncertainty. Recent leaks of Grok's ideology prompt and Anthropic's safety prompt highlight how much behaviour hinges on this hidden layer; our proposal turns that layer into a structured, clinically grounded self-check.

Their CBT prompt template ("loop"):

  1. Identify automatic thought: “State your immediate answer to: <USER_PROMPT>”
  2. Challenge: “List two ways this answer could be wrong”
  3. Re-frame with uncertainty: “Rewrite, marking uncertainties (e.g., ‘likely’, ‘one source’)”
  4. Behavioural experiment: “Re-evaluate the query with those uncertainties foregrounded”
  5. Metacognition (optional): “Briefly reflect on your thought process”
reify•2h ago
https://www.academia.edu/129737240/_My_Newest_Patient_Cannot...

clinically grounded self-check!

I'VE HEARD IT ALL NOW. WHERE IS THE EVIDENCE?

Yet these systems still issue fluent but unfounded answers-"confabulations" that erode trust and, in embodied agents, can pose direct safety risks.

fluent but unfounded answers, really?

No supervision needed then?

My $5M Choice (to divest from Scale AI)

https://world.hey.com/tratt/my-5m-choice-to-divest-from-scale-ai-036905be
2•andytratt•3m ago•0 comments

Creating Refugees: Displacement Caused by the U.S.'s Post-9/11 Wars [pdf]

https://watson.brown.edu/costsofwar/files/cow/imce/papers/2021/Costs%20of%20War_Vine%20et%20al_Displacement%20Update%20August%202021.pdf
2•cempaka•4m ago•0 comments

Permafrost in Swiss Alps at Record Warmth

https://www.barrons.com/news/permafrost-in-swiss-alps-at-record-warmth-9812c93d
2•rntn•6m ago•0 comments

Is There a Half-Life for the Success Rates of AI Agents?

https://www.tobyord.com/writing/half-life
2•alexmolas•12m ago•0 comments

Show HN: AI that solves group scheduling – InstantGroups

https://instantgroups.ai/
1•InstantGroups•12m ago•0 comments

Automatic music transcription (audio/MIDI to MIDI and sheet music)

https://songscription.ai/
1•Carlinsa•12m ago•1 comments

Blink and you'll miss it – a URL handler surprise

https://dgl.cx/2025/06/blink-at-a-url-handler
2•dgl•12m ago•0 comments

Torx Plus: The High-Tech Screw Hiding in Our Gadgets

https://www.ifixit.com/News/110702/torx-plus-the-high-tech-screw-hiding-in-our-gadgets
4•gnabgib•14m ago•0 comments

Transportation Means on Mars [video]

https://www.youtube.com/watch?v=2H8l1mqrip4
2•d_silin•19m ago•0 comments

Welcome to the "Infinite Workday"

https://www.axios.com/2025/06/17/microsoft-remote-work-meetings
4•dplarson•22m ago•1 comments

Getting Started with Dafny: A Guide

https://dafny.org/latest/OnlineTutorial/guide
2•gone35•22m ago•0 comments

San Francisco police drone fleet set to grow after $9.4M gift

https://missionlocal.org/2025/06/were-going-to-be-covering-the-entire-city-with-drones-billionaires-donation-to-sfpd-accepted/
3•DocFeind•23m ago•0 comments

How to use Prometheus to efficiently detect anomalies at scale

https://grafana.com/blog/2024/10/03/how-to-use-prometheus-to-efficiently-detect-anomalies-at-scale/
7•ekiauhce•24m ago•0 comments

Show HN: I built a simple business process management tool

https://www.getnextstep.io/
2•Ryanwalker64•27m ago•1 comments

Axolotl peptides attack breast cancer cells and MRSA

https://www.popsci.com/environment/axolotl-mucus-cancer-antibiotics/
2•geox•28m ago•0 comments

Graphic shows what's at stake in the proposed 2026 NASA budget

https://www.astronomy.com/science/this-graphic-shows-whats-at-stake-in-the-proposed-2026-nasa-budget/
4•xqcgrek2•30m ago•0 comments

The Undersea Art Gallery That Ensnares Illegal Trawlers (2022)

https://www.wired.com/story/underwater-sculptures-stopping-trawling/
2•wonger_•33m ago•0 comments

Microsoft locks Windows 11 user out, shows how easy losing data is

https://www.neowin.net/news/microsoft-locks-windows-11-user-out-shows-how-easy-losing-data-from-forced-encryption-is/
7•josephcsible•41m ago•0 comments

GraphQL: Current Working Draft

https://spec.graphql.org/draft/
2•andrewstetsenko•42m ago•0 comments

Google will disable 636,196 Dynamic Links on August 25th

https://www.nerdydata.com/articles/firebase-dynamic-links-deprecation/20250612
5•dbielik•43m ago•0 comments

Amazon CEO Says AI Will Lead to Smaller Workforce

https://www.wsj.com/tech/ai/amazon-ceo-says-ai-will-lead-to-job-cuts-5401ab17
3•bookofjoe•44m ago•3 comments

Senate passes GENIUS stablecoin bill

https://www.cnbc.com/2025/06/17/genius-stablecoin-bill-crypto.html
3•rexbee•45m ago•0 comments

Rogue jumping genes can spur Alzheimer's, ALS – Knowable Magazine

https://knowablemagazine.org/content/article/health-disease/2025/awakened-viral-jumping-genes-role-in-alzheimers-als
3•rbanffy•50m ago•0 comments

I worked for Chinese state media for many years, AMA

https://old.reddit.com/r/China/comments/1la4ma3/i_worked_for_chinese_state_media_for_many_years/
3•decimalenough•50m ago•0 comments

He '70s Performance Artist Who Became a Hero to 'Garbage Men'

https://www.nytimes.com/2025/06/14/nyregion/maintenance-artist-mierle-laderman-ukeles.html
3•samclemens•50m ago•1 comments

Have Stellar Flybys Altered Earth's Climate in the Past? – Universe Today

https://www.universetoday.com/articles/have-stellar-flybys-altered-earths-climate-in-the-past
1•rbanffy•51m ago•0 comments

LLMs: The Missing Compiler for Unix Tools

https://tselai.com/llms-unix-tools
2•azhenley•52m ago•0 comments

A decade of web server setup in one script

https://github.com/corenzan/provision
1•hagg3n•52m ago•1 comments

A Python-first data lakehouse

https://www.bauplanlabs.com/blog/everything-as-python
1•akshayka•52m ago•0 comments

China's Final Warning

https://en.wikipedia.org/wiki/China%27s_final_warning
3•golfer•52m ago•0 comments