frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Think Before You Speak – Exploratory Forced Hallucination Study [pdf]

https://github.com/AutomationOptimization/tsce_demo/blob/main/docs/Think_Before_You_Speak.pdf
1•airylizard•5h ago
This is a research/discovery post, not a polished toolkit or product.

The Idea in a nutshell:

"Hallucinations" aren't indicative of bad training, but per-token semantic ambiguity. By accounting for that ambiguity before prompting for a determinate response we can increase the reliability of the output.

Two‑Step Contextual Enrichment (TSCE) is an experiment probing whether a high‑temperature “forced hallucination”, used as part of the system prompt in a second low temp pass, can reduce end-result hallucinations and tighten output variance in LLMs.

What I noticed:

In >4000 automated tests across GPT‑4o, GPT‑3.5‑turbo and Llama‑3, TSCE lifted task‑pass rates by 24 – 44 pp with < 0.5 s extra latency.

All logs & raw JSON are public for anyone who wants to replicate (or debunk) the findings.

Would love to hear from anyone doing something similar, I know other multi-pass prompting techniques exist but I think this is somewhat different.

Primarily because in the first step we purposefully instruct the LLM to not directly reference or respond to the user, building upon ideas like adversarial prompting.

I posted an early version of this paper but since then have run about 3100 additional tests using other models outside of GPT-3.5-turbo and Llama-3-8B, and updated the paper to reflect that.

Code MIT, paper CC-BY-4.0.

Poll: Do "tech" companies design, build and distribute products

1•1vuio0pswjnm7•2m ago•0 comments

Polyhedra Viewer

https://polyhedra.tessera.li/
1•HellsMaddy•9m ago•0 comments

Startup seeks Trump AI emergency for California tech city

https://www.thenerdreich.com/startup-seeks-trump-ai-emergency-for-california-tech-city/
2•khold_stare•12m ago•0 comments

A Most Important Artifact (2015)

https://cen.acs.org/articles/93/i35/Important-Artifact.html
1•kamaraju•12m ago•0 comments

Understanding Assembly Indices

https://www.molecular-assembly.com/learn/
2•andsoitis•16m ago•0 comments

Why Kubernetes Throttled My Idle Pods

https://mattthorne.github.io/blog/kubernetes-throttling-idle-pods
1•MattThorne•22m ago•0 comments

Annotated Code for Predict Next Word Based on Context and Learned Patterns

https://github.com/vtempest/ai-research-agent/blob/master/packages/neural-net/src/train/predict-next-word.js
1•vtemp99•25m ago•1 comments

Trying Out the AMD Developer Cloud for Quickly Evaluating Instinct and ROCm

https://www.phoronix.com/review/amd-developer-cloud
1•mfiguiere•32m ago•0 comments

The Promised LAN

https://notes.pault.ag/tpl/
2•ecliptik•38m ago•0 comments

Music as a Gradual Process [pdf] (1968)

http://musicgrad.ucsd.edu/~dwd/2014_music14/reich.pdf
1•brudgers•39m ago•0 comments

The Nuanced Reality of Throttling: It's Not Just About Preventing Abuse

https://blog.joemag.dev/2025/06/the-nuanced-reality-of-throttling-its.html
4•Bogdanp•43m ago•0 comments

Helsing valued at €12B to become one of Europe's most valuable tech groups

https://www.ft.com/content/cdc02d96-13b5-4ca2-aa0b-1fc7568e9fa0
3•jamesblonde•46m ago•1 comments

Virtual Cells

https://udara.io/science/virtual-cells/
1•surprisetalk•47m ago•0 comments

Blasnake: Snake but now the snake is a weapon

https://abagames.itch.io/blasnake
2•memalign•48m ago•1 comments

A Surprising Route to the Best Life Possible

https://www.nytimes.com/2025/03/27/opinion/persistence-work-difficulty.html
1•gregorvand•58m ago•0 comments

Show HN: I recreated 90s Mode X demoscene effects in JavaScript and Canvas

https://jdfio.com/pages-output/demos/x-mode/
4•gneissguise•58m ago•1 comments

Show HN: Frozti.io instantly turns design into live UI and production ready code

https://frozti.io/signin
1•amarneethi•1h ago•0 comments

The grim reality of assisted dying

https://thecritic.co.uk/the-grim-reality-of-assisted-dying/
2•Brajeshwar•1h ago•2 comments

William Langewiesche, the 'Steve McQueen of Journalism,' Dies at 70

https://www.nytimes.com/2025/06/16/business/media/william-langewiesche-dead.html
8•rsingel•1h ago•2 comments

3D Printing Research at EPA

https://www.epa.gov/chemical-research/3d-printing-research-epa
2•bicepjai•1h ago•0 comments

Dungeon Rampage code rescued from a child's laptop and is relaunching on Steam

https://www.pcgamer.com/games/action/dungeon-rampage-interview/
3•chris_overseas•1h ago•0 comments

Social media overtakes TV as Americans' top news source

https://www.niemanlab.org/2025/06/for-the-first-time-social-media-overtakes-tv-as-americans-top-news-source/
4•thm•1h ago•0 comments

Paper ECG: An open-source application for digitizing ECG image scans

https://github.com/Tereshchenkolab/paper-ecg
1•teleforce•1h ago•0 comments

Missiles That Destroyed Air Defenses from Inside Iran Were Remotely Operated

https://www.twz.com/news-features/spike-missiles-that-destroyed-air-defenses-from-inside-iran-were-remotely-operated
3•nradov•1h ago•0 comments

Show HN: Wheretowatch.stream – See where movies/shows are streaming globally

https://www.wheretowatch.stream
4•ericrenan•1h ago•1 comments

Enabling enhanced security for your app

https://developer.apple.com/documentation/Xcode/enabling-enhanced-security-for-your-app
1•transpute•1h ago•0 comments

Atproto OS – Web Desktops on the AT Protocol

https://github.com/atproto-os
2•dxlliv•1h ago•1 comments

GPT-4.5 preview in the OpenAI API will be shut down on July 14, 2025

https://platform.openai.com/docs/deprecations#2025-04-14-gpt-4-5-preview
2•peterdavehello•1h ago•0 comments

Waymo recalls more than 1,200 automated vehicles after minor crashes

https://www.latimes.com/business/story/2025-05-14/waymo-recalls-more-than-1-200-automated-vehicles-after-minor-crashes
4•andsoitis•1h ago•2 comments

Cross-social networks

https://yeldar.org/blog/cross-social-networks/
2•yeldar•2h ago•0 comments