Ask HN: How are you preventing LLMs from hallucinating in real workflows?

1•Agent_Builder•1h ago
I recently tried building a small agent for coaching centers.

The idea was simple: a teacher uploads a syllabus or notes, and the agent generates a test paper from that material. The hard requirement was reliability. No invented questions, no drifting outside the syllabus.

Instead of trying to “fix” hallucinations with better prompts, I constrained the agent’s job very narrowly.

I defined:

- a fixed knowledge base (only the uploaded syllabus)
- explicit tools the agent was allowed to use
- a structured output format for the test paper
- a hardness distribution (for example 30% easy, 50% medium, 20% hard)
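
A minimal sketch of how constraints like these could be checked in code, assuming each generated question carries a difficulty label and the syllabus section it was drawn from. Every name here (`Question`, `DIFFICULTY_MIX`, `target_counts`, `validate_paper`) is hypothetical and is not taken from GTWY.ai or from the actual agent. It only checks labels, so it catches a question tagged with a section that does not exist, not a question whose text quietly drifts off-topic.

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical mix, in whole percentages, mirroring the 30/50/20 example.
DIFFICULTY_MIX = {"easy": 30, "medium": 50, "hard": 20}


@dataclass(frozen=True)
class Question:
    text: str
    difficulty: str      # expected to be a key of DIFFICULTY_MIX
    source_section: str  # syllabus section the question claims to come from


def target_counts(total: int, mix: dict[str, int]) -> dict[str, int]:
    """Turn a percentage mix into whole-question counts that sum to total."""
    counts = {level: total * pct // 100 for level, pct in mix.items()}
    # Hand any leftover questions to the levels with the largest remainder.
    by_remainder = sorted(mix, key=lambda lv: total * mix[lv] % 100, reverse=True)
    for level in by_remainder[: total - sum(counts.values())]:
        counts[level] += 1
    return counts


def validate_paper(
    questions: list[Question],
    syllabus_sections: set[str],
    mix: dict[str, int] = DIFFICULTY_MIX,
) -> list[str]:
    """Return a list of problems; an empty list means the paper passes."""
    problems = []
    for i, q in enumerate(questions, start=1):
        if q.source_section not in syllabus_sections:
            problems.append(f"Q{i}: section {q.source_section!r} is not in the syllabus")
        if q.difficulty not in mix:
            problems.append(f"Q{i}: unknown difficulty {q.difficulty!r}")
    expected = target_counts(len(questions), mix)
    actual = Counter(q.difficulty for q in questions)
    for level, want in expected.items():
        if actual[level] != want:
            problems.append(f"{level}: expected {want} questions, got {actual[level]}")
    return problems
```

With this mix, `target_counts(10, DIFFICULTY_MIX)` gives 3 easy, 5 medium, and 2 hard questions. A paper that fails `validate_paper` could be rejected or regenerated rather than shown to the teacher.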

Once those constraints were in place, the behavior changed a lot. The agent stopped being creative in the wrong places and consistently produced usable test papers. The quality improvement came from reducing freedom, not from changing models.

I built this using GTWY.ai, mainly because it let me wire together a knowledge base, step-level tool permissions, and model choice without writing a lot of glue code. But the interesting part for me wasn’t the platform; it was the pattern.
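
As a rough illustration of the pattern only, step-level tool permissions can be modeled as an allowlist that is checked before any tool call. This is a generic sketch, not GTWY.ai's API, and the post does not describe how that platform enforces permissions. The step names, tool names, and `call_tool` helper are all made up for the example.

```python
from typing import Callable


class ToolNotAllowed(Exception):
    """Raised when a step tries to use a tool outside its allowlist."""


# Hypothetical stand-in tools; a real agent would query the uploaded syllabus.
def search_syllabus(query: str) -> str:
    return f"sections matching {query!r}"


def format_question(text: str) -> str:
    return f"Q: {text.strip()}"


TOOLS: dict[str, Callable[[str], str]] = {
    "search_syllabus": search_syllabus,
    "format_question": format_question,
}

# Hypothetical workflow: each step may use only the tools listed for it.
STEP_TOOLS: dict[str, set[str]] = {
    "retrieve": {"search_syllabus"},
    "draft": {"search_syllabus", "format_question"},
    "finalize": {"format_question"},
}


def call_tool(step: str, name: str, arg: str) -> str:
    """Run a tool only if the current step is allowed to use it."""
    if name not in STEP_TOOLS.get(step, set()):
        raise ToolNotAllowed(f"step {step!r} may not call {name!r}")
    return TOOLS[name](arg)
```

The point of the design is that the check lives outside the model: a step with no entry in `STEP_TOOLS` gets no tools at all, regardless of what the prompt asks for.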

It made me wonder:

- Are others seeing similar results by narrowing agent scope instead of adding verification layers?
- Do constraints scale better than smarter models for production use cases?
- For education or other regulated domains, is this how people are actually shipping agents?

Curious what’s working for others in real deployments.
