Microsoft Paper: LLMs Corrupt Your Documents When You Delegate (Arxiv.org)

https://arxiv.org/abs/2604.15597

5•wuschel•1h ago

Comments

bediger4000•1h ago

even frontier models (...) corrupt an average of 25% of document content by the end of long workflows, with other models failing more severely

Wow, 25% corrupted seems like a lot. The abstract and the intro of this paper emphasize "documents" and it's Microsoft, so I assumed Word docs, but that's not true: they used a wide variety of things, including graphs, text files, possibly images, or some machine-readable description of textile weaving. A proofreader might not catch a 25% corrupted textile description file, or 25% corruption in a graph.

Is this "corruption" what we've all been taught to call "hallucinations" in text files?

jqpabc123•13m ago

Our analysis shows that current LLMs are unreliable delegates:

Who knew that a tool that relies on probability could make such a mess?

Criteria Enabling Cloud Computing Autonomy

https://www.bsi.bund.de/EN/Themen/Unternehmen-und-Organisationen/Informationen-und-Empfehlungen/E...

1•layer8•37s ago•0 comments

Oracle Corporation Emerges as AI Shot-Caller

http://tommyatomsjr.blogspot.com/2026/04/oracle-corporation-transforms-into-ai.html

1•paizono•47s ago•0 comments

Show HN: Open-source Semrush alternative just passed 1.6k stars

https://openseo.so/

1•bsenescu•1m ago•0 comments

The Duolingo taxi test–could being rude to the driver cost you your dream job?

https://phys.org/news/2026-04-duolingo-taxi-rude-driver-job.html

1•i7l•7m ago•0 comments

Anchor – Lisp→C compiler, no GC, hygienic macros, Chez Scheme at compile time

https://github.com/allenj12/anchor/tree/main

1•AnchorLang•7m ago•1 comments

Show HN: Lightport – AI gateway that makes LLM providers OpenAI-compatible

https://github.com/glama-ai/lightport

1•smokybay•8m ago•0 comments

L123: A Lotus 1-2-3–style terminal spreadsheet with modern Excel compatibility

https://github.com/duane1024/l123

2•duane1024•8m ago•0 comments

Finding a Therapist in Germany, Automation

https://github.com/sasanamari/busy_therapists

1•alikhoramshahi•8m ago•1 comments

Show HN: Boredroom, An app for men who stare at walls

https://apps.apple.com/us/app/bored-room/id6758528370

1•ZeidJ•10m ago•0 comments

Lovable: We're Currently Experiencing Issues

https://status.lovable.dev/

1•doener•12m ago•0 comments

Blockify – Build Shopify sections visually, export clean native Liquid you own

https://www.blockifybuilder.com/

1•kibbyd1985•13m ago•0 comments

Clio MCP Open-source Claude connector for law firms

https://github.com/oktopeak/clio-mcp

1•piterjov•16m ago•0 comments

Y Combinator – The French Documentary (With English Subtitles) [video]

https://www.youtube.com/watch?v=w-zS3U40Juo

1•rmason•16m ago•1 comments

I built an AI travel agent that books real hotels

https://medium.com/@sorin_14830/how-i-built-an-ai-travel-startup-that-actually-books-real-hotel-1...

2•sorinmihailescu•18m ago•0 comments

AI can replace your job. Here's what it can't replace

https://www.cjchilvers.com/blog/ai-can-replace-your-job-heres-what-it-cant-replace/

1•evo_9•20m ago•0 comments

Local Figma Port – export scoped design context to MCP for AI coding agents

https://github.com/echo-ae/local_figma_port

1•echo-ae•23m ago•0 comments

To buy this Bay Area home, you'll need Anthropic equity

https://techcrunch.com/2026/04/26/to-buy-this-bay-area-home-youll-need-anthropic-equity/

1•momentmaker•24m ago•0 comments

OpenAI is building a phone that would make apps obsolete

https://thenextweb.com/news/openai-qualcomm-ai-phone-agents-replace-apps

1•skeledrew•24m ago•1 comments

Sebastian Sawe breaks iconic sub-two-hour marathon barrier

https://www.bbc.com/sport/athletics/articles/cp383n09030o

7•avicado0o•24m ago•1 comments

Tokenmaxxing Isn't an AI Strategy

https://www.theregister.com/2026/04/26/ai_price_tag/

1•saikatsg•25m ago•0 comments

Show HN: Rocketship, AI app builder that comes with an AI sales team

https://deployrocketship.com

1•CarlosJeer•26m ago•1 comments

Show HN: Pdfnative-MCP – Model Context Protocol server for the pdfnative engine

https://www.npmjs.com/package/pdfnative-mcp

1•nizoka•26m ago•0 comments

U.S. companies back Sam Altman's World ID even as much of the world pushes back

https://restofworld.org/2026/sam-altman-worldcoin-zoom-tinder-partnerships/

12•kelnos•26m ago•0 comments

Bridging West Papua Through Dispossession

https://failedarchitecture.com/bridging-west-papua-through-dispossession/

1•Thevet•27m ago•0 comments

How Much of Substack Is AI?

https://www.usermag.co/p/how-much-of-substack-is-actually-ai-pangram-analysis-substack-bestsellers

2•laurex•27m ago•0 comments

Show HN: Claude Architect

https://github.com/willhennessy/architect

1•hennessywill•28m ago•0 comments

Cognition Launches Devin CLI

https://twitter.com/cognition/status/2048821234281181302

1•mschrage•28m ago•0 comments

Are landline phones making a comeback? [video]

https://www.bbc.com/reel/video/p0ncdtb6/watch

1•rolph•28m ago•0 comments

Copilot Student GPT-5.3-Codex removal from model picker

https://github.blog/changelog/2026-04-27-copilot-student-gpt-5-3-codex-removal-from-model-picker/

2•uncognic•32m ago•1 comments

Git hooks, upgraded: What's new in Git 2.54 and coming in 2.55

https://www.collabora.com/news-and-blog/news-and-events/git-hooks-upgraded-whats-new-git-254-and-...

3•losgehts•32m ago•0 comments