frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

NASA now allowing astronauts to bring their smartphones on space missions

https://twitter.com/NASAAdmin/status/2019259382962307393
2•gbugniot•3m ago•0 comments

Claude Code Is the Inflection Point

https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point
1•throwaw12•5m ago•0 comments

MicroClaw – Agentic AI Assistant for Telegram, Built in Rust

https://github.com/microclaw/microclaw
1•everettjf•5m ago•2 comments

Show HN: Omni-BLAS – 4x faster matrix multiplication via Monte Carlo sampling

https://github.com/AleatorAI/OMNI-BLAS
1•LowSpecEng•6m ago•1 comments

The AI-Ready Software Developer: Conclusion – Same Game, Different Dice

https://codemanship.wordpress.com/2026/01/05/the-ai-ready-software-developer-conclusion-same-game...
1•lifeisstillgood•8m ago•0 comments

AI Agent Automates Google Stock Analysis from Financial Reports

https://pardusai.org/view/54c6646b9e273bbe103b76256a91a7f30da624062a8a6eeb16febfe403efd078
1•JasonHEIN•11m ago•0 comments

Voxtral Realtime 4B Pure C Implementation

https://github.com/antirez/voxtral.c
1•andreabat•14m ago•0 comments

I Was Trapped in Chinese Mafia Crypto Slavery [video]

https://www.youtube.com/watch?v=zOcNaWmmn0A
1•mgh2•20m ago•0 comments

U.S. CBP Reported Employee Arrests (FY2020 – FYTD)

https://www.cbp.gov/newsroom/stats/reported-employee-arrests
1•ludicrousdispla•21m ago•0 comments

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
2•vladeta•27m ago•1 comments

Show HN: SVGV – A Real-Time Vector Video Format for Budget Hardware

https://github.com/thealidev/VectorVision-SVGV
1•thealidev•28m ago•0 comments

Study of 150 developers shows AI generated code no harder to maintain long term

https://www.youtube.com/watch?v=b9EbCb5A408
1•lifeisstillgood•29m ago•0 comments

Spotify now requires premium accounts for developer mode API access

https://www.neowin.net/news/spotify-now-requires-premium-accounts-for-developer-mode-api-access/
1•bundie•31m ago•0 comments

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•33m ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
2•birdculture•34m ago•0 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•36m ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
2•ramenbytes•39m ago•0 comments

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•40m ago•0 comments

Ed Zitron: The Hater's Guide to Microsoft

https://bsky.app/profile/edzitron.com/post/3me7ibeym2c2n
2•vintagedave•43m ago•1 comments

UK infants ill after drinking contaminated baby formula of Nestle and Danone

https://www.bbc.com/news/articles/c931rxnwn3lo
1•__natty__•44m ago•0 comments

Show HN: Android-based audio player for seniors – Homer Audio Player

https://homeraudioplayer.app
3•cinusek•44m ago•1 comments

Starter Template for Ory Kratos

https://github.com/Samuelk0nrad/docker-ory
1•samuel_0xK•45m ago•0 comments

LLMs are powerful, but enterprises are deterministic by nature

2•prateekdalal•49m ago•0 comments

Make your iPad 3 a touchscreen for your computer

https://github.com/lemonjesus/ipad-touch-screen
2•0y•54m ago•1 comments

Internationalization and Localization in the Age of Agents

https://myblog.ru/internationalization-and-localization-in-the-age-of-agents
1•xenator•54m ago•0 comments

Building a Custom Clawdbot Workflow to Automate Website Creation

https://seedance2api.org/
1•pekingzcc•57m ago•1 comments

Why the "Taiwan Dome" won't survive a Chinese attack

https://www.lowyinstitute.org/the-interpreter/why-taiwan-dome-won-t-survive-chinese-attack
2•ryan_j_naughton•57m ago•0 comments

Xkcd: Game AIs

https://xkcd.com/1002/
2•ravenical•59m ago•0 comments

Windows 11 is finally killing off legacy printer drivers in 2026

https://www.windowscentral.com/microsoft/windows-11/windows-11-finally-pulls-the-plug-on-legacy-p...
2•ValdikSS•59m ago•0 comments

From Offloading to Engagement (Study on Generative AI)

https://www.mdpi.com/2306-5729/10/11/172
1•boshomi•1h ago•1 comments
Open in hackernews

KPMG wrote 100-page prompt to build agentic TaxBot

https://www.theregister.com/2025/08/20/kpmg_giant_prompt_tax_agent/
14•ofrzeta•5mo ago

Comments

ofrzeta•5mo ago
"It is very efficient," Munnelly told the Forrester conference. "It does what our team used to do in about two weeks, in a day. It will strip through our documents and the legislation and produce a 25-page document for a client as a first draft.

"That speed is important," he added. "If we have a client who is about to do a merger, and they want to understand the tax implications, getting that knowledge in a day is much more important than getting it in two weeks' time."

---

I really wonder what is the foundation for their confidence in LLMs. If you have ever used ChatGPT you will be highly skeptic that the output is correct. If it's code, you can at least compile, typecheck, run it, to verify it to some extent. How do you do that with a 25 page report?

defrost•5mo ago
> How do you do that with a 25 page report?

Like any technical 25 page report it'll be ballpark with reality, shorter to read and grasp than crawling through a wall of document filled boxes, and passed to other people to 'verify' / offer their opinions on.

Once contracts are in place with millions of dollars in play (or tens of millions, or billions) there will be clauses addressing responsibility and recompense should key parts of the reports upon which an agreement is based prove to be false.

The world runs on technical reports that aren't perfect, but "near enough"; errors are assumed and a frequency of deliberate malfeasance (knowingly lying, misleading, faking results) can be estimated.

Part of my career consisted of producing summaries of two to three thousand documents a day from stock markets about the globe, documents that ranged from three lines announcing a change on a board, a table disclosing a change in holdings by largest investors, etc. to large (hundred+ page) quarterly and annual reports, to small book economic feasibility reports with wads of raw data, interpretation, proposed plans, costings, timelines, etc.

> It will strip through our documents and the legislation and produce a 25-page document for a client as a first draft.

is the key point here, it's a rapid first draft of the major dot points seen to be most important for <whatever>. It is intended to be crawled through with a finer comb and a keen eye before contracts are signed based on a separate framing of <deal>.

The big change here is that an AI churns out a draft faster, the quality of the document will be as suspect as a non AI created human first draft .. untrusted.

ofrzeta•5mo ago
Untrusted ... but does it have any value at all when you can't be sure that a lot of it is hallucinated? After all, LLMs are not very good with numbers.
defrost•5mo ago
You're correct that I can't be sure as I don't work at KPMG and haven't had any contact with their piles of documents, existing practices, or TaxBot summaries.

What I do know as a fact is that KPMG are self reporting satisfaction with their in house work on putting such a thing together.

The 'proof' will be the next five years of application to corporate clients.

> After all, LLMs are not very good with numbers.

The assumption, always, should be that neither are interns.

Hence why draft summaries should be reviewed and sanity checked by senior experienced people.

I would assume (based on my prior work summarizing large volumes of data for mineral and energy resources domain) that any report produced would have references back to source documents and pages making the task of cross checking the product simple and relatively straightforward.

Neywiny•5mo ago
I think the concern is more than what it gathered, I think there's a lot of skepticism over it missing something. The same way so many AI tools just ignore commands, imagine it just ignoring a few sentences. Maybe like:

> We'll sell you our company for $100. But, you have to do a hand-stand and spin around 5 times.

If the AI only puts the first sentence in the summary, you could see how it'd be a bad day for the client. Any human would go "huh that's weird, I'll make sure that's noted in the summary" but in my experience, AIs just don't have that feeling.

defrost•5mo ago
What's being ignored, it seems, is this is explicitly an in-house tool for a first draft summary to be reviewed by an in-house accountant prior to a final presentation to a client.

> imagine it just ignoring a few sentences.

Sure. Just like the risk every such human intern | associate | junior prepared similar draft report already carries today and in the past.

One would hope that as a company at risk of litigation and carrying the can for bad advice that an AI reduced draft such as this would be proof read by a senior expert in house who would trace back every "We'll sell you our company for $100." to the _original_ context via an embedded hyperlink in the draft.

It's certainly the way in which things were done when generating summaries of tens of thousands of documents for mineral and energy clients looking to invest at least $50 million in advancing projects for return.

Neywiny•5mo ago
You've missed my point. I don't think any human who has a job at a law firm would ignore a sentence like that. I think any AI I've used has ignored explicit instructions of moderate severity. I'm not worried it'll hallucinate things into existence, I'm worried it'll ignore them out. Can't summarize without throwing away words. I don't trust it to choose the right ones.
defrost•5mo ago
And you've missed mine.

I don't think any human at any law firm, medical practice, major resource company, etc. that deals with volumes of documentation in the course of making multi million deals would _trust_ an associate / intern pool or an AI to create a perfect product that can be passed directly to a client without any form of checking and verification.

It's a _given_ that there will be shortfalls and errors and the procedures need to be sufficient to embrace an error prone distillation phase and a circle back and verify phase.

At least in my experience to date.

It's clear from the article that KPMG feel much the same way.

SvenL•5mo ago
I wonder the same. I mean, if it is produced in 1 day but I need 2 weeks to verify it, I don’t gain much. Sure I can ask it to quote and link the sources, but still. I remember this case of the Machine Learning book from Springer press where the author used a LLM and it was only revealed when someone tried to look up the quoted sources - they didn’t exist, they were made up.
yobbo•5mo ago
It might also be their relative confidence in peope vs LLMs for this sort of task. People could be worse when the task itself is trivial but the volume is intangible for a single human.
immibis•5mo ago
The secret is that nobody both reads the report and wants it to be factual.
bashtoni•5mo ago
If it really is a single 100 page prompt then it will be even less reliable than a KPMG audit.

(See https://www.theguardian.com/business/2023/oct/12/kpmg-fined-... or https://pcaobus.org/news-events/news-releases/news-release-d... or https://www.sec.gov/newsroom/press-releases/2017-142 or any of a myriad of other cases)

roxolotl•5mo ago
> Munnelly said KPMG built the agent by writing a 100-page prompt it fed into Workbench. The Register asked for details of the prompt and Munnelly said a substantial team worked on it for months, and the resulting agent asks for four or five inputs before it starts working on tax advice, then asks a human for direction before generating a document.

> Only tax agents can use the tool, because its output is not suitable for people without deep tax expertise.

Ok cool so they write a giant piece of software to assist in highly specialized tasks. Would love to know what the LLM adds. Maybe just parsing?

UK-Al05•5mo ago
"We produced a tool that produces a wonky result, so the results need to be examined in detail anyway."
UltraSane•5mo ago
KPMG can't actually perform accurate audits .