frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

IRS open sources its fact graph

https://github.com/IRS-Public/fact-graph
174•ronbenton•3h ago

Comments

ok123456•2h ago
Why would I want to use this over Prolog/Datalog?
NoahZuniga•2h ago
Because prolog/datalog don't offer a list of questions that you can ask based on context to calculate someone's US taxes.
ok123456•2h ago
That's the database you consult(). Doing income taxes is well-suited to traditional logic programming.
akerl_•2h ago
This is a bit like asking "why would I use my car's schematics instead of a wrench".

This is the rules engine's details. You could use it to build the logic and traversal in whatever language you like.

tallowen•2h ago
It's nice to see an open sourced implementation of the US tax code! This was part of the IRS Direct File codebase that allowed people to file their taxes for free, directly with the IRS. It was canceled earlier this year by the Trump administration. It looks like the Fact Graph was already opensourced a couple months ago and that version of the factgraph lives here: https://github.com/IRS-Public/direct-file/tree/main/direct-f...

I'm curious why a second repository was created for this.

ronbenton•2h ago
I wonder too. Perhaps the intent is for it to be standalone for general usage and not just as a part of the direct file project?
Twisol•2h ago
Seems so, according to this file: https://github.com/IRS-Public/fact-graph/blob/main/docs/from...

> The main changes are: [...] converting the fact-graph to a standalone library [...]

infotainment•2h ago
I'm still disappointed that they got rid of Direct File, such a promising start...
ronbenton•2h ago
Big W for the tax lobby, big L for the rest of us
beej71•2h ago
I'm far beyond disappointed for that. I'm fucking pissed. Such stupid politicking that makes all of our lives shittier.
thegreatpeter•1h ago
https://directfile.irs.gov ??
CamperBob2•1h ago
Did you try actually using it?
mensetmanusman•2h ago
Build it and release for free.
astrange•2h ago
It's still there. They like saying things and not doing them.

https://directfile.irs.gov

So it's always possible they'll just forget to shut it off.

shrinks99•25m ago
Having talked at length with one of the developers of Direct File at a conference who was fired along with many of the other folks that worked on Direct File, I can assure you that it's no longer being worked on.

The 2024 site remains up so people can file their taxes for that year, but it will no longer be updated.

aliljet•2h ago
I wonder how this can be used with an LLM to provide interesting tax advice? I'd love to regularly ask questions of the tax code...
ronbenton•2h ago
Makes me wonder if someone has already trained a model on the tax code. Would be interesting for sure.
astrange•2h ago
Model training data already contains all the text there is[0], so they can already answer questions like this (especially with web search), but they aren't good at tax calculations.

https://arxiv.org/abs/2507.16126v1

[0] but it's quite possible the conversion from HTML to text is bad

kevin_thibedeau•1h ago
The problem is that the text of US tax code isn't enough to know the correct action to take. The IRS has semi-formal policies based on how it has chosen to interpret the statutes. There are areas of gray that they don't clearly specify. Some of this is in supplementary publications but it still has subjective elements. One example is that settlements for "serious injuries" are regarded as non-taxable income. What constitutes serious is a squishy concept.
cco•1h ago
Yeah you'd have to pull in a lot of case law and perform a lot of fine tuning on expert tax advice (you'd probably have to create this training data).

Would be neat (and still legally fraught!).

TZubiri•1h ago
You can technically use the language model as a data model. That was the quick hack that started it all, autocomplete on a question produces the answer, yes.

However it's clear that we are moving towards separating the data and the language model. Even base chatgpt is given Search Tools and python Tools instead of producing them by text, the tool call itself may be generated by the model though.

You can for sure use a pure LLM to ask it questions about tax code, but we'll probably see specific tools that only contain canon law and kosher case law, and sources it properly. Y'know instead of halucinating

ryandrake•2h ago
I guess as long as it's for entertainment purposes only. I'm going to file "actually following tax/legal advice from a potentially hallucinating LLM" under NOPE.
hahahacorn•1h ago
The super obvious workflow is to query for an idea in natural English and then verify or ask the LLM to provide the paths it was following.

It begs the question why you assume the parent comment was going to blindly follow the LLMs output.

Jach•1h ago
patio11's already saved over $2k apparently, maybe he'll do a more formal write-up at some point. (A couple threads here https://x.com/patio11/status/1977425626584711668 and here https://x.com/patio11/status/1978168404793037087 )
koolba•52m ago
Any idea what the actual deduction it supposedly found for private school?

You can pay for K-12 with 529 or Coverdell ESA funds. But neither allows deductions for contributions. Only growth in either is tax free (assuming it’s spent on education expenses).

jauntywundrkind•2h ago
This was such a fun neat part of the Direct File code drop 5 months ago. https://news.ycombinator.com/item?id=44131901

In particular there's a pretty nice inline tutorial that's still there in that release: https://github.com/IRS-Public/direct-file/blob/main/direct-f...

bickfordb•2h ago
Surprised to learn we still have an IRS
alberth•1h ago
> As a work of the United States Government, this project is in the public domain within the United States.

What does it mean for the license to say "within the US"?

Does this mean this software cannot be used outside the US?

ronbenton•1h ago
Good question. Copyright laws are country-specific, right? So perhaps it is just trying to be clear that there is no license being asserted outside of the US.
dragonwriter•1h ago
Licenses are offered or granted (they are permissions from the copyright holder), not asserted.
dragonwriter•1h ago
> What does it mean for the license to say "within the US"?

It means exactly what it says; you have to read the whole thing (or at least the two sentences before the CC 1.0 Universal text, which is the operative mechanism by which the second sentence is effected), not a fraction of the first sentence.

> Does this mean this software cannot be used outside the US?

No. The license explains two things:

(1) Without any license, this is automatically public domain in the US because it is a federal government work.

(2) The federal government (as the owner of the copyright at creation outside the United States, at least anywhere that applies the common rules underlying the Berne Convention) waives copyright worldwide, and does so via the CC 1.0 Universal declaration (the text of which is then included.)

So, it is, to the extent that this is legally possible, copyright-free globally.

jandrewrogers•1h ago
Some countries don't recognize the concept of Public Domain works. In the US, many government works are Public Domain as a matter of law. This creates complications internationally in those countries that don't recognize the legitimacy of Public Domain as a legal concept. Nonetheless, the US still wants to make it available internationally.

To satisfy these conflicting requirements, the US government places it in the Public Domain in the US to satisfy US law. Additionally, they make it available internationally under a license that approximates the intent of Public Domain while still being recognized as a legally valid thing.

vineyardmike•1h ago
Am I being dumb or does this not actually contain the facts about the tax code? Is the /demo/all-facts file supposed to be the “real” facts? Are the XML fact files provided in another location?

It’s pretty cool to see the way that the IRS handles defining and maintaining its tax calculations, but also a machine-readable tax code seems cool too.

ronbenton•1h ago
I believe the actual IRS tax code implementation is in a separate repo here: https://github.com/IRS-Public/direct-file while the originally linked repo is the fact graph tooling decoupled from the tax implementation.
tyingq•1h ago
Look like many of them are specifically the xml files here:

https://github.com/IRS-Public/direct-file/tree/e0d5c84451cc5...

ronbenton•1h ago
I was just reading through those! A bit dizzying
kumavis•1h ago
specifically here https://github.com/IRS-Public/direct-file/tree/main/direct-f...
MangoToupe•1h ago
As far as I am aware, fact just means shared assumption. This seems entirely reasonable for a tax code.
rvitorper•1h ago
Scala mentioned
hedayet•1h ago
I’ve had frustrating experiences with TurboTax due to its overly complex interface, aggressive data collection under the guise of saving money (which it doesn’t deliver), and a convoluted pricing structure that rivals the IRS’s own complexity.

I hope this initiative is good enough to enable domain experts and good people to build transparent, user-friendly alternatives to challenge TurboTax’s market grip.

Has anyone encountered promising tools or approaches that tackle these pain points?

willis936•1h ago
DirectFile was quite good for the one year I was able to use it and addressed your concerns. Don't worry, that's since been taken care of.

https://apnews.com/article/irs-direct-file-tax-returns-free-...

j_bum•1h ago
Just a heads up, your URL 404’s
willis936•1h ago
Thanks. Fixed. I stripped what I thought was a tracker without testing.
babelfish•40m ago
FreeTaxUSA is legitimately fantastic!
somehnguy•37m ago
TurboTax’s advertising is borderline fraudulent in my opinion.

Freetaxusa.com (no affiliation) is just as good and legitimately free.

Spooky23•34m ago
The H&R Block software is better imo.
hk1337•1h ago
My eyes read Scala but my brain was thinking Clojure, so I was a bit confused on why there weren’t any parentheses for the first couple of seconds looking at the source.

Apple M5 chip

https://www.apple.com/newsroom/2025/10/apple-unleashes-m5-the-next-big-leap-in-ai-performance-for...
980•mihau•13h ago•1064 comments

Claude Haiku 4.5

https://www.anthropic.com/news/claude-haiku-4-5
470•adocomplete•9h ago•185 comments

I'm recomming my customers switch to Linux rather that Upgrade to Windows 11

https://www.scottrlarson.com/publications/publication-windows-move-towards-surveillance/
156•trinsic2•1h ago•81 comments

Next Steps for the Caddy Project Maintainership

https://caddy.community/t/next-steps-for-the-caddy-project-maintainership/33076
121•francislavoie•5h ago•43 comments

Build a Superscalar 8-Bit CPU (YouTube Playlist) [video]

https://www.youtube.com/watch?v=bwjMLyBU4RU&list=PLyR4neQXqQo5nPdEiMbaEJxWiy_UuyNN4&index=1
11•lrsjng•5d ago•0 comments

Writing an LLM from scratch, part 22 – training our LLM

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
87•gpjt•3h ago•1 comments

I almost got hacked by a 'job interview'

https://blog.daviddodda.com/how-i-almost-got-hacked-by-a-job-interview
766•DavidDodda•13h ago•398 comments

IRS open sources its fact graph

https://github.com/IRS-Public/fact-graph
174•ronbenton•3h ago•47 comments

Zed is now available on Windows

https://zed.dev/blog/zed-for-windows-is-here
210•meetpateltech•10h ago•71 comments

ImapGoose

https://whynothugo.nl/journal/2025/10/15/introducing-imapgoose/
36•xarvatium•4h ago•8 comments

Are hard drives getting better?

https://www.backblaze.com/blog/are-hard-drives-getting-better-lets-revisit-the-bathtub-curve/
137•HieronymusBosch•9h ago•61 comments

Bringing NumPy's type-completeness score to nearly 90%

https://pyrefly.org/blog/numpy-type-completeness/
53•todsacerdoti•1w ago•22 comments

Gerald Sussman - An Electrical Engineering View of a Mechanical Watch (2003)

https://techtv.mit.edu/videos/15895-an-electrical-engineering-view-of-a-mechanical-watch
52•o4c•1w ago•11 comments

Show HN: Halloy – Modern IRC client

https://github.com/squidowl/halloy
288•culinary-robot•15h ago•79 comments

Pwning the Nix ecosystem

https://ptrpa.ws/nixpkgs-actions-abuse
246•SuperShibe•13h ago•42 comments

Leaving serverless led to performance improvement and a simplified architecture

https://www.unkey.com/blog/serverless-exit
306•vednig•15h ago•189 comments

F5 says hackers stole undisclosed BIG-IP flaws, source code

https://www.bleepingcomputer.com/news/security/f5-says-hackers-stole-undisclosed-big-ip-flaws-sou...
144•WalterSobchak•13h ago•65 comments

A Gemma model helped discover a new potential cancer therapy pathway

https://blog.google/technology/ai/google-gemma-ai-cancer-therapy-discovery/
54•alexcos•7h ago•11 comments

Recursive Language Models (RLMs)

https://alexzhang13.github.io/blog/2025/rlm/
79•talhof8•9h ago•24 comments

A kernel stack use-after-free: Exploiting Nvidia's GPU Linux drivers

https://blog.quarkslab.com/./nvidia_gpu_kernel_vmalloc_exploit.html
140•mustache_kimono•12h ago•16 comments

Recreating the Canon Cat document interface

https://lab.alexanderobenauer.com/updates/the-jasper-report
93•tonyg•12h ago•7 comments

Princeton Engineering Anomalies Research

https://pearlab.icrl.org/
36•walterbell•1w ago•6 comments

Garbage collection for Rust: The finalizer frontier

https://soft-dev.org/pubs/html/hughes_tratt__garbage_collection_for_rust_the_finalizer_frontier/
116•ltratt•14h ago•117 comments

The brain navigates new spaces by 'darting' between reality and mental maps

https://medicine.yale.edu/news-article/brain-navigates-new-spaces-by-flickering-between-reality-a...
144•XzetaU8•1w ago•60 comments

More About Jumps Than You Wanted to Know

https://gpfault.net/posts/asm-tut-4.html
8•nice_byte•6d ago•0 comments

FSF announces Librephone project

https://www.fsf.org/news/librephone-project
1395•g-b-r•1d ago•567 comments

Americans' love of billiards paved the way for synthetic plastics

https://invention.si.edu/invention-stories/imitation-ivory-and-power-play
65•geox•1w ago•34 comments

M5 MacBook Pro

https://www.apple.com/macbook-pro/
322•tambourine_man•13h ago•441 comments

How First Wap tracks phones around the world

https://www.lighthousereports.com/methodology/surveillance-secrets-explainer/
68•mattboulos•4h ago•11 comments

Reverse engineering iWork

https://andrews.substack.com/p/reverse-engineering-iwork
76•andrew_rfc•15h ago•3 comments