Hey guys,

I’m open sourcing an autonomous financial research and analysis agent. The agent can search SEC filings, extract key financials, and build financial models.

With GPT-5, it scores 80% on the public Finance Agent validation set. For comparison, the top result listed on the benchmark’s website is 55%, and that figure is for the private validation set.

However, the public validation set has mistakes: in quite a few cases, the “ground truth” answers in the benchmark are wrong. I’ve documented each case with citations directly to SEC EDGAR here (https://github.com/lucasastorian/intellifin-agent/blob/main/...).

Accuracy with GPT-5 jumps to 92% once we fix those mistakes in the eval.

You can clone the repo and rerun the benchmark yourself.

The next step should be turning this into an open-source “Cursor for Finance,” with a proper UI for equities research and financial modeling.

Feedback & questions are welcome.

https://github.com/lucasastorian/intellifin-agent

Show HN: Open-Source Finance Agent