frontpage.

Getting SEC filings faster by predicting future filing url

4•jgfriedman1999•1mo ago

It turns out that you can get SEC filings seconds to minutes faster by using URL prediction. This is because the SEC exposes the filing to the internet, before updating feeds like the RSS and PDS.

How it works: 1. The SEC accepts a filing, this is recorded as e.g. <ACCEPTANCE-DATETIME>20220204201127 2. The SEC then generates an index page for the filing, with filing metadata. This is publicly accessible. Typically the Last Modified Tag is the same as acceptance datetime. 3. The SEC then releases the filing's original sgml upload, and extracted documents. This is publicly accessibly. e.g. 10-K. 4. The SEC then updates RSS and PDS.

URL format A typical index page is expressed publicly as: https://www.sec.gov/Archives/edgar/data/1318605/000095017022000796/0000950170-22-000796-index.html

It turns out that you don't need the cik {1318605} for the url. https://www.sec.gov/Archives/edgar/data/95017022000796/0000950170-22-000796-index.html

This means that you can predict the index page using just the accession number. An accession number has format: {cik of entity submitting the filing NOT necessarily the actual company}-{2d year}-{typically sequential count of submissions that year}

So all you have to do is take the last accession, increment the count, and poll!

Once you match an index page, you can extract cik from that page, and construct the url for the filing information and poll that. https://www.sec.gov/Archives/edgar/data/1318605/0000950170-22-000796.txt

What's great about this approach is that a few entities file on behalf of most companies and individuals. If you only monitor ten entity accessions, you monitor 42% of the corpus, 100 and you get 68%. Numbers taken from 2024.

GitHub Link https://github.com/john-friedman/The-fastest-way-to-get-SEC-filings

This should be much faster than the papers which sparked government investigations! https://www.wsj.com/articles/sec-plans-to-fix-flaw-in-electronic-distribution-system-1419621428?gaa_at=eafs&gaa_n=AWEtsqd6-X8ylp_BlpWHYpFoJqrLMDwYUu3m1QBJhoRtCHDIHraLrD3tMHPXaw57JW4%3D&gaa_ts=693e2fd3&gaa_sig=noGkpoMh6OXa0MqFPgj5kFe9kx7vbkSpB1vFceqW8LtXzD2wWC2vkGLKJwnvkJO-sq7q93qKbX_rs7ULReZIwA%3D%3D

Show HN: Poddley.com – Follow people, not podcasts

Layoffs Surge 118% in January – The Highest Since 2009

Papyrus 114: Homer's Iliad

DicePit – Real-time multiplayer Knucklebones in the browser

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

Show HN: AI Agent Tool That Keeps You in the Loop

Why Every R Package Wrapping External Tools Needs a Sitrep() Function

Achieving Ultra-Fast AI Chat Widgets

Show HN: Runtime Fence – Kill switch for AI agents

Researchers surprised by the brain benefits of cannabis usage in adults over 40

Peter Thiel warns the Antichrist, apocalypse linked to the 'end of modernity'

USS Preble Used Helios Laser to Zap Four Drones in Expanding Testing

Show HN: Animated beach scene, made with CSS

An update on unredacting select Epstein files – DBC12.pdf liberated

Was going to share my work

Pitchfork: A devilishly good process manager for developers

You Are Here

Why social apps need to become proactive, not reactive

How patient are AI scrapers, anyway? – Random Thoughts

Vouch: A contributor trust management system

I built a terminal monitoring app and custom firmware for a clock with Claude

Tiny C Compiler

Y Combinator Founder Organizes 'March for Billionaires'

Ask HN: Need feedback on the idea I'm working on

OpenClaw Addresses Security Risks

Apple finalizes Gemini / Siri deal

Italy Railways Sabotaged

Emacs-tramp-RPC: high-performance TRAMP back end using MsgPack-RPC

Nintendo Wii Themed Portfolio

"There must be something like the opposite of suicide "