Show HN: I tracked 3,519 stock picks from 23 Substacks – who makes money?

2•lineudemonia•1h ago

I subscribe to 23 paid investment newsletters on Substack (~$9,600/year). I couldn't keep up with reading them all, so I built a system to extract and evaluate every stock pick.

*The pipeline:*

- Crawls articles from Substack - Extracts high-conviction stock picks using Gemini's structured output — filters out casual ticker mentions and only counts calls where the author dedicates real analysis, specific data, or price targets - Tracks returns at 1d, 7d, 15d, 30d, and 60d post-publication using yfinance - Calculates alpha vs sector-specific ETF benchmarks (SOXX for semis, IGV for SaaS, XLF for financials, EWJ for Japan, SPY as fallback) - Deduplication: same author, same ticker within 14 days = one call. Cross-author calls are independent

Total dataset: 3,519 high-conviction calls from 22 authors over 1 year.

*Interesting technical challenges:*

1. AI extraction accuracy. Gemini is surprisingly good at identifying whether an author is making a real call vs. just mentioning a ticker in passing. We tag calls with conviction level (high/low) and direction (bullish/bearish). To validate this, we spot-checked against manual reads and cross-verified with alternative model outputs. Not perfect, but consistent enough to be useful.

2. Custom domain handling. Many Substack authors use custom domains (e.g., collyerbridge.com, lordfed.co.uk) which sometimes trigger Cloudflare challenges. We fall back to headless Playwright when the standard HTTP client gets blocked.

3. Benchmark selection. A naive "did the stock go up?" metric is meaningless in a bull market. We map each ticker to a sector ETF benchmark, so alpha = position return minus benchmark return over the same period. This separates genuine stock-picking skill from just being long in a rising market.

4. Deduplication logic. Authors often revisit the same thesis across multiple articles. Without dedup, a single stock mentioned in 5 articles would count as 5 independent "calls." We use a 14-day window per author per ticker — only the first mention counts.

*Some findings (for context, not the point of this post):*

- Top performer averaged +14.9% at 30d and +26.7% at 60d on long calls - The most expensive newsletters ($1,000+/year) were not the best performers - Authors with fewer, more targeted calls (15-80) tended to outperform those with 300+ calls - 30d vs 60d rankings shift significantly — deep value investors look much better at longer horizons - Short calls were harder for almost everyone

*Stack:* Python, SQLite, Gemini API (structured output), yfinance, Playwright (optional)

I wrote a more detailed breakdown with charts as an X thread: https://x.com/pyhrroll/status/2027374283669066045?s=20

Happy to discuss the methodology, architecture, or share the extraction prompts. The pipeline is ~2,000 lines of Python if there's interest in seeing the code.

Comments

zahlman•54m ago

> I subscribe to 23 paid investment newsletters on Substack (~$9,600/year)

Suppose you had put that money in index funds instead?

lineudemonia•5m ago

nah i'm still trading on my own - so far so good.

Tudumb

Mondrian Entered the Public Domain. The Estate Disagrees

If you drive clock wise along the beach on an island

Software Quality (and Reliability, and Frugality)

Show HN: Alba – Earn and bid on unique software using idle AI credits

Show HN: Treekei – understand project code structure in seconds

Apple Magic TouchstreamLP: Using the Apple Magic Trackpad as Keyboard

NASA shakes up its Artemis program to speed up lunar return

Show HN: Gas Town Control Plane – hosted monitoring for multi‑agent workspaces

An Ode to Houseplant Programming

Toxic combinations: when small signals add up to a security incident

Show HN: ClawDocx – We built a skill and guide library for OpenClaw AI agents

Rivian was saved by software in 2025

Show HN: Offline-First Agent Multiplexer

Media Diet

Traveling neighborhoods: a different kind of group trip (2024)

Colt's Revolutions (2024)

AI RSS feed summarizer that powers feeds.carmo.io

Why Concepts Aren't Objects

Show HN: Badge that shows how well your codebase fits in an LLM's context window

The Simple Essence of Monomorphization [video]

Ente Locker

Most Knowledge Management Systems Fail

Show HN: BotBrowser – MCP server, saves 90% of tokens for web-browsing agents

Snyk Agent Scan: Security scanner for AI agents, MCP servers and agent skills

Government Agencies Raise Alarms About Use of Elon Musk's Grok Chatbot

Show HN: Drag/drop your site's user navigation map, export to code in seconds

Mimestream: Made for Mac. Optimized for Gmail

Podcasts Lead Am/FM in Spoken-Word Listening, Marking a First

The (Searchable) Whole Earth