The idea is: instead of scrolling archives, they just ask questions. Answers are pulled only from your original content, with citations.
It’s aimed at writers and researchers who want their work to be more discoverable — but without spinning up vector infra or fiddling with RAG pipelines.
For context: I’ve always gone back to Paul Graham’s essays for startup advice. But there’s no good way to search them semantically or contextually. So I tried indexing a few with Bookshelf.
I asked: “How does PG think about evaluating founders?” and got a clean answer sourced from “Do Things That Don’t Scale” and a couple of other essays, citations included. It was surprisingly useful.
So far, one early test case is AnthropoceneGPT (https://sammatey.substack.com/p/introducing-anthropocenegpt) for Sam Matey’s newsletter. It’s seen 100+ queries. Readers say it works like a smart librarian; Sam says it gives him ideas for what to write next.
Rough implementation:
- Input: HTML/PDF exports
- Chunked and embedded via OpenAI (or a local model)
- Embeddings stored in a vector DB
- Retrieval API called by the custom GPT
- GPT instructed to answer only from retrieved chunks and cite them
- Optional auth for query tracking, to give writers some telemetry
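To make the pipeline concrete, here’s a minimal in-memory sketch of the chunk → embed → retrieve loop. The function names (chunk_text, embed, retrieve) are illustrative, and the hashed bag-of-words embedding is a deterministic stand-in — a real deployment would call OpenAI’s embedding endpoint and persist vectors in a vector DB instead:

```python
import hashlib
import math

def chunk_text(text, max_words=200, overlap=40):
    """Split a document into overlapping word-window chunks."""
    words = text.split()
    chunks, step = [], max_words - overlap
    for i in range(0, len(words), step):
        chunk = " ".join(words[i:i + max_words])
        if chunk:
            chunks.append(chunk)
        if i + max_words >= len(words):
            break
    return chunks

def embed(text, dim=64):
    """Stand-in embedding: hashed bag-of-words, L2-normalized.
    Swap in a real embedding model (e.g. OpenAI) in practice."""
    vec = [0.0] * dim
    for word in text.lower().split():
        h = int(hashlib.md5(word.encode()).hexdigest(), 16)
        vec[h % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def retrieve(query, index, top_k=3):
    """Return the top_k (score, chunk) pairs by cosine similarity."""
    q = embed(query)
    scored = [(sum(a * b for a, b in zip(q, v)), c) for v, c in index]
    return sorted(scored, key=lambda s: s[0], reverse=True)[:top_k]

# Build the index: (vector, chunk) pairs, normally held in a vector DB.
docs = ["Do things that don't scale. Recruit users manually at first.",
        "Founders should talk to users and iterate quickly."]
index = [(embed(c), c) for d in docs for c in chunk_text(d)]

results = retrieve("how should founders find early users", index)
```

The retrieved chunks (with their source metadata) are what gets handed to the GPT, which is instructed to answer only from them and cite each one.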
Here’s a demo GPT trained on Paul Graham’s archive: Paul Graham GPT (https://tinyurl.com/paul-graham-gpt)
Would love thoughts on:
- What would make this better for writers or readers?
- Any UX nits on the GPT side?
- Has anyone tried doing something similar in-house?