So I built Thoth. Three design decisions that drove everything:
1. HOT-LOADING SKILLS Agents start minimal, with no tools loaded. When you ask "find papers on attention mechanisms," the agent loads the paper-discovery skill, which attaches its MCP tools dynamically. When you switch to "analyze this citation network," it loads the citation skill. There are 10 bundled skills, and users can create their own. This keeps context windows small and agents focused.
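The lazy-loading idea above can be sketched in a few lines. This is a toy illustration, not Thoth's actual API: the `SkillRegistry` class, skill names, and tool names here are all hypothetical.

```python
from typing import Callable

class SkillRegistry:
    """Toy sketch of hot-loading skills: tools are registered lazily and
    only materialized when a matching skill is actually invoked."""

    def __init__(self) -> None:
        self._loaders: dict[str, Callable[[], list[str]]] = {}
        self._loaded: dict[str, list[str]] = {}

    def register(self, name: str, loader: Callable[[], list[str]]) -> None:
        # Register a loader; nothing is loaded until the skill is requested,
        # so unused skills never take up context.
        self._loaders[name] = loader

    def load(self, name: str) -> list[str]:
        # Load on first use, then cache for subsequent turns.
        if name not in self._loaded:
            self._loaded[name] = self._loaders[name]()
        return self._loaded[name]

registry = SkillRegistry()
registry.register("paper-discovery",
                  lambda: ["search_arxiv", "search_semantic_scholar"])
registry.register("citation-analysis",
                  lambda: ["build_citation_graph"])

# Only paper-discovery's tools enter the agent's context here.
tools = registry.load("paper-discovery")
```

The point of the pattern is that registration is cheap and loading is deferred, so an agent with dozens of available skills still starts with an empty toolset.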
2. CHAT AS CONFIGURATION Every setting, every source, every extraction schema can be changed through conversation. "Add NeurIPS as a source" just works. "Change the extraction schema to include methodology sections" just works. No config files required (though power users can still edit them).
3. AUTOMATED SOURCE DISCOVERY Give the agent any URL. It uses Playwright + an LLM to auto-detect article elements and creates a working scraper, with zero configuration. If it gets something wrong, tell it in natural language and it iterates. You can also describe a broad research idea: the agent helps you refine the topic, creates a research job, adds sources automatically, and runs recurring updates. You always have the most up-to-date research for YOUR work, not just the hot topic of the day.
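To make the "auto-detect article elements" step concrete, here is a stdlib-only toy of the validation half: checking that a candidate pattern actually yields article links on a page. The real pipeline uses Playwright plus an LLM to propose the pattern; this sketch hard-codes one ("links inside article tags") and only shows the counting.

```python
from html.parser import HTMLParser

class ArticleLinkCounter(HTMLParser):
    """Count links nested inside <article> elements: a crude signal that a
    candidate scraping pattern is finding real article entries."""

    def __init__(self) -> None:
        super().__init__()
        self.article_depth = 0
        self.links = 0

    def handle_starttag(self, tag, attrs):
        if tag == "article":
            self.article_depth += 1
        elif tag == "a" and self.article_depth > 0:
            self.links += 1

    def handle_endtag(self, tag):
        if tag == "article" and self.article_depth > 0:
            self.article_depth -= 1

def count_article_links(html: str) -> int:
    parser = ArticleLinkCounter()
    parser.feed(html)
    return parser.links

# Hypothetical page snippet; a real run would fetch rendered HTML via Playwright.
page = ('<article><a href="/p1">Paper 1</a></article>'
        '<article><a href="/p2">Paper 2</a></article>')
n = count_article_links(page)
```

An LLM-driven version would propose several candidate selectors, score each like this, and keep the best, which is also what the natural-language iteration loop nudges.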
Other stuff:
- 64 MCP tools across 16 categories
- Letta-powered persistent memory (6 memory blocks per agent)
- Hybrid RAG: pgvector + BM25 + Reciprocal Rank Fusion + reranking
- 6-stage citation resolution (Crossref → OpenAlex → ArXiv → fuzzy matching)
- 7 pre-built source plugins (ArXiv, Semantic Scholar, NeurIPS, ICML, etc.)
- One-command install: curl | bash
- Obsidian plugin for the UI
- Fully local / privacy-first
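For anyone curious about the fusion step in the hybrid RAG pipeline, Reciprocal Rank Fusion is small enough to show in full. Each retriever contributes 1/(k + rank) per document; k = 60 is the conventional constant from the original RRF paper (Thoth's exact parameters may differ, and the doc names here are made up).

```python
def rrf(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists with Reciprocal Rank Fusion:
    score(d) = sum over lists of 1 / (k + rank_of_d_in_that_list)."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(scores, key=scores.get, reverse=True)

vector_hits = ["doc_a", "doc_b", "doc_c"]  # dense (pgvector) ranking
bm25_hits = ["doc_b", "doc_d", "doc_a"]    # sparse (BM25) ranking
fused = rrf([vector_hits, bm25_hits])
```

The appeal of RRF is that it needs no score calibration between retrievers: only ranks matter, so dense cosine similarities and BM25 scores can be combined without normalization, and a reranker can then polish the fused top-k.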
Stack: Python 3.12, FastAPI, Letta, PostgreSQL+pgvector, TypeScript, Docker
https://github.com/acertainKnight/project-thoth
Would love feedback. There will definitely be some bugs here and there, but it's been great for my own workflows.