frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Yapit – PDF and webpage reader with TTS that doesn't suck

https://github.com/yapit-tts/yapit
4•MaxWolf-01•5h ago
Yapit converts PDFs and web pages to audio, with a vision-LLM pipeline that handles math and complex layout instead of garbling them. I built it because I read a lot of papers and content online, but drift off after two paragraphs. Listening while following along keeps me focused and lowers the bar to actually start.

Every TTS tool I tried broke on complex formatting. Papers with math, citations, figure references, page numbers in the middle of sentences. You either get garbled output or you're listening to raw LaTeX.

Yapit converts everything to markdown as a common format. For web pages, defuddle (https://github.com/kepano/defuddle) handles the extraction and strips clutter from web pages, presenting the main article content in a clean, consistent format. For PDFs, a vision LLM rewrites each page into markdown with annotation tags that separate what you see from what gets read aloud. Math is rendered visually but gets spoken alt text. Citations like "[13]" or "(Schmidhuber, 1970)" are silently displayed. Page numbers and headers are removed entirely.

Both extraction and audio are cached by content hash, so the same content is never processed or synthesized twice.

Self-hosting works with any OpenAI-compatible TTS server (vLLM-Omni, ...) and any OpenAI-compatible vision model for PDF extraction:

  git clone --depth 1 https://github.com/yapit-tts/yapit.git && cd yapit
  cp .env.selfhost.example .env.selfhost
  make self-host
Kokoro TTS also runs in the browser via WebGPU on desktop.

Try it on Attention Is All You Need (all voices cached, no account needed): https://yapit.md/listen/3bde213b-3a5a-465f-9198-be65430b699e...

Or paste any URL: https://yapit.md/https://arxiv.org/abs/1810.04805 https://yapit.md/https://x.com/karpathy/status/2039805659525...

GitHub: https://github.com/yapit-tts/yapit (AGPL-3)

Show HN: I Built Paul Graham's Intellectual Captcha Idea

https://mentwire.com/sample
21•nowflux•52m ago•14 comments

Show HN: I built a tiny LLM to demystify how language models work

https://github.com/arman-bd/guppylm
763•armanified•17h ago•114 comments

Show HN: Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B

https://github.com/fikrikarim/parlor
216•karimf•23h ago•24 comments

Show HN: GovAuctions lets you browse government auctions at once

https://www.govauctions.app/
3•player_piano•1h ago•2 comments

Show HN: Gemma Gem – AI model embedded in a browser – no API keys, no cloud

https://github.com/kessler/gemma-gem
134•ikessler•17h ago•18 comments

Show HN: Weird Clocks

https://clocks.specr.net
4•vunderba•1h ago•0 comments

Show HN: I made a YouTube search form with advanced filters

https://playlists.at/youtube/search/
293•nevernothing•17h ago•184 comments

Show HN: ReverseCam – See yourself as others see you

https://www.reversecam.com
4•ilamparithi•1h ago•0 comments

Show HN: Tiny TUI for disk usage exploration

https://github.com/dhbradshaw/syz
3•dhbradshaw•1h ago•1 comments

Show HN: MCP 2000 – Browser-based drum machine with AI-generated sounds

https://www.mcp2000.com/
3•harmoni-pet•1h ago•0 comments

Show HN: Modo – I built an open-source alternative to Kiro, Cursor, and Windsurf

https://github.com/mohshomis/modo
91•mohshomis•17h ago•18 comments

Show HN: I replaced Google Analytics with my own tool – no cookies, <1KB script

https://datakool.com/
8•VictorChanet•6h ago•4 comments

Show HN: I just built a MCP Server that connects Claude to all your wearables

https://pacetraining.co/
9•anton_salcher•8h ago•8 comments

Show HN: A game where you build a GPU

https://jaso1024.com/mvidia/
937•Jaso1024•2d ago•181 comments

Show HN: I built a 2-min quiz that shows you how bad you are at estimating

https://www.convexly.app/
6•convexly•4h ago•1 comments

Show HN: Ec – terminal native 3-way Git mergetool

https://github.com/chojs23/ec
8•neozz•4h ago•0 comments

Show HN: I made a crossword app for language learners

https://cranki.app/
7•petargyurov•4h ago•0 comments

Show HN: Yapit – PDF and webpage reader with TTS that doesn't suck

https://github.com/yapit-tts/yapit
4•MaxWolf-01•5h ago•0 comments

Show HN: OsintRadar – Curated directory for osint tools

https://osintradar.com/
77•lexalizer•1d ago•9 comments

Show HN: M. C. Escher spiral in WebGL inspired by 3Blue1Brown

https://static.laszlokorte.de/escher/
169•laszlokorte•1d ago•28 comments

Show HN: Contrapunk – Real-time counterpoint harmony from guitar input

https://contrapunk.com/
116•waveywaves•1d ago•54 comments

Show HN: I built a small app for FSI German Course

https://detawk.com/
53•syedmsawaid•4d ago•15 comments

Show HN: I developed a node editor framework using gpui

https://github.com/tu6ge/ferrum-flow
4•tu6ge•9h ago•4 comments

Show HN: Multi-agent coding assistant with a sandboxed Rust execution engine

https://github.com/christianmeurer/Lula
9•chrismeurer•14h ago•2 comments

Show HN: I built a frontpage for personal blogs

https://text.blogosphere.app/
772•ramkarthikk•3d ago•194 comments

Show HN: Apfel – The free AI already on your Mac

https://apfel.franzai.com
731•franze•3d ago•152 comments

Show HN: Aiaiai.guide: Plain-English mental model for LLM apps, tools and agents

https://aiaiai.guide/
4•myyke•11h ago•0 comments

Show HN: sllm – Split a GPU node with other developers, unlimited tokens

https://sllm.cloud
183•jrandolf•2d ago•89 comments

Show HN: I made open source, zero power PCB hackathon badges

https://github.com/KaiPereira/Overglade-Badges
157•kaipereira•2d ago•18 comments

Show HN: Mdarena – Benchmark your Claude.md against your own PRs

https://github.com/HudsonGri/mdarena
22•hudsongr•17h ago•4 comments