Show HN: CastReader – Free TTS Extension That Reads Kindle Cloud Reader

https://chromewebstore.google.com/detail/castreader-tts-reader/foammmkhpbeladledijkdljlechlclpb

1•vinxu•1h ago

Every TTS browser extension fails on Kindle Cloud Reader. The reason: Amazon renders text using custom font subsets where glyph IDs don't map to standard Unicode. You select text, copy it, and get garbage. The DOM is useless.

  CastReader solves this by intercepting KindleModuleManager to capture font and token data, decoding glyph mappings from the binary font tables, then running
  Tesseract.js OCR locally in an offscreen document to calibrate the decoder. The final text comes from glyph decoding (not OCR) so it's accurate enough for
  word-level highlight sync.
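To make the calibrate-then-decode idea concrete, here is a minimal sketch, not CastReader's actual code. It assumes OCR samples can be aligned one character per glyph and builds the glyph table by majority vote; `GlyphMap`, `CalibrationSample`, `calibrate`, and `decodeRun` are hypothetical names, and the real Kindle token format is not described in this post.

```typescript
// Hypothetical shapes for illustration only.
type GlyphMap = Map<number, string>;

interface CalibrationSample {
  glyphIds: number[]; // glyph IDs captured for a short run of text
  ocrText: string;    // what OCR read for the same run
}

// Build a glyph-to-character table by majority vote across OCR samples.
// Samples whose lengths disagree cannot be aligned 1:1 and are skipped.
function calibrate(samples: CalibrationSample[]): GlyphMap {
  const votes = new Map<number, Map<string, number>>();
  samples.forEach(({ glyphIds, ocrText }) => {
    const chars = ocrText.split("");
    if (chars.length !== glyphIds.length) return;
    glyphIds.forEach((id, i) => {
      const ch = chars[i];
      if (ch === undefined) return;
      const tally = votes.get(id) ?? new Map<string, number>();
      tally.set(ch, (tally.get(ch) ?? 0) + 1);
      votes.set(id, tally);
    });
  });

  const map: GlyphMap = new Map();
  votes.forEach((tally, id) => {
    let best = "";
    let bestCount = 0;
    tally.forEach((count, ch) => {
      if (count > bestCount) {
        best = ch;
        bestCount = count;
      }
    });
    map.set(id, best);
  });
  return map;
}

// Decode glyph IDs through the table. Unknown glyphs become U+FFFD so
// gaps in the mapping stay visible instead of being silently dropped.
function decodeRun(glyphIds: number[], map: GlyphMap): string {
  return glyphIds.map((id) => map.get(id) ?? "\uFFFD").join("");
}
```

Once the table is built, decoding is a pure lookup, so an occasional OCR misread only matters if it wins the vote for a glyph.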
                                                                                                                                                                            
  WeRead (the largest Chinese reading platform) has a similar problem — it renders everything on canvas. CastReader uses a main-world content script injected at
  document_start to intercept fetch responses containing chapter data before the page consumes them.
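A rough sketch of what such a fetch wrapper could look like, under stated assumptions: `wrapFetch`, `isChapterUrl`, and `onCapture` are hypothetical names, the URL is treated as a plain string, and `ResponseLike` is a stand-in for the parts of the browser `Response` the sketch needs (`clone()` and `text()`).

```typescript
// Minimal stand-ins for the browser types, so the sketch is self-contained.
interface ResponseLike {
  clone(): ResponseLike;
  text(): Promise<string>;
}
type FetchLike = (url: string, init?: unknown) => Promise<ResponseLike>;

// Wrap a fetch function so matching responses are copied out before the
// page reads them. The page still receives the original response.
function wrapFetch(
  original: FetchLike,
  isChapterUrl: (url: string) => boolean,
  onCapture: (url: string, body: string) => void,
): FetchLike {
  return async (url, init) => {
    const response = await original(url, init);
    if (isChapterUrl(url)) {
      // A body can only be read once, so read a clone and leave the
      // original untouched for the page.
      response
        .clone()
        .text()
        .then((body) => onCapture(url, body))
        .catch(() => {
          /* ignore capture failures; the page must keep working */
        });
    }
    return response;
  };
}
```

In a main-world script the wrapper would be installed by reassigning `window.fetch` before any page script runs, which is why injecting at document_start matters.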

  For normal websites, there's a 3-tier extraction pipeline: 15+ site-specific extractors (Notion, Google Docs, ChatGPT, Claude, arXiv, etc.), a learned CSS selector rule
  system, and a universal visible-text-block algorithm that fuses ideas from Readability.js, Boilerpipe, and JusText — container scoring with text density, link density
  scaling, stop-word classification, and progressive retry with flag degradation.
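For readers unfamiliar with these heuristics, here is a simplified sketch of link-density filtering, stop-word classification, and retry with relaxed flags. It is not the extension's algorithm: the `Block` shape, the thresholds in `PASSES`, and the tiny stop-word list are all assumptions chosen for illustration.

```typescript
interface Block {
  text: string;     // all visible text in the block
  linkText: string; // the part of that text that sits inside links
}

interface Flags {
  maxLinkDensity: number;
  minStopWordRatio: number;
}

// Tiny illustrative list; real classifiers use per-language lists.
const STOP_WORDS = new Set(["the", "a", "an", "and", "of", "to", "in", "is", "it", "that"]);

// Share of the block's text that is link text (navigation is close to 1).
function linkDensity(block: Block): number {
  if (block.text.length === 0) return 1;
  return Math.min(1, block.linkText.length / block.text.length);
}

// Share of words that are stop words (running prose scores high).
function stopWordRatio(block: Block): number {
  const words = block.text.toLowerCase().split(/[^a-z']+/).filter((w) => w.length > 0);
  if (words.length === 0) return 0;
  return words.filter((w) => STOP_WORDS.has(w)).length / words.length;
}

function keepBlock(block: Block, flags: Flags): boolean {
  return linkDensity(block) <= flags.maxLinkDensity
    && stopWordRatio(block) >= flags.minStopWordRatio;
}

// Strict first, then progressively relaxed, in the spirit of Readability.js retries.
const PASSES: Flags[] = [
  { maxLinkDensity: 0.2, minStopWordRatio: 0.25 },
  { maxLinkDensity: 0.5, minStopWordRatio: 0.1 },
  { maxLinkDensity: 1, minStopWordRatio: 0 },
];

// Return the first result that is long enough; otherwise the loosest pass.
function extractText(blocks: Block[], minLength: number): string {
  let text = "";
  for (const flags of PASSES) {
    text = blocks.filter((b) => keepBlock(b, flags)).map((b) => b.text).join("\n");
    if (text.length >= minLength) break;
  }
  return text;
}
```

The strict pass drops navigation and link lists; the relaxed passes only run when the strict result is too short to be the article.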

  TTS runs through Kokoro, an open model supporting 40+ languages. Audio plays directly in the content script, so highlight sync reads currentTime with no added
  latency: no message passing, no offscreen document relay.
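As an illustration of highlight sync driven by currentTime, the sketch below maps a playback position to a word index with a binary search. The `WordTiming` shape and the idea that per-word start times are available are assumptions; the post does not say where the timings come from.

```typescript
// Hypothetical per-word timing, in seconds from the start of the audio.
interface WordTiming {
  word: string;
  start: number;
  end: number;
}

// Index of the last word whose start time is at or before currentTime,
// or -1 if playback has not reached the first word. Timings must be
// sorted by start time.
function wordIndexAt(timings: WordTiming[], currentTime: number): number {
  let lo = 0;
  let hi = timings.length - 1;
  let found = -1;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    if (timings[mid].start <= currentTime) {
      found = mid;   // candidate; look for a later word
      lo = mid + 1;
    } else {
      hi = mid - 1;
    }
  }
  return found;
}
```

In a content script this could be called from a requestAnimationFrame loop with the audio element's currentTime, updating the highlight only when the returned index changes.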

  Limitations I should be honest about: the voice library is small (Kokoro only, no premium neural voices), no mobile support, extraction still fails on some complex
  layouts (there's a manual content selector fallback), and the TTS server is something I run myself, so uptime isn't guaranteed.

  Completely free. No signup, no usage limits, no premium tier. Chrome and Edge.
