What’s unique:

• Everything (TTS, summaries, Q&A, mind maps, translation) runs locally on the device
• Supports EPUB, PDF, MOBI, AZW3, TXT, and FB2 with a single unified reading pipeline
• Uses Supertonic ONNX for high-quality offline speech (no cloud, no latency)
• Integrates Apple Intelligence for chapter-level analysis on supported devices
• Local SQLite storage for highlights, progress, and reading analytics
Technical notes for those curious:

• EPUB/TXT/MOBI parsing is consolidated into a WebView-based renderer with CFI support
• PDF mode uses native PDFKit overlays with TTS + translation layers
• TTS runs on-device using a quantized Supertonic model with chunked streaming
• AI summaries/Q&A rely on local Apple Intelligence calls with caching at the chapter level (a sketch of the caching pattern follows this list)
• The entire app operates offline: no accounts, no telemetry, no external servers
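To make the chapter-level caching concrete, here is a minimal sketch of the pattern, assuming an async call into the on-device model. The names (ChapterSummaryCache, the summarize closure) are mine for illustration, not the app's actual API:

    import Foundation

    // Sketch: each chapter hits the on-device model at most once per
    // session. ChapterSummaryCache and `summarize` are illustrative
    // names, not the app's real types.
    actor ChapterSummaryCache {
        private var store: [String: String] = [:]   // chapterID -> summary

        func summary(for chapterID: String,
                     text: String,
                     using summarize: @Sendable (String) async throws -> String) async throws -> String {
            if let cached = store[chapterID] { return cached }
            let fresh = try await summarize(text)   // local model call
            store[chapterID] = fresh                // reuse for the rest of the session
            return fresh
        }
    }

A persistent version would presumably also key on the model or prompt version so stale summaries can be invalidated, but the shape is the same.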
I’d love feedback from readers, mobile devs, and anyone interested in on-device AI design. Happy to answer any technical questions!
page_echo•1h ago
The biggest challenge was balancing feature completeness with device limitations. Running TTS and AI fully on-device meant carefully managing memory spikes, chunking long texts, avoiding UI stalls, and working within Apple Intelligence's device-availability restrictions. Getting TTS to feel continuous required experimentation with segmentation, buffering, and timing, especially for long-form documents.
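To make the segmentation idea concrete, here's a rough sketch of the pattern (not the app's actual code): split text on sentence boundaries under a character budget, then synthesize chunk N+1 while chunk N plays so playback never gaps. Only NLTokenizer is real API here; AudioBuffer, synthesize, and play are stand-ins for the Supertonic and audio-engine plumbing:

    import NaturalLanguage

    // Stand-ins for the real TTS and audio-engine plumbing (hypothetical).
    struct AudioBuffer {}
    func synthesize(_ text: String) async throws -> AudioBuffer { AudioBuffer() }
    func play(_ buffer: AudioBuffer) async throws { /* feed the audio engine */ }

    // Split long-form text into sentence-aligned chunks under a character
    // budget, so each TTS call stays small and memory spikes stay bounded.
    // (A single oversized sentence becomes its own chunk.)
    func sentenceChunks(from text: String, maxChars: Int = 400) -> [String] {
        let tokenizer = NLTokenizer(unit: .sentence)
        tokenizer.string = text
        var chunks: [String] = []
        var current = ""
        tokenizer.enumerateTokens(in: text.startIndex..<text.endIndex) { range, _ in
            let sentence = String(text[range])
            if !current.isEmpty, current.count + sentence.count > maxChars {
                chunks.append(current)   // flush before the budget overflows
                current = ""
            }
            current += sentence
            return true                  // keep enumerating
        }
        if !current.isEmpty { chunks.append(current) }
        return chunks
    }

    // Double-buffered playback: prefetch the next chunk while this one plays.
    func speak(_ text: String) async throws {
        let chunks = sentenceChunks(from: text)
        var pending: Task<AudioBuffer, Error>? =
            chunks.first.map { c in Task { try await synthesize(c) } }
        for i in chunks.indices {
            guard let current = pending else { break }
            let buffer = try await current.value
            pending = (i + 1 < chunks.count)
                ? Task { try await synthesize(chunks[i + 1]) }   // prefetch next
                : nil
            try await play(buffer)       // returns when the chunk finishes
        }
    }

A two-slot pipeline like this keeps at most one synthesized chunk ahead of playback, which is one way to bound the memory spikes mentioned above.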
Another interesting challenge was unifying such different file formats into a consistent reading experience. Rather than building multiple rendering paths, I ended up normalizing most formats into HTML and relying on CFI anchors and a WebView-based renderer. This reduced the code surface but introduced its own edge cases, especially around selection accuracy and highlight persistence.
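To show what the anchor round trip looks like, here's a minimal sketch (again, not the app's actual code): the selection is converted to a CFI string by JavaScript injected into the page, persisted natively, and re-resolved when the chapter re-renders. The reader.* helpers are hypothetical names for that injected JS; only evaluateJavaScript is real WKWebView API:

    import WebKit

    // Sketch of persisting highlights as CFI strings. The `reader.*`
    // helpers are assumed to be injected into the page; the names are
    // illustrative.
    final class HighlightBridge {
        private weak var webView: WKWebView?

        init(webView: WKWebView) { self.webView = webView }

        // Capture the current selection as a CFI so the highlight survives
        // reflow, font changes, and chapter re-renders.
        func captureSelection(completion: @escaping (String?) -> Void) {
            webView?.evaluateJavaScript("reader.cfiForSelection()") { result, _ in
                completion(result as? String)   // e.g. "epubcfi(/6/14!/4/2/10:3,...)"
            }
        }

        // Re-apply a stored highlight after the chapter renders again.
        func restore(cfi: String) {
            let escaped = cfi.replacingOccurrences(of: "'", with: "\\'")
            webView?.evaluateJavaScript("reader.highlight('\(escaped)')")
        }
    }

In a setup like this the native side stays thin; the tricky selection-accuracy cases live in the JS that computes CFIs against the normalized HTML.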
PDF brought its own set of problems — mainly keeping performance stable on large files and making sure overlays (search, highlights, TTS, translation) stayed synchronized with page geometry. I’m still improving this area.
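For the overlay synchronization specifically, here is a sketch of one option (not necessarily what ships): keep every overlay rect in page space and re-project into view space on each layout pass with PDFView.convert(_:from:), so zoom and scroll can never drift out of sync. OverlaySync and the apply hook are illustrative names:

    import UIKit
    import PDFKit

    // Sketch: overlays (search hits, highlights, the TTS cursor) are stored
    // in page coordinates and mapped to view coordinates on every layout
    // pass.
    final class OverlaySync {
        private let pdfView: PDFView
        private var scaleToken: NSObjectProtocol?
        // Page-space rects for one overlay kind, e.g. search highlights.
        var pageRects: [(page: PDFPage, rect: CGRect)] = []
        // Hypothetical hook: whoever draws the overlay gets view-space rects.
        var apply: ([CGRect]) -> Void = { _ in }

        init(pdfView: PDFView) {
            self.pdfView = pdfView
            // Re-project whenever the zoom scale changes; scrolling would be
            // handled the same way from the scroll view callbacks.
            scaleToken = NotificationCenter.default.addObserver(
                forName: .PDFViewScaleChanged, object: pdfView, queue: .main
            ) { [weak self] _ in self?.layoutOverlays() }
        }

        func layoutOverlays() {
            // PDFView.convert(_:from:) maps a page-space rect into view space.
            apply(pageRects.map { pdfView.convert($0.rect, from: $0.page) })
        }
    }

For plain highlights, PDFAnnotation sidesteps the projection entirely, since annotations already live in page space; the manual conversion is mainly useful for custom layers like a spoken-word cursor.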
I’m very interested in learning how others approach on-device inference, streaming models, memory usage patterns, and PDF performance. If anyone has experience or ideas in those areas, I’d love to hear them.
Happy to answer questions about any part of the implementation.