Show HN: PageEcho – Offline AI eBook Reader (On-Device TTS and AI)

https://apps.apple.com/us/app/pageecho/id6755965837
1•page_echo•1mo ago
Hi HN! I built PageEcho, a fully on-device eBook reader for iOS that integrates offline TTS and Apple Intelligence features. It started as a personal challenge to create a reading app that never sends data to a server while still supporting modern AI capabilities.

What’s unique:
• Everything (TTS, summaries, Q&A, mind-maps, translation) runs locally on the device
• Supports EPUB, PDF, MOBI, AZW3, TXT, and FB2 with a single unified reading pipeline
• Uses Supertonic ONNX for high-quality offline speech (no cloud, no network latency)
• Integrates Apple Intelligence for chapter-level analysis on supported devices
• Local SQLite storage for highlights, progress, and reading analytics
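For readers wondering what local SQLite storage for highlights and progress might look like, here is a minimal sketch. It is written in Python purely for illustration and is not PageEcho's code: the table names, column names, and helper functions are all assumptions.

```python
import sqlite3

def open_store(path=":memory:"):
    """Open (or create) a local store. The schema below is hypothetical."""
    db = sqlite3.connect(path)
    db.executescript("""
        CREATE TABLE IF NOT EXISTS highlights (
            id         INTEGER PRIMARY KEY,
            book_id    TEXT NOT NULL,
            anchor     TEXT NOT NULL,   -- position anchor, e.g. a CFI string
            text       TEXT NOT NULL,
            created_at TEXT DEFAULT CURRENT_TIMESTAMP
        );
        CREATE TABLE IF NOT EXISTS progress (
            book_id TEXT PRIMARY KEY,   -- one row per book
            anchor  TEXT NOT NULL,
            percent REAL NOT NULL
        );
    """)
    return db

def add_highlight(db, book_id, anchor, text):
    """Insert a highlight and return its row id."""
    with db:  # commits on success, rolls back on error
        cur = db.execute(
            "INSERT INTO highlights (book_id, anchor, text) VALUES (?, ?, ?)",
            (book_id, anchor, text),
        )
    return cur.lastrowid

def save_progress(db, book_id, anchor, percent):
    """Overwrite the single progress row for this book."""
    with db:
        db.execute(
            "INSERT OR REPLACE INTO progress (book_id, anchor, percent) "
            "VALUES (?, ?, ?)",
            (book_id, anchor, percent),
        )

def load_progress(db, book_id):
    """Return (anchor, percent) for the book, or None if never opened."""
    return db.execute(
        "SELECT anchor, percent FROM progress WHERE book_id = ?", (book_id,)
    ).fetchone()
```

Keeping progress as one row per book means a save is a single small write, while highlights stay append-only.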

Technical notes for those curious:
• EPUB/TXT/MOBI parsing is consolidated into a WebView-based renderer with CFI support
• PDF mode uses native PDFKit overlays with TTS + translation layers
• TTS runs on-device using a quantized Supertonic model with chunked streaming
• AI summaries/Q&A rely on local Apple Intelligence calls with caching at the chapter level
• The entire app operates offline: no accounts, no telemetry, no external servers
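The chapter-level caching mentioned above could be sketched roughly as follows. This is an illustration in Python under stated assumptions, not the app's implementation: `ChapterCache` and its `compute` callable are hypothetical, and `compute` merely stands in for whatever on-device model call produces a summary or answer.

```python
import hashlib

class ChapterCache:
    """Memoize per-chapter model output, keyed by the chapter's content."""

    def __init__(self, compute):
        self._compute = compute  # hypothetical stand-in for the model call
        self._store = {}
        self.misses = 0          # counts how often the model actually ran

    def get(self, book_id, chapter_index, chapter_text, task="summary"):
        # Hash the text so an edited or re-imported chapter gets a new entry
        # instead of returning a stale result.
        digest = hashlib.sha256(chapter_text.encode("utf-8")).hexdigest()
        key = (book_id, chapter_index, task, digest)
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._compute(task, chapter_text)
        return self._store[key]
```

Including the task name in the key lets summaries and Q&A results for the same chapter coexist. A real app would likely persist the entries rather than hold them in memory.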

I’d love feedback from readers, mobile devs, and anyone interested in on-device AI design. Happy to answer any technical questions!

Comments

page_echo•1mo ago
Thanks for taking a look. I wanted to share some of the background and trade-offs that shaped the project, since the constraint of “everything must run locally” influenced almost every decision.

The biggest challenge was balancing feature completeness with device limitations. Running TTS and AI fully on-device meant carefully managing memory spikes, chunking long texts, avoiding UI stalls, and working within the restrictions of Apple Intelligence availability. Getting TTS to feel continuous required experimentation with segmentation, buffering, and timing, especially for long-form documents.
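To make the segmentation and buffering idea concrete, here is a small single-threaded sketch in Python. It is an assumption-laden illustration rather than the app's pipeline: `chunk_text`, `stream_audio`, the `max_chars` limit, and the `synthesize` callable are all hypothetical, and a real app would run synthesis off the main thread.

```python
import re
from collections import deque

# Split after sentence-ending punctuation followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")

def chunk_text(text, max_chars=200):
    """Pack whole sentences into chunks of at most max_chars.

    A single sentence longer than max_chars is kept whole as its own chunk.
    """
    chunks, current = [], ""
    for sentence in _SENTENCE_END.split(text.strip()):
        if not sentence:
            continue
        if current and len(current) + 1 + len(sentence) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}" if current else sentence
    if current:
        chunks.append(current)
    return chunks

def stream_audio(chunks, synthesize, prefetch=2):
    """Yield synthesized chunks in order, holding at most `prefetch`
    finished chunks in the buffer between yields, so memory stays bounded
    regardless of document length."""
    pending = deque()
    for chunk in chunks:
        pending.append(synthesize(chunk))
        if len(pending) > prefetch:
            yield pending.popleft()
    while pending:
        yield pending.popleft()
```

Splitting on sentence boundaries avoids cutting speech mid-phrase, and the small lookahead buffer is what lets playback of one chunk overlap with synthesis of the next once the two run concurrently.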

Another interesting challenge was unifying such different file formats into a consistent reading experience. Rather than building multiple rendering paths, I ended up normalizing most formats into HTML and relying on CFI anchors and a WebView-based system. This reduced code surface but introduced its own edge cases, especially around selection accuracy and highlight persistence.
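One common way to make highlights survive small changes in the normalized text is to store the quoted text alongside the position and re-anchor by searching when the position no longer matches. The Python sketch below shows that fallback. Note that it is not CFI and not the app's approach; it is a simpler quote-based technique, and `Anchor`, `make_anchor`, and `resolve` are hypothetical names.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Anchor:
    start: int   # character offset in the normalized text
    end: int
    exact: str   # the highlighted text itself

def make_anchor(text, start, end):
    """Capture both the offsets and the quoted text of a selection."""
    return Anchor(start, end, text[start:end])

def resolve(anchor, text):
    """Return (start, end) for the highlight in `text`, or None if lost."""
    # Fast path: the stored offsets still point at the same text.
    if text[anchor.start:anchor.end] == anchor.exact:
        return anchor.start, anchor.end
    # Fallback: pick the occurrence closest to the original position.
    best = None
    pos = text.find(anchor.exact)
    while pos != -1:
        if best is None or abs(pos - anchor.start) < abs(best - anchor.start):
            best = pos
        pos = text.find(anchor.exact, pos + 1)
    if best is None:
        return None
    return best, best + len(anchor.exact)
```

Choosing the nearest occurrence matters for short highlights that repeat in a chapter. Storing some surrounding context as well would disambiguate further.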

PDF brought its own set of problems — mainly keeping performance stable on large files and making sure overlays (search, highlights, TTS, translation) stayed synchronized with page geometry. I’m still improving this area.
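Part of keeping overlays aligned is converting rectangles from PDF page space, whose default origin is the bottom-left corner, into a top-left-origin view space at the current zoom. A minimal Python sketch of that mapping follows. It assumes an unrotated page with no crop-box offset, and the function name and parameters are hypothetical rather than PDFKit API.

```python
def page_rect_to_view(rect, page_height, scale, offset=(0.0, 0.0)):
    """Map an (x, y, width, height) rect from page space to view space.

    rect        -- rectangle in page points, origin at bottom-left
    page_height -- page height in points
    scale       -- current zoom factor (view pixels per page point)
    offset      -- where the page's top-left corner sits in the view
    """
    x, y, w, h = rect
    ox, oy = offset
    view_x = x * scale + ox
    # Flip the y-axis: the rect's top edge in page space is y + h.
    view_y = (page_height - (y + h)) * scale + oy
    return (view_x, view_y, w * scale, h * scale)
```

Recomputing overlay frames from page-space rects on every zoom or scroll change, instead of storing view-space frames, is one way to keep drift from accumulating.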

I’m very interested in learning how others approach on-device inference, streaming models, memory usage patterns, and PDF performance. If anyone has experience or ideas in those areas, I’d love to hear them.

Happy to answer questions about any part of the implementation.
