frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Abogen – Generate audiobooks from EPUBs, PDFs and text

https://github.com/denizsafak/abogen
63•mzehrer•2h ago

Comments

nikolayasdf123•1h ago
can I choose any voice? would love to read software engineering books in voice of Morgan Freeman, or maybe even better, Scarlett Johansson
hulitu•44m ago
Why not Stephen Hawking ?
hajimuz•29m ago
Yeah, could be a buff like 500% brain supercharge.
pyman•9m ago
The voice of Mickey Mouse would be nice.
TOGoS•57m ago
The demo video doesn't seem to have any audio in it! At least none that either ffmpeg or whatever Firefox uses can recognize.
huseyinkeles•52m ago
I can hear it on safari
Daunk•49m ago
Same on my end, no audio in the video.
jamilton•30m ago
Same here, but it worked when I opened it in Chrome. What a weird error - you would think that playing an embedded mp4 with audio wouldn't differ from browser to browser.
ertian•22m ago
Yeah, I've run a local Kokoro instance, and it doesn't work with Firefox. This uses Kokoro under the hood.
8s2ngy•57m ago
I've been using Kokoro TTS with the CLI app, audiblez, mentioned in the "Similar Projects" section of the README. The model is fast and delivers impressive quality for its small size. Some issues I have faced, however, are: a) It doesn't distinguish periods at the end of sentences from the dots in abbreviations such as "Mr." or "Mrs." The result is an awkward pause between "Mr." and the name. b) It doesn't handle ellipses well. c) Words are pronounced the same way regardless of context.
rkagerer•55m ago
The Mr. / Mrs. thing feels like it would be a pretty easy fix, at least to eliminate a lot of the more common cases.
anotherpaul•49m ago
Does it turn it into spoken word or an audiobook? Because good audiobooks often have voice actors that read the characters with different emphasis and dialects. I imagine tools like chatgpt could do this for a few sentences but what about an 8-20 hour audiobook?

I think there are still basic hurdles to take before we can go epub to audiobook in a quality that can compete with current state of the art.

Or am I missing something?

jamilton•26m ago
Elevenlabs has a feature for a "full cast"-type generation, where different characters will get different voices. It's certainly not automatically sensitive to dialect though.

It's probably possible with current systems to do though. I believe there are TTS systems that can use context/prompting to change emphasis and other speech qualities, though I'm not sure how reliably.

floppyd•16m ago
I tried Kokoro for voicing blog posts and articles and wasn't impressed to be honest. Right now Gemini 2.5 Flash TTS is a much more capable system with generous free limits (about 10 minutes per generation and about 90 minutes per day). Voices are not very consistent between generations, but for shorter pieces it's not a big deal (but will obviously be for books)

Why Deep Learning Works Unreasonably Well

https://www.youtube.com/watch?v=qx7hirqgfuU
1•phildawes•8m ago•0 comments

CMakeDependencyDiagram – Interactive target dependency visualization for CMake

https://github.com/renn0xtek9/CMakeDependencyDiagram
1•renn0xtek9•16m ago•1 comments

Time to Talk Numbers

https://hugston.com/articles/Time_to_talk_numbers
1•trilogic•17m ago•1 comments

Who use and how are use the hand scan data?

1•aurelien•20m ago•0 comments

Why Paying for Spotify Mostly Pays Taylor Swift

https://mertbulan.com/2025/08/10/why-paying-for-spotify-mostly-pays-taylor-swift/
1•mertbio•21m ago•0 comments

Culture Game Over

https://web.archive.org/web/20171018143123/https://www.numair.com/culture/game-over
1•kwie•21m ago•1 comments

We're building "klarna" but for your annual software subscriptions

https://www.annualize.co/
2•bfayyumii•25m ago•2 comments

We're building "klarna" but for your annual software subscriptions

1•bfayyumii•26m ago•0 comments

Show HN: AI Coloring Pages Generator

https://aicoloringpages.app/
3•tomstig•31m ago•1 comments

Self-hosted open-source multi-user multi-platform secret management

http://day-to-day-stuff.blogspot.com/2025/08/self-hosted-open-source-multi-user.html
2•erikvanoosten•34m ago•0 comments

CUDA C++ Best Practices Guide [pdf]

https://docs.nvidia.com/cuda/pdf/CUDA_C_Best_Practices_Guide.pdf
2•throwawaybutwhy•36m ago•1 comments

'It's a Mess': A Brain-Bending Trip to Quantum Theory's 100th Birthday Party

https://www.quantamagazine.org/its-a-mess-a-brain-bending-trip-to-quantum-theorys-100th-birthday-party-20250808/
2•nsoonhui•37m ago•0 comments

Cloudflare vs. Perplexity:Why AI Scraping Without Paying Is Digital Theft [video]

https://www.youtube.com/watch?v=qehRsBYawkY
1•real-hacker•38m ago•1 comments

Cheaters Spotted in Battlefield 6 Beta, Despite Secure Boot Requirement

https://www.ign.com/articles/cheaters-already-spotted-in-battlefield-6-open-beta-despite-secure-boot-requirement
2•josephcsible•42m ago•0 comments

Show HN: Bugs and Feedback Collection Tool

https://www.feedbugs.com/
1•vignzviki•47m ago•0 comments

Pepe Auth

https://en.wikipedia.org/wiki/Pepe_Auth
2•motorest•47m ago•0 comments

Vvveb – open-source CMS drag-and-drop site builder

https://www.vvveb.com/
1•nreece•49m ago•0 comments

Being a Good PM at Google

https://thechrisperry.substack.com/p/being-a-good-pm-at-google
1•bruckie•49m ago•0 comments

David Chalmers: Could a Large Language Model Be Conscious?

https://arxiv.org/abs/2303.07103
2•hackandthink•54m ago•0 comments

Useful GitHub template for Git projects and readme.md

https://github.com/dec0dOS/amazing-github-template
1•pinter69•54m ago•0 comments

How to Use Snprintf

https://bernsteinbear.com/blog/snprintf/
1•ingve•58m ago•0 comments

Automation Tools Are Fast... Until You Pay

https://medium.com/@mohamedalibenothmen1/why-automation-tools-get-slower-after-you-pay-them-953d08e2142d
1•dalibenothmen•1h ago•0 comments

Show HN: MinimaTXT – local-only .txt notes App for iPhone

https://apps.apple.com/us/app/minimatxt/id6749180676
1•Banev•1h ago•0 comments

Why do we even need SIMD instructions?

https://lemire.me/blog/2025/08/09/why-do-we-even-need-simd-instructions/
5•ingve•1h ago•0 comments

AI coding tools considered harmful

https://twitter.com/QuentinAnthon15/status/1943948791775998069
2•lr0•1h ago•0 comments

GPT-5 model price comparison via pelicans on a bicycle

https://nezhar.com/blog/gpt-5-model-price-comparison-via-pelicans-on-bicycle/
1•nezhar•1h ago•0 comments

Digital resurrection: fascination and fear over the rise of the deathbot

https://www.theguardian.com/news/ng-interactive/2025/aug/10/artificial-intellligence-avatar-death-grief-digital-resurrection-fascination-deathbot
1•devuo•1h ago•0 comments

Move fast and don't break (safety critical) things

https://substack.com/home/post/p-170571873
2•amusely•1h ago•0 comments

Agentic Web – Weaving the Next Web with AI Agents

https://arxiv.org/abs/2507.21206
1•hunglee2•1h ago•0 comments

A Cautionary Tale for Stupid Idiots Who Think They Can Lead with Integrity

https://jamesjboyer.substack.com/p/a-cautionary-tale-for-stupid-idiots
1•gpi•1h ago•0 comments