there are two generation modes: 1) a basic single-voice narrator and 2) a "dramatization" mode, which generates a cast list from the book contents and converts the entire text into a script with distinct voices and instructions for each segment.
i used this with a self-hosted gpt-oss-120b, which turned out very well. the resulting mp3 files work well with the open source "Voice" audiobook player on F-Droid.
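to make the dramatization idea above concrete, here is a rough sketch of what a cast list and a per-segment script could look like. this is not the project's actual format: `CastMember`, `Segment`, `assign_voices`, and the voice ids are all hypothetical names picked for illustration, and the LLM call that would produce the cast and script is left out.

```python
from dataclasses import dataclass


@dataclass
class CastMember:
    name: str   # character name as it appears in the script
    voice: str  # hypothetical voice id understood by some TTS backend


@dataclass
class Segment:
    speaker: str            # who reads this segment
    text: str               # the words to be spoken
    instructions: str = ""  # delivery hint, e.g. "startled"


def assign_voices(cast, script, narrator_voice):
    """Pair every segment with a voice id, falling back to the narrator
    voice for speakers that are missing from the cast list."""
    voices = {member.name: member.voice for member in cast}
    return [(voices.get(seg.speaker, narrator_voice), seg) for seg in script]


# toy data standing in for what the model would generate from a book
cast = [CastMember("Narrator", "voice_a"), CastMember("Alice", "voice_b")]
script = [
    Segment("Narrator", "Alice looked up.", "calm, measured"),
    Segment("Alice", "Who's there?", "startled"),
    Segment("Rabbit", "I'm late!", "hurried"),  # not in the cast list
]

plan = assign_voices(cast, script, narrator_voice="voice_a")
for voice, seg in plan:
    print(f"{voice}: [{seg.instructions}] {seg.text}")
```

the fallback to the narrator voice is just one possible design choice: it keeps generation going when the script mentions a speaker the cast list missed.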
khimaros•1h ago
feedback welcome!