Show HN: Voiced, image-based D&D inspired AI-native RPG

https://i-am-neon.itch.io/infinit

1•tommywilczek•7h ago

I'm a solo dev and I built a visual novel-style RPG where you type what you want to do and an AI game master responds in real time. Free alpha, plays in the browser.

What makes it different from AI Dungeon: the AI doesn't just generate text. It emits structured commands that change the music, move NPCs between locations, give/remove items, swap character portraits based on emotional reactions, and trigger cutscenes. Cinematic stills are generated on the fly with Flux 2 Klein 4B, and characters are voiced in real time via Inworld. Separate AI agents maintain a quest journal and write save summaries. The result feels more like a tabletop RPG session than a chatbot conversation.
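The structured-command idea can be sketched roughly as follows. This is a minimal illustration, not the game's actual protocol: the JSON shape (`narration` plus a `commands` list), the `op` names, and the `GameState` fields are all assumptions made up for the example.

```python
import json
from dataclasses import dataclass, field


@dataclass
class GameState:
    """Hypothetical minimal world state; field names are illustrative."""
    music: str = "silence"
    npc_locations: dict = field(default_factory=dict)
    inventory: list = field(default_factory=list)


def apply_command(state: GameState, cmd: dict) -> None:
    """Apply one structured command to the state; unknown ops are ignored."""
    op = cmd.get("op")
    if op == "set_music":
        state.music = cmd["track"]
    elif op == "move_npc":
        state.npc_locations[cmd["npc"]] = cmd["to"]
    elif op == "give_item":
        state.inventory.append(cmd["item"])
    elif op == "remove_item":
        if cmd["item"] in state.inventory:
            state.inventory.remove(cmd["item"])


def run_turn(state: GameState, model_output: str) -> str:
    """Parse one turn of the assumed form {"narration": ..., "commands": [...]}.

    Commands mutate the state; the narration text is returned for display.
    """
    turn = json.loads(model_output)
    for cmd in turn.get("commands", []):
        apply_command(state, cmd)
    return turn.get("narration", "")
```

Keeping the commands as plain data means the client only ever executes operations it already knows about, whatever text the model produces around them.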

The world is hand-crafted, not AI-generated. I wrote all the locations, characters, and lore by hand (Himalayan fantasy setting inspired by travel through Nepal and Bhutan). The AI's job is to run the game inside that authored world. Everyone explores the same world, but every playthrough is different.

Stack: Godot 4.5 client, FastAPI backend, WebSocket streaming. Some AI calls use Gemini 3.1 Flash Lite, others use Claude Haiku 4.5 (cannot wait for 4.6). Cutscene images generated on the fly with Flux 2 Klein 4B. Voice TTS via Inworld.

Every turn costs real money in AI inference and I'm covering it until the $100 runs out (which will be a while because these models are SO cheap to run). Happy to answer questions about the architecture.

Comments

vunderba•6h ago

Nice job. I've also experimented with an invisible AI overlord in conjunction with a hand-built interactive fiction environment, mainly to graft on a much more powerful and expressive "text parser" than older games had.

Are you at least caching the image gen in something like S3? Even with cheaper models, I would think that would add up fast.

tommywilczek•6h ago

Nice! And I'm running the image models through Replicate, which gives back an ephemeral link. Those images then get saved to the client, so I'm not paying to host them.

vunderba•6h ago

Very cool! Depending on your local hardware, you could also run something like Z-Image Turbo or Klein locally, then use a daily cron job to pre-compute a bunch of new images for the game and serve them online as well~

That way you've got a fallback when credits run out, or if the 3rd party API takes too long to generate a new image, etc.
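One way to sketch that fallback, assuming the image call is an async function: wrap it in a timeout and return a pre-computed image when it is too slow or fails. The `generate` callable, `fallback_url`, and the 5-second default are hypothetical placeholders, not anything from the game's code.

```python
import asyncio


async def image_with_fallback(generate, prompt, fallback_url, timeout_s=5.0):
    """Return a freshly generated image URL, or fallback_url on timeout or error.

    `generate` is any coroutine function taking a prompt and returning a URL
    (a stand-in for the third-party image API).
    """
    try:
        return await asyncio.wait_for(generate(prompt), timeout=timeout_s)
    except (asyncio.TimeoutError, ConnectionError):
        # Too slow or unreachable: serve an image pre-computed by the daily job.
        return fallback_url
```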

tommywilczek•6h ago

Totally. To get it out the door I kept it in the cloud, but that would be a good optimization.

Honestly, the biggest issue I've been facing is Google's ability to host its Gemini 3 Flash and 3.1 Flash Lite models. They CONSTANTLY return 503: model unavailable due to high usage... Next step will be to play with a more reliable LLM that's still fast and smart enough.
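A common mitigation for intermittent 503s is to retry with exponential backoff and then fall through to a second model. The sketch below is an assumption about how that could look, not the author's setup: `ModelUnavailable` stands in for whatever error the real client raises on a 503, and each provider is just a callable that takes a prompt.

```python
import random
import time


class ModelUnavailable(Exception):
    """Stand-in for an HTTP 503 from a model provider (hypothetical)."""


def call_with_fallback(providers, prompt, retries=3, base_delay=0.5, sleep=time.sleep):
    """Try each provider in order, retrying 503-style failures with backoff.

    `providers` is a list of callables taking a prompt and returning text.
    Raises ModelUnavailable only if every provider fails every attempt.
    """
    for call in providers:
        for attempt in range(retries):
            try:
                return call(prompt)
            except ModelUnavailable:
                if attempt < retries - 1:
                    # Exponential backoff with a little jitter before retrying.
                    sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
    raise ModelUnavailable("all providers exhausted")
```

For a real-time game the retry count and delays would need to stay small, since every retry adds latency to the player's turn.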

tommywilczek•6h ago

For the images that I do want to cache, I use Cloudflare's R2, which is basically S3 with no egress fees (as in no bandwidth charge when the images are downloaded). I'm well within the free tier with a few dozen users of the game.
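For cached images, one simple scheme is to derive the object key from the model name and prompt, so identical requests map to the same stored file. This is a sketch under assumptions: the key layout is invented for illustration, and a plain `dict` stands in for the bucket so the example stays self-contained.

```python
import hashlib


def image_cache_key(prompt: str, model: str) -> str:
    """Derive a deterministic object key from the model name and prompt."""
    digest = hashlib.sha256(f"{model}\n{prompt}".encode("utf-8")).hexdigest()
    return f"images/{digest}.png"


def get_or_generate(store: dict, prompt: str, model: str, generate) -> bytes:
    """Return cached image bytes, calling `generate` only on a cache miss.

    `store` is a dict standing in for an object store bucket; `generate`
    is a hypothetical callable that takes a prompt and returns image bytes.
    """
    key = image_cache_key(prompt, model)
    if key not in store:
        store[key] = generate(prompt)
    return store[key]
```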
