Show HN: Inworld Runtime – A C++ graph-based runtime for production AI apps

3•rogilop•1h ago

Hey HN, this is Igor, one of the engineers behind Inworld Runtime, which we're releasing today in public preview.

We built it to solve the common problem we and our customers had: engineers spend more time on AI ops and plumbing than on actual feature development. This was often due to the challenge of using Python for I/O-bound, high-concurrency workloads and complexity maintaining pipelines with streams that use always-changing ML models.

Our solution is a high-performance runtime written in C++ with the core idea of defining AI logic as graphs. For instance, a basic voice-to-voice agent consists of STT → LLM → TTS nodes, while the connecting edges stream data and enforce conditions. This graph engine is portable (Linux, Windows, macOS) and can run on-device.

We built a few key features on top of this C++ core:

- Extensions. Runtime architecture decouples graph definition from implementation. If a pre-built component doesn't exist, you can register your own custom node/code and reuse it in any graph without writing any glue code.

- Routers. You can dynamically select models/settings on the per-node basis depending on the traffic as well as configure policies for fallbacks and retries to get the app ready for production.

- The Portal. A web-based control plane UI to deploy graphs, push config changes instantly, run A/B tests on live traffic, and monitor your app with logs, traces, and metrics.

- Unified API. Use our optimized models or route to providers like OpenAI, Anthropic, and Google through a single, consistent interface and one API key.

We have a Node.js SDK out now, with Python, Unity, Unreal, and native C++ coming soon. We plan to open-source the SDKs, starting with Node.js.

The docs are here: https://docs.inworld.ai/docs/runtime/overview

We're eager for feedback from fellow engineers and builders. What do you think?

Comments

skyzouwdev•58m ago

This is interesting — I’ve run into that exact pain point of spending more time on plumbing than building features.

Curious about the choice of C++ for the core runtime. Was the main driver raw performance, or more about portability and low overhead in production environments? In my case, I’ve mostly worked with Python pipelines and they tend to choke when you try to scale concurrency without overcomplicating the architecture.

The graph-based approach reminds me a bit of Unreal’s Blueprints, but for AI ops. If you can make it genuinely easy to plug in custom nodes without touching glue code, that could be a big productivity boost.

rogilop•20m ago

So... The true story is that it all started as an internal infra.

Originally, Inworld was mostly involved into gaming (read as Unreal & Unity) and therefore the choice. Yet (surprise, surprise) we want native x-platform and on-device (whose official support is coming soon).

Also, not a new thing, but C++ is more reliable than Python under really high concurrency, which was critical for us (a lot of people using same graphs in parallel kind of use case).

IBM Cloud hit by Severity One incident with the same symptoms as recent SNAFUs

Old Movie User Interfaces

Debian GNU/Hurd 2025 Released with Completed 64-Bit Support, Rust Ported

April Fools 2014: The *Real* Test Driven Development

Ask HN: How did Windsurf investors make out like bandits when employees got $0?

Show HN: YC ED – Search YC co-founders by stack, stage, and location(no swiping)

Securing Local MCP Servers with Docker

NFS Client and Server Feature Updates for Linux 6.17

What Is America, and for Whom?

Knee implant used by NHS in thousands of operations known for years to be faulty

Show HN: Chrome ext to open HN links in new tab

The Web Book

Should You Retrain That Model?

Sonnet 4 crushes competition on long context benchmark

GPT-5 at top of Kotlin bench

New evidence suggests Neolithic farmers cannibalized enemies

Implications of reviving historical fungi for plant pathology and agriculture

We Killed Hold Music — Here’s How You Can Too

Dear Chat-bot Platforms: Please, just one feature

Who Wins and Who Loses When AI Makes Decisions [video]

Same AI, Different Answer: How Tiny Prompts Can Change Everything

Hire Me

Russian hackers took control of Norwegian dam, police chief says

Distributed Denial of Secrets: hacked and leaked documents

How Stock Options Work

PowerToys 0.93

Cross-Site Request Forgery

Implementing a basic equivalent of OpenBSD's pflog in Linux nftables

Google Preferred Sources Rolling Out in US and India

Why do we even need SIMD instructions?