Storage is S3 instead of broker disks. Agents are stateless, so if one dies, another picks up where it left off. Metadata lives in Postgres. You can use the managed platform at streamhouse.app or self-host the whole thing with Docker Compose. Either way, it's one service instead of five.
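
To make the "stateless agent" idea concrete, here's a rough toy model. It is not StreamHouse's actual code: the `Cluster` and `Agent` classes, the segment key layout, and the method names are all made up for illustration, and plain dicts stand in for S3 and Postgres. The only point is that an agent holding no local state can be swapped for another without losing data.

```python
# Toy model only: dicts stand in for S3 and Postgres, and every name here
# is hypothetical. This is not how StreamHouse is actually implemented.
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Cluster:
    # Stand-in for S3: segment key -> list of records.
    object_store: dict = field(default_factory=dict)
    # Stand-in for Postgres: topic -> next offset to assign.
    metadata: dict = field(default_factory=dict)


class Agent:
    """Holds no data of its own; every call goes to shared storage."""

    def __init__(self, name: str, cluster: Cluster):
        self.name = name
        self.cluster = cluster

    def produce(self, topic: str, records: list[str]) -> int:
        base = self.cluster.metadata.get(topic, 0)
        # Write the segment first, then record the new next offset.
        self.cluster.object_store[f"{topic}/{base:08d}"] = list(records)
        self.cluster.metadata[topic] = base + len(records)
        return base

    def fetch(self, topic: str, offset: int = 0) -> list[str]:
        out = []
        # Zero-padded base offsets keep segments in order when sorted.
        for key in sorted(self.cluster.object_store):
            seg_topic, base_str = key.rsplit("/", 1)
            if seg_topic != topic:
                continue
            base = int(base_str)
            for i, rec in enumerate(self.cluster.object_store[key]):
                if base + i >= offset:
                    out.append(rec)
        return out


cluster = Cluster()
a = Agent("agent-a", cluster)
a.produce("orders", ["o1", "o2"])
del a  # agent-a "dies"; nothing is lost because it held no state

b = Agent("agent-b", cluster)
b.produce("orders", ["o3"])
recovered = b.fetch("orders")
print(recovered)  # ['o1', 'o2', 'o3']
```

A real system also has to handle concurrent writers and partial failures between the segment write and the metadata update, which this sketch ignores.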
What works today:
- CLI + Python/TypeScript SDKs
- REST and gRPC APIs (~2,600 msg/s on one EC2 instance)
- Pipelines that sink to Postgres and ClickHouse
- Schema registry (Avro, Protobuf, JSON Schema)
- SQL over streaming data
- Multi-tenant with org isolation and API key auth
What doesn't work yet: compatibility with confluent-kafka clients still needs wire protocol fixes, and I need more soak testing before I'd trust it with massive sustained loads.