Transactional AI: Saga Pattern for Reliable AI Agent Workflows (v0.2)

https://github.com/Grafikui/Transactional-ai

2•grafikui•3w ago

Comments

grafikui•3w ago

Earlier this week I launched Transactional AI v0.1 to solve a problem I kept hitting: AI agents that half-executed and left systems in broken states.

The core idea: apply the Saga pattern (from distributed systems) to AI workflows. Every step has automatic rollback. If OpenAI succeeds but Stripe fails, the system automatically deletes the AI-generated content and refunds—no manual cleanup.

v0.2 adds production features based on feedback:

Distributed Execution (v0.2.0):

Redis-based distributed locking (prevents race conditions with multiple workers) PostgreSQL storage adapter (ACID compliance for regulated industries) Retry policies with exponential backoff (handles flaky LLM APIs) Observability & Reliability (v0.2.1):

Event hooks for monitoring (12 lifecycle events: step start/complete/fail/timeout/retry, compensation events, transaction lifecycle) Per-step timeouts (kill hung OpenAI calls after 30s) Testing utilities (in-memory storage/locks, no Redis/Postgres needed for tests) Example:

const tx = new Transaction('workflow-123', storage, { lock: new RedisLock('redis://localhost'), events: { onStepTimeout: (step, ms) => alerting.sendAlert(`${step} hung after ${ms}ms`), onStepFailed: (step, err, attempt) => logger.error(`${step} failed`, { err, attempt }) } });

await tx.run(async (t) => { const report = await t.step('generate-ai-report', { do: async () => await openai.createCompletion({...}), undo: async (result) => await db.reports.delete(result.id), retry: { attempts: 3, backoffMs: 2000 }, timeout: 30000 });

  await t.step('charge-customer', {
    do: async () => await stripe.charges.create({...}),
    undo: async (charge) => await stripe.refunds.create({ charge: charge.id }),
    timeout: 10000
  });

}); If anything fails: Automatic rollback in reverse order. Report deleted, payment refunded.

Architecture:

TypeScript, 21 passing tests, strict mode Storage adapters: File (dev), Redis (performance), Postgres (ACID), Memory (tests) Lock adapters: NoOp (single process), Redis (distributed), Mock (tests) CLI inspector: tai-inspect for debugging transaction state No heavyweight orchestration engines (Temporal, AWS Step Functions). Just a 450-line TypeScript library.

Production readiness: 8.0/10 (up from 6.5 in v0.1)

Considering for v0.3.0: compensation retry policies, parallel steps, OpenTelemetry integration, MongoDB/DynamoDB adapters.

GitHub: https://github.com/Grafikui/Transactional-ai NPM: npm install transactional-ai

Happy to answer questions about the implementation, saga patterns, or production experiences!

"There must be something like the opposite of suicide "

Ask HN: Why doesn't Netflix add a “Theater Mode” that recreates the worst parts?

Show HN: Engineering Perception with Combinatorial Memetics

Show HN: Steam Daily – A Wordle-like daily puzzle game for Steam fans

The Anthropic Hive Mind

Just Started Using AmpCode

LLM as an Engineer vs. a Founder?

Crosstalk inside cells helps pathogens evade drugs, study finds

Show HN: Design system generator (mood to CSS in <1 second)

Show HN: 26/02/26 – 5 songs in a day

Toroidal Logit Bias – Reduce LLM hallucinations 40% with no fine-tuning

Top AI models fail at >96% of tasks

The Science of the Perfect Second (2023)

Bob Beck (OpenBSD) on why vi should stay vi (2006)

Show HN: a glimpse into the future of eye tracking for multi-agent use

The Optima-l Situation: A deep dive into the classic humanist sans-serif

Barn Owls Know When to Wait

Implementing TCP Echo Server in Rust [video]

LicGen – Offline License Generator (CLI and Web UI)

Service Degradation in West US Region

The Janitor on Mars

Bringing Polars to .NET

Adventures in Guix Packaging

Show HN: We had 20 Claude terminals open, so we built Orcha

Your Best Thinking Is Wasted on the Wrong Decisions

Warcraftcn/UI – UI component library inspired by classic Warcraft III aesthetics

Trump Vodka Becomes Available for Pre-Orders

Velocity of Money

Stop building automations. Start running your business

You can't QA your way to the frontier