After studying the business models of top labs like Anthropic and OpenAI, I found that they run roughly 80% margins on inference, which they use to fund R&D on the next model.
Working with open-source models is much cheaper and allows for 5-10x higher usage, so I decided to create Sweet! CLI as an alternative to the products offered by the big labs.
Sweet! CLI uses our own custom post-trained version of DeepSeek V3.2, hosted on US-based inference servers (very similar to Cursor's Composer model).
Unlike most LLM-based agent products, we bill solely based on usage and have a seamless top-up experience, so you only pay for what you use.
My favorite feature is 'autopilot', where you can specify how long you want the agent to work on a specific task, including indefinitely. This is great for monitoring live deployed applications and detecting outages that need to be triaged immediately, and I have multiple Sweet! agents deployed to the production server right now with that exact objective.
I'd appreciate any support or feedback on how I can make it better!
Thanks,
Adam - Founder of Sweet! CLI
Imustaskforhelp•31m ago
I know that DeepSeek as a model is easier to run inference for, but I am not sure how much the post-training has helped.
It's my understanding that GLM 5.1, or in my personal experience Kimi K2, are some nice open-source models, so I'm interested to hear your thoughts on them and why you picked DeepSeek for the fine-tuning instead.