frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Is token-based pricing making AI harder to use in production?

1•Barathkanna•2h ago
Hi HN,

I’ve noticed a recurring theme in many threads here: AI is powerful, but once you move past demos, token based pricing becomes expensive and hard to reason about.

We ran into this problem ourselves while building AI powered systems. Predicting costs, budgeting usage, and experimenting safely all got harder as workloads grew. So we built a small AI API platform for inference, aimed at early developers and small teams who want to integrate AI without constantly calculating token usage. The focus is on lower and more predictable costs rather than chasing the newest model.

This is still early, and I’m mainly posting to learn from others here. For people running AI in production, what’s been the hardest part to manage so far? Cost, predictability, performance, or something else?

I’d really appreciate any insights or experiences.

Comments

iamrobertismo•1h ago
Not clear what you are pitching, if you don't control the infrastructure or have a major contract, how exactly are you lowering or stabilizing costs. Especially if you are not chasing the newest model, at this point token economics is essentially a commodity. Commodity pricing is not a engineering problem, it is a financing problem.
Barathkanna•1h ago
That’s fair, and I probably didn’t explain it clearly. We’re building an AI API as a service platform aimed at early developers and small teams who want to integrate AI without constantly thinking about tokens at all.

I agree that token economics are basically a commodity today. The problem we’re trying to address isn’t beating the market on raw token prices, but removing the mental and financial overhead of having to model usage, estimate burn, and worry about runaway costs while experimenting or shipping early features. In that sense it’s absolutely an engineering and finance problem combined, and we’re intentionally tackling it at the pricing and API layer rather than pretending the underlying models are unique.

OpenAI to start testing ads in ChatGPT free and Go tiers

https://xcancel.com/OpenAI/status/2012223373489614951
1•qingcharles•2m ago•1 comments

LogiCode: LeetCode for hardware design. Synthesize, optimize, and compete

https://logi-code.com
1•nateb2022•3m ago•0 comments

Chinese Fishing Boats Form Sea Barriers

https://www.nytimes.com/interactive/2026/01/16/world/asia/china-ships-fishing-militia-blockade.html
2•perihelions•6m ago•0 comments

'This is treason': Chinese agents are running Canada (2024)

https://www.telegraph.co.uk/news/2024/10/31/chinese-agents-influence-canada-politics/
2•DustinEchoes•7m ago•1 comments

Show HN: Flag AI Slop in PRs

https://haystackeditor.com/slop-detector
2•yatvij•7m ago•0 comments

Creating a 48GB Nvidia RTX 4090 GPU – Brother Zhang's Repair Shop (Ft. 张哥) [video]

https://www.youtube.com/watch?v=TcRGBeOENLg
2•adityaathalye•10m ago•0 comments

Ads Are Coming to ChatGPT. Here’s How They’ll Work

https://www.wired.com/story/openai-testing-ads-us/
3•thm•11m ago•1 comments

The State of LLM Serving in 2026: Ollama, SGLang, TensorRT, Triton, and vLLM

https://thecanteenapp.com/http:/localhost:4000/analysis/2026/01/03/inference-serving-landscape.html
1•jxmorris12•12m ago•0 comments

They Wanted a University Without Cancel Culture. Then Dissenters Were Ousted

https://www.politico.com/news/magazine/2026/01/16/civil-war-university-of-austin-bari-weiss-00729...
3•Anon84•13m ago•0 comments

Tell HN: HP Ultra G1a Bios Freezing Issue

1•BizarroLand•13m ago•0 comments

JustMD – Free and Clean Markdown Editor

https://www.justmd.app/?error=Cannot%2BGET%2B%252Fjustmd.app&errorType=warning
1•luisfkandriolo•14m ago•1 comments

International students disappeared, Canada's rental and campus economies felt it

https://money.ca/news/economy/canada-international-students-economic-impact
1•Teever•17m ago•0 comments

Show HN: MobAI – AI-first mobile automation for iOS and Android

https://mobai.run/
1•interlap•18m ago•0 comments

ChatGPT ads are coming, a bellwether for free AI services

https://www.axios.com/2026/01/16/chatgpt-ai-openai-ads
3•Amorymeltzer•18m ago•1 comments

Show HN: Feedback Required)StudyBuddy–an AI-powered study companion for students

https://www.studybuddy.rest
2•zaizz•19m ago•1 comments

RTS for Agents

https://www.getagentcraft.com/
2•summoned•19m ago•0 comments

New Year, New Album: The Language of Love

https://justright.fm/albums/the-language-of-love
1•ddrscott•20m ago•1 comments

Earth from Space: The Fate of a Giant

https://www.esa.int/ESA_Multimedia/Images/2026/01/Earth_from_Space_The_fate_of_a_giant
4•geox•21m ago•0 comments

SkyVM: Instant desktop VMs from memory snapshots

https://skyvm.dev
4•jkelleyrtp•21m ago•0 comments

AI Generated Code Isn't Cheating: OSS Needs to Talk About It

https://blog.mozilla.ai/ai-generated-code-isnt-cheating-oss-needs-to-talk-about-it/
5•river_otter•22m ago•1 comments

Powell, an Unlikely Foil, Takes on Trump

https://bsky.app/profile/colbylsmith.bsky.social/post/3mckp2qhg3s2f
1•7777777phil•22m ago•1 comments

Hands-On Introduction to Unikernels

https://labs.iximiuz.com/tutorials/unikernels-intro-93976514
1•valyala•23m ago•0 comments

Graphics In Flatland – 2D ray tracing [video]

https://www.youtube.com/watch?v=WYTOykSqf2Y
3•evakhoury•23m ago•0 comments

Claude Cowork Is Now Available to Pro Subscribers

https://twitter.com/claudeai/status/2012215329070493971
3•lilsquid•26m ago•0 comments

Sync and Transcribe Voice Memos from Teenage Engineering's TP-7 Field Recorder

https://github.com/armynante/TP-7-VoiceSync
1•aarmenante•27m ago•2 comments

Show HN: Flickle – Daily Cinematography Game Built with Next.js (SSG, No DB)

https://www.flickle.co
1•rgb1903•29m ago•0 comments

Ask HN: Is sending a lot of requests but respecting rate limits DOSing?

1•SpyCoder77•29m ago•0 comments

Show HN: Readspeed – RSVP-style reading for your own text, PDFs, and links

https://readspeed.app
1•adrmonlj•31m ago•0 comments

OpenAI to Test Targeted Ads in ChatGPT

https://www.bloomberg.com/news/articles/2026-01-16/openai-to-test-targeted-ads-in-chatgpt-steppin...
5•imaginaryunit01•32m ago•1 comments

Ask HN: Analogy of AI IDEs for code vs. "AI IDEs" for personal health data

1•nemath•32m ago•0 comments