frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I spent $638 on AI coding agents in 6 weeks.

1•nthypes•1h ago
I just got my Cursor invoices and I'm genuinely shocked. Between October and early November, I've burned through $638 on AI coding assistance. That's more than some cloud bills.

Context: I'm an Founder & CTO building an AI-First CRM product.

Here's what happened:

October: Started the month thinking "I'll stay within the Pro limits, no problem." By mid-month, Cursor hit me with a $280 invoice. By month end? $348.56 total in on-demand charges. I literally maxed out the $400 limit.

November: It's only November 12 and I've already been invoiced $289.38:

Cost per request: Claude 4.5 Sonnet Thinking ranges from $0.02 to $0.06 depending on context size. Doesn't sound like much until you realize you're hitting it 200+ times per day.

I tried 7 different models (GPT-5, Gemini 2.5 Pro, Cheetah, etc.) thinking I'd save money. Claude still ate 85% of my budget because, honestly, it's the best.

Am I more productive? Absolutely. Is it worth $638 every 6 weeks? Idk. That's $5,500+ annually just for code assistance.

So I'm curious:

What are YOU spending? Am I an outlier or is this the new normal?

Have you changed behavior to cut costs? (Using faster models? Being more selective? Bringing your own API keys?)

At what price point would you stop? $100/month? $500? $1000?

Is anyone actually staying within the included limits? Or is that just marketing?

I feel like we're in this weird phase where the value is obvious but the pricing model hasn't stabilized.

Would love to hear how others are navigating this.

Comments

mnky9800n•1h ago
I just use claude code max (125/month flat) with no problems like this. I constantly have multiple agents doing things on clusters figuring out problems, writing codes, or otherwise doing research for me (I am a scientist). Maybe you should swich from loser Cursor to winner Claude? Haha. And anyways, you are a founder, is 638$/6weeks affordable in your budget? It is up to you to decide what you think is a reasonable expense on your adventure towards profitability.
nthypes•1h ago
Thanks for the input! I'm checking on Claude Code Max now - from what I'm seeing, even the $200/month plan has weekly rate limits (240-480 hours of Sonnet 4, 24-40 hours of Opus 4 per week).. so not quite unlimited tokens either, though definitely more predictable billing.

$638/6 weeks won't make me broke, but here's my main issue: for me it's about the value-to-token ratio feeling off.

What bugs me most is that many of those 340M tokens feel wasteful? Like the LLM will use 50k tokens exploring dead ends before finding a solution that could have been expressed in 5k tokens. The productivity gain is real, but it feels like I'm paying 10x more than what should be "fair" for the actual value delivered.

Maybe this is just the current state of AI coding - the models need that exploration space to get to the answer. Or maybe I need to get better at constraining the context and being more surgical with my prompts.

For me as a founder, it's less "can I afford this" and more "does this pricing model make sense long-term?" If AI coding becomes a $5-6k/year baseline expense per developer, that changes a lot of unit economics, especially for early-stage companies.

Are you finding Claude Code Max more token-efficient for similar tasks, or is it just easier to stomach because the billing is flat?

Woods369•1h ago
I totally agree with the closing questions and statements here.

Yes, there is an apparent fluctuation in pricing for said tokens/credits/etc.. and they are just finding the sweet spot.

I primarily use the https://www.warp.dev/ terminal and they have just released a new plan which outwardly states lower cost per request.

They mentioned 'At full usage, the plans didn’t scale sustainably'. This is encouraging to know.

Right now, I'm looking around and trying out. Nice read, thanks.

6510•56m ago
Also interesting is what it will cost if you do or don't train yourself to use it. You can't be a very good spell caster if you don't practice as much as the next wizard. Also, from now on everything you do will be this expensive :P
Rochus•21m ago
And was it worth it? What were you able to achieve with the LLM-support you payed for? How much time did you save?

I did experiments with Claude Sonnet and Opus and also with Gpt-5, the latter via my Perplexity subscription. My experience with Claude was mixed; most output required significant re-work which mostly consumed the time savings enabled by Claude. In contrast, Gpt-5 was able to generate code for several complex problems and made different transpilations for me, and the generated code usually compiles and correctly runs up front, all covered by my Perplexity subscription.

nthypes•8m ago
For me it's about the value-to-token ratio feeling off, as I stated in another comment.

> What bugs me most is that many of those 340M tokens feel wasteful? Like the LLM will use 50k tokens exploring dead ends before finding a solution that could have been expressed in 5k tokens. The productivity gain is real, but it feels like I'm paying 10x more than what should be "fair" for the actual value delivered.

nthypes•16m ago
I've asked Sonnet 4.5 (Cursor) to generate some stats based on my cursor CSV file usage. Here it is the output:

Key Findings from Your Cursor Usage Here are the most interesting insights from your usage data:

Cost Summary

Total Cost: $928.45 over 70 days

Average per Request: $0.06

Cost per Task (Request): Ranges from $0.00 to $2.78, with 65.7% costing under $0.05

Projected Monthly Cost: ~$416 (based on average daily spend of $13.86)

Request Patterns

Requests per 5 Hours: Average 70.7, ranging from 1 to 451

Average Time Between Requests: 6 minutes 33 seconds

Median Time Between Requests: Just 13 seconds (shows bursts of activity)

Peak Activity: 1-2 PM (10.4% of all requests at 1 PM)

Busiest Day: Saturday with 21.7% of requests

Token Efficiency

Average Tokens per Request: 83,371 tokens

Median Tokens per Request: 38,342 tokens

Average Output per Request: 876 tokens

Cache Hit Rate: 88.8% (excellent! saves money)

Cost per 1,000 Tokens: $0.0009 (very efficient due to caching)

Cost per 1,000 Output Tokens: $0.14

Notable Stats

Most Expensive Request: $2.78 using 6.8M tokens (mostly cached)

Total Hours of Active Usage: 1,692 hours (~9 requests/hour)

Most Used Models: claude-4.5-sonnet-thinking, claude-3.5-sonnet, and others

Your cache hit rate of 88.8% is excellent and is saving you significant costs! Without caching, your costs would be much higher.

I'm a Millionaire. No One Needs More Than $30M

https://time.com/7325632/universal-basic-income-millionaire/
1•jamesgill•51s ago•0 comments

Show HN: Link Snapper – Copy URLs as plain text for unclickable sharing

https://devapt.com/link-snapper
1•wahvinci•2m ago•0 comments

Why every AI coding tool gets pricing wrong

https://getlago.substack.com/p/why-every-ai-coding-tool-gets-pricing
1•FinnLobsien•4m ago•0 comments

Automated Contiguous Layer Pruning for Large Language Models

https://arxiv.org/abs/2510.23652
1•PaulHoule•4m ago•0 comments

Elon Musk's trillion dollar pay package depends on emerging markets

https://restofworld.org/2025/elon-musk-trillionaire/
1•donohoe•7m ago•0 comments

Universe Simulation Now in Maintenance Mode (Post-Patch Hypothesis)

https://medium.com/@OverthinkingVoid/universe-simulation-now-in-maintenance-mode-c01bafd6e607
1•murugaviki•7m ago•1 comments

In Defence of Personal Finance

https://www.economist.com/finance-and-economics/2025/11/12/in-defence-of-personal-finance
1•andsoitis•7m ago•0 comments

Gemini 3 rolling out (unconfirmed) to mobile users

https://old.reddit.com/r/Bard/comments/1ovvmjo/not_hype_posting_gemini_3_is_rolling_out_to/
1•martinald•10m ago•1 comments

Useless AI Products Are Getting Worse [video]

https://www.youtube.com/watch?v=vhvk0kxX1wg
1•askl•11m ago•0 comments

When Reverse Proxies Surprise You: Hard Lessons from Operating at Scale

https://www.infoq.com/articles/scaling-reverse-proxies/
1•miggy•11m ago•0 comments

What Is a Production Process?

https://www.construction-physics.com/p/what-is-a-production-process
1•chmaynard•13m ago•0 comments

Extraordinary Popular Delusions and the Madness of Crowds

https://www.gutenberg.org/files/24518/24518-h/24518-h.htm
2•andsoitis•13m ago•0 comments

Oldversion.com Archive Facing Shutdown due to Financing Issues

https://hackaday.com/2025/11/07/oldversion-com-archive-facing-shutdown-due-to-financing-issues/
3•uyzstvqs•16m ago•0 comments

Show HN: Solve logic puzzles while waiting for AI

https://www.antibrainrot.xyz/
1•zknowledge•17m ago•0 comments

Intelligence per Watt: A Study of Local Intelligence Efficiency

https://hazyresearch.stanford.edu/blog/2025-11-11-ipw
1•simonpure•21m ago•0 comments

Ask HN: Is there a market for niche aesthetic AI generators?

1•Yreminder•23m ago•0 comments

Heartbeats in Distributed Systems

https://arpitbhayani.me/blogs/heartbeats-in-distributed-systems/
2•sebg•23m ago•0 comments

What the Look of Your Favorite Podcast Is Trying to Tell You

https://www.nytimes.com/interactive/2025/11/12/arts/podcast-sets-design.html
1•jbegley•23m ago•0 comments

They're Firing Everyone and Getting Rich from It [video]

https://www.youtube.com/watch?v=b3h5e-A6dfs
1•EPendragon•25m ago•0 comments

Cursor Is Now Worth $29.3B

https://www.wsj.com/tech/ai/the-ai-coding-startup-favored-by-tech-ceos-is-now-worth-29-3-billion-...
2•giuliomagnifico•25m ago•1 comments

Claude Code hooks are crazy

https://www.augmentedswe.com/p/guide-to-claude-code-hooks
1•wordsaboutcode•26m ago•0 comments

Show HN: A game of higher or lower using GitHub stars

https://higher-lower.muxo.ai/
1•alexander2002•27m ago•0 comments

Blender Lab

https://www.blender.org/news/introducing-blender-lab/
9•radeeyate•28m ago•1 comments

Loeb's 3I/Atlas "Anomalies" Explained

https://sites.psu.edu/astrowright/2025/11/09/loebs-3i-atlas-anomalies-explained/
2•geox•28m ago•0 comments

Show HN: Free Y2K horror image generator using Gemini prompts

https://dreamyy2k.app/
1•Yreminder•29m ago•0 comments

Michael Burry of 'Big Short' Fame Deregisters Scion Asset Management

https://www.cnbc.com/2025/11/13/michael-burry-of-big-short-fame-deregisters-scion-asset-managemen...
3•mrtksn•30m ago•0 comments

The fanatic, fraudulent Mother Theresa (2003)

https://slate.com/news-and-politics/2003/10/the-fanatic-fraudulent-mother-teresa.html
3•simonebrunozzi•31m ago•0 comments

Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code

https://cacm.acm.org/research-highlights/asleep-at-the-keyboard-assessing-the-security-of-github-...
1•b-man•32m ago•0 comments

Britain's Railway Privatization Was an Abject Failure

https://www.rosalux.de/en/news/id/53917/britains-railway-privatization-was-an-abject-failure
20•robtherobber•32m ago•2 comments

Show HN: Patchsmith – Agentic wrapper for CodeQL (finetune, triage, fix with AI)

https://github.com/eschnou/patchsmith
1•eschnou•35m ago•1 comments