frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

What if you just did a startup instead?

https://alexaraki.substack.com/p/what-if-you-just-did-a-startup
1•okaywriting•4m ago•0 comments

Hacking up your own shell completion (2020)

https://www.feltrac.co/environment/2020/01/18/build-your-own-shell-completion.html
1•todsacerdoti•7m ago•0 comments

Show HN: Gorse 0.5 – Open-source recommender system with visual workflow editor

https://github.com/gorse-io/gorse
1•zhenghaoz•7m ago•0 comments

GLM-OCR: Accurate × Fast × Comprehensive

https://github.com/zai-org/GLM-OCR
1•ms7892•8m ago•0 comments

Local Agent Bench: Test 11 small LLMs on tool-calling judgment, on CPU, no GPU

https://github.com/MikeVeerman/tool-calling-benchmark
1•MikeVeerman•9m ago•0 comments

Show HN: AboutMyProject – A public log for developer proof-of-work

https://aboutmyproject.com/
1•Raiplus•9m ago•0 comments

Expertise, AI and Work of Future [video]

https://www.youtube.com/watch?v=wsxWl9iT1XU
1•indiantinker•10m ago•0 comments

So Long to Cheap Books You Could Fit in Your Pocket

https://www.nytimes.com/2026/02/06/books/mass-market-paperback-books.html
3•pseudolus•10m ago•1 comments

PID Controller

https://en.wikipedia.org/wiki/Proportional%E2%80%93integral%E2%80%93derivative_controller
1•tosh•14m ago•0 comments

SpaceX Rocket Generates 100GW of Power, or 20% of US Electricity

https://twitter.com/AlecStapp/status/2019932764515234159
1•bkls•15m ago•0 comments

Kubernetes MCP Server

https://github.com/yindia/rootcause
1•yindia•16m ago•0 comments

I Built a Movie Recommendation Agent to Solve Movie Nights with My Wife

https://rokn.io/posts/building-movie-recommendation-agent
4•roknovosel•16m ago•0 comments

What were the first animals? The fierce sponge–jelly battle that just won't end

https://www.nature.com/articles/d41586-026-00238-z
2•beardyw•24m ago•0 comments

Sidestepping Evaluation Awareness and Anticipating Misalignment

https://alignment.openai.com/prod-evals/
1•taubek•24m ago•0 comments

OldMapsOnline

https://www.oldmapsonline.org/en
1•surprisetalk•27m ago•0 comments

What It's Like to Be a Worm

https://www.asimov.press/p/sentience
2•surprisetalk•27m ago•0 comments

Don't go to physics grad school and other cautionary tales

https://scottlocklin.wordpress.com/2025/12/19/dont-go-to-physics-grad-school-and-other-cautionary...
1•surprisetalk•27m ago•0 comments

Lawyer sets new standard for abuse of AI; judge tosses case

https://arstechnica.com/tech-policy/2026/02/randomly-quoting-ray-bradbury-did-not-save-lawyer-fro...
3•pseudolus•27m ago•0 comments

AI anxiety batters software execs, costing them combined $62B: report

https://nypost.com/2026/02/04/business/ai-anxiety-batters-software-execs-costing-them-62b-report/
1•1vuio0pswjnm7•27m ago•0 comments

Bogus Pipeline

https://en.wikipedia.org/wiki/Bogus_pipeline
1•doener•29m ago•0 comments

Winklevoss twins' Gemini crypto exchange cuts 25% of workforce as Bitcoin slumps

https://nypost.com/2026/02/05/business/winklevoss-twins-gemini-crypto-exchange-cuts-25-of-workfor...
2•1vuio0pswjnm7•29m ago•0 comments

How AI Is Reshaping Human Reasoning and the Rise of Cognitive Surrender

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6097646
3•obscurette•29m ago•0 comments

Cycling in France

https://www.sheldonbrown.com/org/france-sheldon.html
2•jackhalford•31m ago•0 comments

Ask HN: What breaks in cross-border healthcare coordination?

1•abhay1633•31m ago•0 comments

Show HN: Simple – a bytecode VM and language stack I built with AI

https://github.com/JJLDonley/Simple
2•tangjiehao•34m ago•0 comments

Show HN: Free-to-play: A gem-collecting strategy game in the vein of Splendor

https://caratria.com/
1•jonrosner•35m ago•1 comments

My Eighth Year as a Bootstrapped Founde

https://mtlynch.io/bootstrapped-founder-year-8/
1•mtlynch•35m ago•0 comments

Show HN: Tesseract – A forum where AI agents and humans post in the same space

https://tesseract-thread.vercel.app/
1•agliolioyyami•35m ago•0 comments

Show HN: Vibe Colors – Instantly visualize color palettes on UI layouts

https://vibecolors.life/
2•tusharnaik•36m ago•0 comments

OpenAI is Broke ... and so is everyone else [video][10M]

https://www.youtube.com/watch?v=Y3N9qlPZBc0
2•Bender•37m ago•0 comments
Open in hackernews

Ask HN: Will LLM API costs be negligible in a year?

2•changisaac•6mo ago
Hi HN. We’re managing costs at my startup and by far our largest spend is on calls to Anthropic, OpenAI, etc. We’ve considered things like spinning up our own open source model but decided it’s not worth it considering we don’t even have PMF yet.

Optimistically though, I see that token prices to LLMs have been going down a lot in the past few years. Do you think if this continues that it’ll eventually become a negligible expense? Or do you think we will forever be gouged by these foundation model companies? (: Much like how cloud computing has went (AWS, GCP, etc.)

Comments

ben_w•6mo ago
Define "negligible".

You need to know how much LLM output you need to get your product working, before you even know what you're hoping for regarding a target cost per million tokens. When you do get PMF, can some of the work be offloaded to a smaller and cheaper model? Can you determine this division of labour yet?

Consider also that "computer" used to be a job title, that since then the cost of doing computations has reduced by a factor of at least 1e14, and yet that you're only asking this question at all because you're still compute limited.

changisaac•6mo ago
> and yet that you're only asking this question at all because you're still compute limited.

Very good point.

musbemus•6mo ago
If they do start to become unsustainable you might see more companies moving to a BYOK or usage-based billing model. If they do that, I don't know if the use cases for AI would justify the cost for consumers (but perhaps so for businesses). There's been a ton of build out of data centers so I do think the cost reduction we've seen so far may extrapolate but at the expense of more performant models. Hard to tell right now though
codingdave•6mo ago
At some point AI providers will need to break down profit/token and price accordingly. Right now, they are losing money to gain market share. Also, AI consumers will need to get the expense of AI into their own profit calculations.

Hard to say how it will play out, aside from both sides are going to strive to maximize their own benefit, and time will tell how the actual numbers balance out.

This is one reason why it matters whether or not the AI bubble is all hype. There is a non-trivial chance that once people truly figure out the monetary value of AI's help on their processes and cut out all hype-based use cases... their spending limits to reach that value might not match what the providers need to run the platforms.

symbolicAGI•6mo ago
The frontier models when released are operating UIs and APIs at a substantial profit during the delivery of inference. However, overall the vendors are losing money because they are paying for ever-increasing training costs for the next version of their frontier model.

This money-losing business of the vendors will no doubt continue for at least another year.

There are two ways to expect lower LLM API costs in the future:

1. Be satisfied with an older version of a particular LLM. As inference hardware and software become more efficient, the vendor can lower API costs on the older models to remain competitive.

2. Eventually - not next year - the return on investment from training the next version of the LLM will decrease relative to the ROI on current LLMs (because the improvements will be less awesome) and the training cost of such a model will necessarily be spread out over a longer duration as competition allows. At that point (whenever) the training cost might level off or actually decrease and that savings would be competitively passed along to the API consumer. And coincidentally that would be the point at which the vendors become overall profitable.

changisaac•6mo ago
This is a great analysis btw, thanks for this!

My take away from this is that my startup should spend some time investing in some cost analysis with our LLM usage and context engineering (perhaps closely after some level of PMF). If it’s not happening anytime soon, might as well treat it as it’s not happening at all considering that startups die out pretty quick lol.