frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Infer0 – do AI apps need subscriptions?

https://infer0.com/
5•sumolessons•1h ago
One thing that’s been bothering me about AI side projects is inference costs. With traditional software, a successful launch usually means higher profits. But with AI products, success can mean unexpectedly large bills.

This has pushed me toward cheaper, less capable models and made me hesitate to even explore certain ideas. I don’t want every side project to become another $20/month subscription, but I also can’t compete with VC-backed companies willing to subsidize inference costs.

Then I had this idea: what if users simply paid for their own inference?

This already happens in some apps through locally configured API keys, but could the model be extended? If users bring their own AI account, developers can build AI-powered products without taking on variable inference costs. How many AI applications shouldn’t be subscription businesses at all?

The challenge is that developers don’t want to handle user API keys, users don’t want to hand them out to every app they try, and nobody wants the friction of collecting payment methods just to pass through inference costs.

That’s the backstory to my latest side project, infer0.com.

It's a bit like SSO for AI inference. Users connect their AI providers once, and apps use auth tokens to request inference through infer0. Developers don’t manage API keys or pay model costs themselves, while users can bring the same AI accounts across multiple applications.

This may be a terrible idea, both because nobody will trust it and because I’m sure there are risks around handling user API credentials that I haven’t fully appreciated. But I felt the need to build it. So here’s a rough first pass.

Agent Architecture Is a Compute Allocation Problem: The Advisor Strategy

https://harrisonsec.com/blog/agent-architecture-compute-allocation-advisor-strategy/
1•gzxharrison001•37s ago•0 comments

How we evaluate our LLM judge

https://build.forus.com/how-we-evaluate-our-llm-judge-a-perturbation-based-approach
1•abeinstein•1m ago•0 comments

Can gzip be a language model?

https://nathan.rs/posts/gzip-lm/
1•nathan-barry•2m ago•0 comments

The Faithfulness of LLMs as Solvers and Autoformalizers in Legal Reasoning

https://arxiv.org/abs/2606.16118
1•root-parent•3m ago•0 comments

The AI Hype – Too Costly – Alternative Rock, Original Lyrics

https://www.youtube.com/watch?v=jwfuNk2cRDc
1•NedCode•4m ago•0 comments

The Same Hetzner VM Cost $60 Last Week. Today It Costs $154

https://webbynode.com/articles/same-hetzner-vm-cost-60-last-week-today-hetzner-offers-it-at-154
1•gsgreen•4m ago•2 comments

Python 3.13 gets a JIT (2024)

https://tonybaloney.github.io/posts/python-gets-a-jit.html
1•tosh•5m ago•0 comments

TreeTrace, Git records what changed; this records how you steered

https://github.com/TreeTraceTool/TreeTrace
1•ZionBoggan•5m ago•0 comments

Never Talk to the Police. Period

https://www.campolalaw.com/why-you-should-never-talk-to-the-po
2•Cider9986•6m ago•0 comments

Databricks Acquires Panther

https://www.databricks.com/company/newsroom/press-releases/databricks-agrees-acquire-panther-furt...
1•scapecast•8m ago•0 comments

Show HN: Sentinel – prevent duplicate execution using Postgres

https://github.com/Sreejay-reddy/Sentinel
1•Sreejay_reddy•8m ago•0 comments

GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz

https://twitter.com/fguzmanai/status/2065832668172845209
5•laxmena•10m ago•0 comments

Hardware Is Asynchronous. Most of Our Operating Systems Still Aren't

https://vorjdux.com/articles/hardware-is-async.html
3•homarp•11m ago•0 comments

Apple's weird anti-nausea dots cured my car sickness

https://www.theverge.com/tech/942854/apple-vehicle-motion-cues-review-really-work
3•neilfrndes•11m ago•0 comments

Steve Jobs in Exile by Geoffrey Cain

https://auxiliarymemory.com/2026/06/01/steve-jobs-in-exile-by-geoffrey-cain/
1•speckx•11m ago•0 comments

Stop rebuilding your billing system

https://useautumn.com/blog/stop-rebuilding-billing
1•johnyeocx•11m ago•0 comments

Russian frigate fires warning shots at British yacht in English Channel

https://www.theguardian.com/uk-news/2026/jun/16/russian-frigate-fires-warning-shots-at-british-ya...
3•manarth•12m ago•0 comments

We should vaccinate wild animals

https://worksinprogress.co/issue/why-we-should-vaccinate-wild-animals/
5•duffydotsvg•13m ago•0 comments

Show HN: Docket – Semantic search over your local files, runs in the browser

https://docketapp.netlify.app/
1•owenthecoder13•13m ago•0 comments

2024-25 Covid-19 Vaccine and Major Adverse Cardiovascular Events in US Veterans

https://jamanetwork.com/journals/jamainternalmedicine/fullarticle/2850241
1•bookofjoe•14m ago•0 comments

The Dangerous Tech Found Aboard 'Dark-Fleet' Tankers Captured by the U.S.

https://www.wsj.com/articles/the-dangerous-tech-found-aboard-dark-fleet-tankers-captured-by-the-u...
2•CSMastermind•15m ago•0 comments

Arrests, prosecutions, convictions or fines for online speech by country

https://github.com/kevinnbass/state_action_against_online_speech_globally
5•MrBuddyCasino•15m ago•1 comments

Show HN: In Browser semantic wallpaper search over 16k+ wallpapers

https://web-inky-ten-60.vercel.app
3•rdksu•16m ago•0 comments

Good Pricing Grows with the Value You Deliver

https://www.hauser.io/good-pricing-grows-with-the-value-you-deliver/
3•bkfh•16m ago•0 comments

NovaVest/VN-Noxa-v1-7B-Beta-Low

https://huggingface.co/NovaVest/VN-Noxa-v1-7b-Beta-Low
2•ilreb•18m ago•0 comments

Brazos: Liquid cooling system for air-cooled data centers

https://cloud.google.com/blog/topics/systems/brazos-liquid-cooling-system-for-air-cooled-data-cen...
3•ilreb•19m ago•0 comments

Show HN: Shivvr – Ephemeral semantic embedding and cognitive agent service

https://shivvr.nuts.services/
2•kordlessagain•20m ago•0 comments

SpaceX Set to Overtake Amazon in Value as It Soars for Third Day

https://www.bloomberg.com/news/articles/2026-06-16/spacex-spcx-stock-set-for-more-than-50-jump-in...
4•pera•20m ago•1 comments

Tell HN: Anthropic walks back on Agent SDK credit changes

2•lostmsu•20m ago•0 comments

Commodore announces Linux-based flip phone with 'no social media, no browser'

https://www.tomshardware.com/phones/commodore-announces-linux-based-flip-phone-with-no-social-med...
4•neilfrndes•20m ago•0 comments