frontpage.

Running a real consumer app on a 70B LLM at sub-cent cost per scan

https://www.cornstarch.ai/
1•rs1996•17h ago

Comments

rs1996•17h ago
We built a consumer app that does deep ingredient and health analysis (food, supplements, skincare, cat treats, etc.) using llama-3.3-70b in production.

Some numbers from the last month:

- ~3.0M+ tokens processed
- ~$2.07 total inference cost
- ~0.5–0.6 cents per scan
- Median latency ~3s, typical range 3–5s
- Long prompts, structured outputs, ingredient-level caching
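
For readers who want to sanity-check these figures, here is a minimal back-of-envelope sketch. It uses only the numbers quoted above; the function names are made up for illustration, and the scan count is an inference from total cost divided by cost per scan, not a figure from the post. It also ignores scans served entirely from cache, which would cost nothing.

```python
# Back-of-envelope check of the quoted monthly figures.
# Inputs come from the comment above; function names are hypothetical.

def usd_per_million_tokens(total_cost_usd: float, total_tokens: int) -> float:
    """Blended price per one million tokens."""
    return total_cost_usd / total_tokens * 1_000_000


def implied_billed_scans(total_cost_usd: float, cost_per_scan_usd: float) -> float:
    """Number of scans that would add up to the total at a given per-scan cost."""
    return total_cost_usd / cost_per_scan_usd


TOTAL_TOKENS = 3_000_000      # "~3.0M+ tokens processed"
TOTAL_COST_USD = 2.07         # "~$2.07 total inference cost"
COST_PER_SCAN_LOW = 0.005     # 0.5 cents
COST_PER_SCAN_HIGH = 0.006    # 0.6 cents

if __name__ == "__main__":
    rate = usd_per_million_tokens(TOTAL_COST_USD, TOTAL_TOKENS)
    fewest = implied_billed_scans(TOTAL_COST_USD, COST_PER_SCAN_HIGH)
    most = implied_billed_scans(TOTAL_COST_USD, COST_PER_SCAN_LOW)
    print(f"blended rate: ${rate:.2f} per 1M tokens")
    print(f"implied billed scans: {fewest:.0f} to {most:.0f}")
```

Under these assumptions the blended rate works out to roughly $0.69 per million tokens, and the total corresponds to about 345 to 414 billed scans.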

This isn’t a demo or batch job — it’s a real latency-constrained mobile workload with thousands of active scanning users.

The main takeaway for us was that deep, high-quality inference can be surprisingly cheap and predictable if you design for it intentionally.
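
The post does not describe how its ingredient-level caching works, so the following is only one plausible reading, sketched under stated assumptions: each ingredient is analyzed once, keyed by a normalized name, and later scans reuse the stored result. `IngredientCache`, `normalize`, and the `analyze` callback are hypothetical names; the callback stands in for whatever model call the app actually makes.

```python
from typing import Callable, Dict, List


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so spelling variants share one key."""
    return " ".join(name.lower().split())


class IngredientCache:
    """Hypothetical per-ingredient cache in front of an expensive analysis call."""

    def __init__(self, analyze: Callable[[str], str]) -> None:
        self._analyze = analyze          # stand-in for the real model call
        self._store: Dict[str, str] = {}
        self.misses = 0                  # how many times the model was invoked

    def analyze_scan(self, ingredients: List[str]) -> Dict[str, str]:
        """Return an analysis per ingredient, calling the model only on misses."""
        results: Dict[str, str] = {}
        for raw in ingredients:
            key = normalize(raw)
            if key not in self._store:
                self.misses += 1
                self._store[key] = self._analyze(key)
            results[key] = self._store[key]
        return results


if __name__ == "__main__":
    # A stub replaces the model so the sketch runs without any API access.
    cache = IngredientCache(lambda ingredient: f"analysis of {ingredient}")
    cache.analyze_scan(["Citric Acid", "Water", "Niacinamide"])
    cache.analyze_scan(["water", "citric  acid", "Glycerin"])
    print(cache.misses)  # only distinct ingredients reach the stub
```

With a design like this, a second scan that shares ingredients with an earlier one only pays for the ingredients not yet seen, which is one way per-scan cost could stay low as usage grows.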

Happy to answer questions or share more details if useful.

Palo Alto Crosswalk Signals Had Default Passwords

https://padailypost.com/2025/12/29/crosswalk-signals-were-hacked-because-of-a-weak-password/
1•zdw•1m ago•0 comments

Georgism Through Land Leasing

https://progressandpoverty.substack.com/p/georgism-through-land-leasing
1•viajante1882•2m ago•0 comments

The Genius Whose Simple Invention Saved Us from Shame at the Gas Station

https://www.wsj.com/business/autos/ford-gas-arrow-inventor-jim-moylan-6b2ef066
1•zdw•3m ago•0 comments

Making Sense of Memory and Attention

https://memory.briankitano.com/
1•bkitano19•4m ago•0 comments

The Case for Blogging in the Ruins

https://www.joanwestenberg.com/the-case-for-blogging-in-the-ruins/
1•Kerrick•5m ago•0 comments

Explaining Cloud-9: A Celestial Object Like No Other

https://www.centauri-dreams.org/2026/01/07/explaining-cloud-9-a-celestial-object-like-no-other/
1•romperstomper•8m ago•0 comments

Federal Judge Lookup: Research Any Judge's Track Record

https://asklexi.com/blog/federal-judge-lookup-research-track-record
1•sarachekroun•10m ago•0 comments

Three GPU Markets, Three Volatility Regimes

https://davefriedman.substack.com/p/three-gpu-markets-three-volatility
1•gmays•11m ago•0 comments

Pacer Isn't Just Expensive – It Forces You to Pay Before You Understand Anything

https://asklexi.com/blog/pacer-forces-you-to-pay-before-you-understand
2•sarachekroun•11m ago•0 comments

Show HN: The internet's meanest product manager

https://www.burnmywebsite.com
1•Roozka•12m ago•1 comments

Show HN: Discover and fund the open source projects your code depends on

https://github.com/jshchnz/tribute
1•jshchnz•14m ago•0 comments

Learning Retro Computer Electronics Fault Finding and Restoration

https://retrogamecoders.com/learning-retrocomputer-electronics/
3•ibobev•15m ago•0 comments

Boomshare: Free Forever Alternative to Loom

https://www.boomshare.ai/
1•buchanae•16m ago•1 comments

Per-query energy consumption of LLMs

https://muxup.com/2026q1/per-query-energy-consumption-of-llms
2•birdculture•18m ago•0 comments

Life after LeBron James: who will inherit the NBA's future?

https://www.theguardian.com/sport/2025/dec/31/nba-american-era-fading-cooper-flagg-next-face
2•PaulHoule•19m ago•1 comments

There is No Now – Problems with simultaneity in distributed systems

https://queue.acm.org/detail.cfm?id=2745385
2•onurkanbkrc•19m ago•0 comments

Show HN: ElixirBrowser – Android Chromium fork with extensions, inspired by Kiwi

https://github.com/SF-FLAM/ElixirBrowser
1•SF-FLAM•20m ago•0 comments

Challenges and Research Directions for Large Language Model Inference Hardware

https://arxiv.org/abs/2601.05047
1•matt_d•20m ago•0 comments

The Office Twitter account went private

https://x.com/office
1•antiloper•20m ago•0 comments

Show HN: I made a memory game to teach you to play piano by ear

https://lend-me-your-ears.specr.net
2•vunderba•21m ago•0 comments

Please stop building scroll-driven websites

https://adamhl.dev/blog/stop-building-scroll-driven-websites
4•genshii•22m ago•2 comments

What Happened to the Webmaster (2020)

https://thehistoryoftheweb.com/postscript/what-happened-to-the-webmaster/
2•carlos-menezes•23m ago•0 comments

I got paid minimum wage to solve an impossible problem

https://tiespetersen.substack.com/p/i-got-paid-minimum-wage-to-solve
1•coffeeaddict1•23m ago•0 comments

Meta-Atheism: Religious Avowal as Self-Deception [pdf]

https://gwern.net/doc/philosophy/religion/2009-rey.pdf
3•bondarchuk•24m ago•0 comments

Avoid aligning keys and values in source code unnecessarily

https://www.databasesandlife.com/dont-align-struct-values/
2•adrianmsmith•24m ago•1 comments

Zymtrace vs. Nsight: Profiling Nvidia GPU Clusters at Scale

https://zymtrace.com/article/zymtrace-nsight/
1•tanelpoder•25m ago•0 comments

How the AI 'bubble' compares to history

https://www.ft.com/content/41e9d03a-e5c1-4862-9836-b3c80b3f9be4
1•chalst•26m ago•0 comments

Macinfo Reconstructed Source Code Reveals Find My Mac Logic

https://onejailbreak.com/blog/macinfo-source-release/
1•mpweiher•26m ago•0 comments

Distinct neuronal populations in the human brain combine content and context

https://www.nature.com/articles/s41586-025-09910-2
2•bookofjoe•28m ago•0 comments

Largest cargo sailboat makes first transatlantic crossing [video]

https://www.youtube.com/watch?v=dUdaBnJ58jI
1•tartoran•31m ago•0 comments