frontpage.


AI Use Makes Us Overestimate Our Cognitive Performance

https://www.aalto.fi/en/news/ai-use-makes-us-overestimate-our-cognitive-performance
1•jruohonen•4m ago•1 comments

Comparing Claude Code vs. OpenCode

https://www.andreagrandi.it/posts/comparing-claude-code-vs-opencode-testing-different-models/
1•behnamoh•8m ago•0 comments

A Brief History of the Cypherpunk Movement

https://seykhel.org/en/blog/history/
1•suioir•9m ago•0 comments

China's New Influencer Law: Only Degree-Holders Can Discuss Professional Topics

https://www.moroccoworldnews.com/2025/10/265324/chinas-new-influencer-law-says-only-degree-holder...
2•ghssds•16m ago•0 comments

Why is Python's OrderedDict ordered?

https://www.piglei.com/articles/en-why-is-python-ordereddict-ordered/
1•misonic•18m ago•0 comments

Unscreen Pro: Remove Video Background with AI

https://unscreen.pro
1•sparkalpha•22m ago•0 comments

Blank Page Website for Writing

https://focusforge.net
3•chwiho•27m ago•0 comments

Germany examines nationalising Rosneft arm after US sanctions

https://www.reuters.com/business/energy/germany-examines-nationalising-rosneft-arm-after-trump-sa...
2•geox•27m ago•0 comments

Affluent Investors Are Using Options Math to Borrow on the Cheap

https://www.bloomberg.com/news/articles/2025-10-29/trump-and-xi-set-to-formalize-trade-truce-afte...
1•imichael•29m ago•1 comments

Upply – AI that auto-fills any online form

https://goapply.today/
1•longama•35m ago•2 comments

Solving the Character Encoding Issue When Reading DuckDB via ODBC in Excel VBA

https://redraiment.medium.com/solving-the-character-encoding-issue-when-reading-duckdb-via-odbc-i...
1•redraiment•36m ago•1 comments

Show HN: Claim-Detective – Verify Suspicious Tech Claims Collaboratively

https://www.claim-detective.com/
1•stackoversnow•40m ago•0 comments

Trump asks Pentagon to resume testing US nuclear weapons

https://www.reuters.com/world/china/trump-asks-pentagon-immediately-start-testing-us-nuclear-weap...
5•JKCalhoun•42m ago•3 comments

One Year with Next.js App Router – Why We're Moving On

https://paperclover.net/blog/webdev/one-year-next-app-router
2•nnx•43m ago•1 comments

Found a clean subnet cheatsheet website

https://subnetmaskcheatsheet.com
1•chwiho•43m ago•0 comments

Data centers turn to commercial aircraft jet engines as AI power crunch bites

https://www.tomshardware.com/tech-industry/data-centers-turn-to-ex-airliner-engines-as-ai-power-c...
1•pabs3•44m ago•3 comments

Top researchers consider leaving US amid funding cuts: Science world is ending [video]

https://www.youtube.com/watch?v=yLvO070E_dI
3•thelastgallon•45m ago•0 comments

Use the XDG Base Directory Specification

https://xdgbasedirectoryspecification.com/
1•Bogdanp•47m ago•0 comments

GitHub MCP Server now with server instructions, better tools, and more

https://github.blog/changelog/2025-10-29-github-mcp-server-now-comes-with-server-instructions-bet...
2•kordlessagain•51m ago•0 comments

Hello-World iOS App in Assembly

https://gist.github.com/nicolas17/966a03ce49f949dd17b0123415ef2e31
7•pabs3•54m ago•2 comments

No Nvidia Chips Needed: Amazon's New AI Data Center for Anthropic [video]

https://www.youtube.com/watch?v=vnGC4YS36gU
1•mgh2•55m ago•0 comments

IRCd service written in awk

https://example.fi/blog/ircd.html
22•pabs3•1h ago•3 comments

InlinedVector: Header-only SBO that supports const members in insert/erase

https://blog.lloyal.ai/inlinedvector-yet-another-sbo-container-but-with-a-good-reason
1•zuhair•1h ago•0 comments

FVDB: Large scale GPU reality capture from Nvidia

https://fvdb.ai/reality-capture/
5•fwilliams•1h ago•0 comments

Trump orders immediate resumption of US nuclear weapons testing

https://www.france24.com/en/live-news/20251030-trump-orders-immediate-resumption-of-us-nuclear-we...
12•hackthemack•1h ago•11 comments

The Dumb Design of Modern Cars [video]

https://www.youtube.com/watch?v=HauQtcj7UTM
2•viewtransform•1h ago•0 comments

AI Agents Are Terrible Freelance Workers

https://www.wired.com/story/ai-agents-are-terrible-freelance-workers/
3•botanicals6•1h ago•0 comments

How the most feared algorithm in algebra is simple

3•diegoofernandez•1h ago•0 comments

How Google handles JavaScript throughout the indexing process

https://vercel.com/blog/how-google-handles-javascript-throughout-the-indexing-process
2•lelandfe•1h ago•0 comments

Wan 2.5 AI Video Generator with Audio Sync

https://www.jxp.com/wan
1•cy1414569•1h ago•1 comments

LLMs Are Moving Local – So Why Are We Still Paying for Tokens?

3•josephinepqt•11h ago
Is anyone still using LLM APIs?

Open models like SmolLM3 (~3B) and Qwen2-1.5B are getting surprisingly capable - and they run fine on laptops or even phones. With Apple rolling out on-device LLMs in iOS 18, it feels like we’re entering a real local-first phase.

Small models already handle focused jobs: lightweight copilots, captioning, inspection.

And not just text - Gemma 2 2B Vision and Qwen2-VL can caption and reason about images locally.

Hardware’s there too: Apple’s M-series Neural Engine hits ~133 TOPS, and consumer GPUs chew through 4-8B models.

Tooling’s catching up fast:

* Ollama for local runtimes (GGUF, simple CLI)
* Cactus / RunLocal for mobile
* ExecuTorch / LiteRT for on-device inference
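For a sense of what the Ollama route looks like from code, here is a minimal sketch that talks to a locally running Ollama server over its HTTP API. It assumes the server is listening on its default port (11434) and that a small model has already been pulled; the model tag `qwen2:1.5b` and the helper names `build_request` and `generate` are illustrative assumptions, not something from the post.

```python
import json
import urllib.request

# Default address of a locally running Ollama server (assumed; adjust if yours differs).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model: str, prompt: str) -> urllib.request.Request:
    """Build a non-streaming generate request for the local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str) -> str:
    """Send the prompt to the local model and return the generated text."""
    with urllib.request.urlopen(build_request(model, prompt)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, something like `generate("qwen2:1.5b", "Summarize: local models are getting capable.")` should return the completion as a string. Nothing leaves the machine, which is the privacy argument in practice.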

Still some pain: iOS memory limits, packaging overhead, distillation quirks. Quantization helps, but 4-bit isn’t magic.
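To put rough numbers on why quantization helps but isn't magic, here is a back-of-the-envelope sketch of weight memory at different bit widths. It counts weights only, assuming every parameter is stored at the stated width; the KV cache, activations, runtime overhead, and per-block quantization metadata all come on top, so real footprints are higher. The function name and the parameter counts are illustrative.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Illustrative sizes in the range the post talks about.
for name, n_params in [("1.5B", 1.5e9), ("3B", 3e9), ("8B", 8e9)]:
    for bits in (16, 8, 4):
        gb = weight_memory_gb(n_params, bits)
        print(f"{name} params @ {bits}-bit: {gb:.2f} GB")
```

A 3B model drops from about 6 GB at 16-bit to about 1.5 GB at 4-bit, which is the difference between not fitting and fitting on many phones, but that still has to share memory with everything else the app and OS need.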

The upside’s clear: privacy by default, offline by design, no network latency, no token bills.

The cloud won’t die, but local compute finally feels fun again.

What’s keeping small models from going fully on-device?

Comments

almosthere•11h ago
It's too slow for agentic flows
josephinepqt•11h ago
can you give an example?
sitestable•11h ago
Hybrid is the answer. Local for high-volume/low-stakes, APIs for quality-critical tasks.

Token costs matter at scale. Engineering time matters more when you're small.
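The hybrid split described here can be sketched as a tiny router. This is only an illustration of the idea: the `Task` type, the `quality_critical` flag, and the `"local"` / `"api"` backend labels are hypothetical, and a real system would decide criticality from something richer than a boolean.

```python
from dataclasses import dataclass


@dataclass
class Task:
    prompt: str
    quality_critical: bool  # hypothetical flag set by the caller


def route(task: Task) -> str:
    """Send quality-critical work to a hosted API, everything else to the local model."""
    return "api" if task.quality_critical else "local"


tasks = [
    Task("Caption this product photo", quality_critical=False),
    Task("Draft the contract summary for legal review", quality_critical=True),
    Task("Autocomplete this sentence", quality_critical=False),
]

for task in tasks:
    print(f"{route(task):5s} <- {task.prompt}")
```

The high-volume, low-stakes calls never hit the token meter, while the few calls where quality matters most still get the larger hosted model.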