
Running Gemma 4 locally with LM Studio's new headless CLI and Claude Code

https://ai.georgeliu.com/p/running-google-gemma-4-locally-with
57•vbtechguy•3h ago

Comments

vbtechguy•3h ago
Here is how I set up Gemma 4 26B for local inference on macOS that can be used with Claude Code.
canyon289•1h ago
This is a nice writeup!
trvz•1h ago

  ollama launch claude --model gemma4:26b
datadrivenangel•1h ago
It's amazing how simple this is, and it just works if you have ollama and claude installed!
jonplackett•1h ago
So wait what is the interaction between Gemma and Claude?
unsnap_biceps•1h ago
LM Studio offers an Anthropic-compatible local endpoint, so you can point Claude Code at it and it'll use your local model for its requests. However, I've had a lot of problems with LM Studio and Claude Code losing its place. It'll think for a while, come up with a plan, start to do it and then just halt in the middle. I'll ask it to continue and it'll do a small change and get stuck again.

Using ollama's api doesn't have the same issue, so I've stuck to using ollama for local development work.
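
A minimal sketch of the "point Claude Code at a local endpoint" setup described above, not a verified recipe. It assumes Claude Code reads the `ANTHROPIC_BASE_URL`, `ANTHROPIC_AUTH_TOKEN` and `ANTHROPIC_MODEL` environment variables, that the local server listens on `http://localhost:1234`, and that it does not validate the token. The model id `gemma-4-26b` and both function names are hypothetical placeholders.

```python
import os
import subprocess

# Assumption: the local Anthropic-compatible server listens here.
LOCAL_URL = "http://localhost:1234"


def local_claude_env(base_url=LOCAL_URL, model="gemma-4-26b", base_env=None):
    """Return a copy of the environment that redirects Claude Code to a local server."""
    env = dict(os.environ if base_env is None else base_env)
    env["ANTHROPIC_BASE_URL"] = base_url    # send API requests to the local server
    env["ANTHROPIC_AUTH_TOKEN"] = "local"   # placeholder; assumed to be ignored locally
    env["ANTHROPIC_MODEL"] = model          # hypothetical id; use whatever your server lists
    return env


def launch_claude(env):
    """Start the `claude` CLI (must be on PATH) with the redirected environment."""
    return subprocess.run(["claude"], env=env, check=False)
```

Calling `launch_claude(local_claude_env())` would start a session against the local server. Swapping `base_url` for another server's address is the only change needed to compare backends.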

keerthiko•58m ago
Claude Code is fairly notoriously token-inefficient as far as coding agents/harnesses go (I come from aider, pre-CC). It's only viable because the Max subscriptions give you an approximately unlimited token budget, which resets in a few hours even if you hit the limit. But this also only works because cloud models have massive context windows (1M tokens on Opus right now), which is difficult to make happen locally given the VRAM needed.

And if you somehow managed to open up a big enough VRAM playground, the open-weights models are not quite as good at wrangling such large context windows (even Opus is hardly capable) without basically getting confused about what they were doing before they finish parsing it.
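
To put rough numbers on the VRAM point above, here is a back-of-the-envelope KV-cache estimate. The architecture figures (48 layers, 8 KV heads, head dimension 128, 16-bit cache) are hypothetical and not taken from any specific model. Real models that use sliding-window attention or a quantized cache would need less.

```python
def kv_cache_bytes(n_tokens, n_layers=48, n_kv_heads=8, head_dim=128, bytes_per_elem=2):
    """Estimate KV-cache size for a plain full-attention transformer.

    Each token stores one key and one value vector per KV head in every layer.
    All default dimensions are hypothetical, chosen only for illustration.
    """
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem  # 2 = key + value
    return per_token * n_tokens


GIB = 2**30
print(kv_cache_bytes(32_768) / GIB)                 # 6.0 GiB for a 32k context
print(round(kv_cache_bytes(1_000_000) / GIB, 1))    # 183.1 GiB for a 1M context
```

Under these assumptions the cache alone for a 1M-token context is far beyond consumer GPUs, before counting the model weights.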

storus•52m ago
Can't you use Claude caveman mode?

https://github.com/JuliusBrussee/caveman

unsnap_biceps•41m ago
I use CC at work, so I haven't explored other options. Is there a better one to use locally? I presumed they were all going to be pretty similar.
Someone1234•1h ago
Claude Code seems like a popular frontend currently. I wonder how long until Anthropic releases an update to make it a little to a lot less turn-key? They've been very clear that they aren't exactly champions of this stuff being used outside of very specific ways.
moomin•57m ago
Right now it suits them down to the ground. You pay for the product and you don’t cost their servers anything.
phainopepla2•50m ago
You don't pay anything to use Claude Code as a front end to non-Anthropic models
quinnjh•16m ago
so no subscription is needed?
chvid•33m ago
Is it not about the same as using OpenCode?

And is running a local model with Claude Code actually usable for any practical work compared to the hosted Anthropic models?

wyre•14m ago
I think CC is popular because they are catering to the common denominator programmer and are going to continue to do that, not because CC is particularly turn-key.
martinald•55m ago
Just FYI, MoE doesn't really save (V)RAM. You still need all the weights loaded in memory; it just means you consult fewer of them per forward pass. So it improves tok/s but not VRAM usage.
IceWreck•36m ago
It does if you use an inference engine where you can offload some of the experts from VRAM to CPU RAM. That means I can fit a 35-billion-parameter MoE in, let's say, a 12 GB VRAM GPU + 16 GB of system memory.
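
The arithmetic behind the two comments above can be sketched as follows. It assumes 4-bit quantized weights, which is not stated in the thread, and it ignores the KV cache, activations and runtime overhead, so real headroom is smaller. The function names are hypothetical.

```python
def weights_gb(params_billion, bits_per_weight=4):
    """Size of the weights in GB (1 GB = 1e9 bytes) at the given quantization."""
    return params_billion * bits_per_weight / 8


def offload_split(params_billion, vram_gb, bits_per_weight=4):
    """Split the weights between GPU VRAM and CPU RAM.

    Returns (gb_in_vram, gb_in_cpu_ram). Every weight must live somewhere,
    which is the point that MoE does not shrink total memory; offloading
    only moves part of it off the GPU.
    """
    total = weights_gb(params_billion, bits_per_weight)
    in_vram = min(total, vram_gb)
    return in_vram, total - in_vram


print(weights_gb(35))                  # 17.5 GB of weights in total
print(offload_split(35, vram_gb=12))   # (12, 5.5): 5.5 GB spills to CPU RAM
```

Under these assumptions a 35B-parameter model needs 17.5 GB in total, so it fits in 12 GB of VRAM plus 16 GB of system memory only because about 5.5 GB of weights sit in CPU RAM.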

Gemma 4 on iPhone

https://apps.apple.com/nl/app/google-ai-edge-gallery/id6749645337
109•janandonly•1h ago•25 comments

LÖVE: 2D Game Framework for Lua

https://github.com/love2d/love
54•cl3misch•1d ago•18 comments

Artemis II crew see first glimpse of far side of Moon [video]

https://www.bbc.com/news/videos/ce3d5gkd2geo
309•mooreds•6h ago•222 comments

Eight years of wanting, three months of building with AI

https://lalitm.com/post/building-syntaqlite-ai/
429•brilee•7h ago•131 comments

Caveman: Why use many token when few token do trick

https://github.com/JuliusBrussee/caveman
566•tosh•11h ago•276 comments

A tail-call interpreter in (nightly) Rust

https://www.mattkeeter.com/blog/2026-04-05-tailcall/
96•g0xA52A2A•5h ago•8 comments

Computational Physics (2nd Edition)

https://websites.umich.edu/~mejn/cp2/
62•teleforce•5h ago•7 comments

Nanocode: The best Claude Code that $200 can buy in pure JAX on TPUs

https://github.com/salmanmohammadi/nanocode/discussions/1
98•desideratum•6h ago•19 comments

LibreOffice – Let's put an end to the speculation

https://blog.documentfoundation.org/blog/2026/04/05/lets-put-an-end-to-the-speculation/
78•eisa01•2h ago•40 comments

From birds to brains: My path to the fusiform face area (2024)

https://www.kavliprize.org/nancy-kanwisher-autobiography
22•everbody•3h ago•0 comments

Lisette a little language inspired by Rust that compiles to Go

https://lisette.run/
232•jspdown•13h ago•122 comments

Just 'English with Hanzi'

https://www.oldnorthwhale.com/p/why-modern-chinese-is-just-english
58•scour•1d ago•34 comments

Finnish sauna heat exposure induces stronger immune cell than cytokine responses

https://www.tandfonline.com/doi/full/10.1080/23328940.2026.2645467#abstract
265•Growtika•7h ago•178 comments

Baby's Second Garbage Collector

https://www.matheusmoreira.com/articles/babys-second-garbage-collector
35•matheusmoreira•3d ago•7 comments

Friendica – A Decentralized Social Network

https://friendi.ca/
100•janandonly•9h ago•36 comments

Show HN: Contrapunk – Real-time counterpoint harmony from guitar input

https://contrapunk.com/
97•waveywaves•19h ago•40 comments

DNS is Simple. DNS is Hard

https://www.wespiser.com/posts/2026-03-29-dns-simple-dns-hard.html
6•wespiser_2018•5d ago•3 comments

Bacteria found in the human intestine capable of improving muscle strength

https://www.ugr.es/en/about/news/bacteria-found-human-intestine-capable-improving-muscle-strength
82•gnabgib•1h ago•45 comments

The threat is comfortable drift toward not understanding what you're doing

https://ergosphere.blog/posts/the-machines-are-fine/
726•zaikunzhang•10h ago•481 comments

Hightouch (YC S19) Is Hiring

https://hightouch.com/careers#open-positions
1•joshwget•8h ago

Music for Programming

https://musicforprogramming.net
4•merusame•2h ago•2 comments

Microsoft Hasn't Had a Coherent GUI Strategy Since Petzold

https://www.jsnover.com/blog/2026/03/13/microsoft-hasnt-had-a-coherent-gui-strategy-since-petzold/
27•naves•3h ago•17 comments

The Enigma of Gertrude Stein

https://www.thenation.com/article/culture/gertrude-stein-afterlife-wade-review/
15•samclemens•3d ago•1 comments

Tracing Goroutines in Realtime with eBPF

https://sazak.io/articles/tracing-goroutines-in-realtime-with-ebpf-2026-03-31
47•darccio•3d ago•6 comments

My Google Workspace account suspension

https://zencapital.substack.com/p/sad-story-of-my-google-workspace
299•zenincognito•8h ago•158 comments

Show HN: Enter an Instagram/TikTok handle, get a data-backed price for collab

https://priceinfluencer.com
32•bozkan•2h ago•6 comments

Samsung Raises DRAM Prices Another ~30% for Q2 2026

https://old.reddit.com/user/BuySellRam/comments/1sd9z9k/samsung_raises_dram_prices_another_30_for...
18•jeffufl•2h ago•3 comments

Perfmon – Consolidate your favorite CLI monitoring tools into a single TUI

https://github.com/sumant1122/Perfmon
31•paperplaneflyr•6h ago•6 comments

Introduction to Computer Music (2009) [pdf]

https://composerprogrammer.com/introductiontocomputermusic.pdf
218•luu•18h ago•73 comments