Ollama is slower, and they started out as a shameless llama.cpp ripoff without giving credit. Now they've "ported" it to Go, which mostly means vibe-code-translating llama.cpp, bugs included.
Hmm, what about the fact that Ollama is open-source, can run in Docker, etc.?
Lately I’ve been playing with Unsloth Studio and think that’s probably a much better “give it to a beginner” default.
What does unsloth-studio bring on top?
brew install llama.cpp
Use the built-in CLI, server, or chat interface, and hook it up to any other app.
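On the "hook it up to any other app" part: the `llama-server` binary that ships with the Homebrew formula exposes an OpenAI-compatible HTTP API. A minimal client sketch, assuming a server started locally with something like `llama-server -m model.gguf --port 8080` (the base URL and the untyped payload are my assumptions; the `/v1/chat/completions` path is the standard OpenAI-style endpoint):

```python
import json
import urllib.request

# Assumed local server address; adjust to wherever llama-server is listening.
BASE_URL = "http://localhost:8080"

def build_chat_request(prompt: str) -> urllib.request.Request:
    """Build an OpenAI-style chat completion POST for llama-server."""
    payload = {"messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

def chat(prompt: str) -> str:
    """Send the request and extract the assistant's reply.
    Only call this with a server actually running."""
    with urllib.request.urlopen(build_chat_request(prompt)) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]

# Inspect the request without sending it:
req = build_chat_request("Say hello in one word.")
print(req.full_url)
```

Because the endpoint speaks the OpenAI wire format, any OpenAI-compatible client library can be pointed at it by swapping the base URL.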
greenstevester•1h ago
It's essentially a model that's learned to do the absolute minimum amount of work while still getting paid. I respect that enormously.
It scores 1441 on Arena Elo, roughly the same as Qwen 3.5 at 397B and Kimi k2.5 at 1100B.
Ollama v0.19 switched to Apple's MLX framework on Apple Silicon; decode is about 93% faster.
They've also improved caching, so your coding agents don't have to re-process the entire prompt every time. About time, I'd say.
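The caching in question is prompt-prefix reuse: when a new request shares a prefix with the previous context (same system prompt, same files, new question), only the tokens past the shared prefix need a fresh forward pass. A toy sketch of the idea, not Ollama's actual code, with whitespace splitting standing in for a real tokenizer:

```python
def common_prefix_len(a: list[str], b: list[str]) -> int:
    """Length of the shared token prefix between two sequences."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

class PrefixCache:
    """Remembers the last processed token sequence; a new request only
    'pays' for the tokens past the longest shared prefix."""

    def __init__(self) -> None:
        self.cached: list[str] = []

    def process(self, prompt: str) -> int:
        tokens = prompt.split()  # stand-in for real tokenization
        reused = common_prefix_len(self.cached, tokens)
        self.cached = tokens
        return len(tokens) - reused  # tokens that need actual compute

cache = PrefixCache()
first = cache.process("system prompt plus file A plus question one")
second = cache.process("system prompt plus file A plus question two")
print(first, second)  # → 8 1: the second turn recomputes one token
```

This is why agent loops that keep a stable preamble and only append new turns benefit so much: almost the whole context hits the cache.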
The gist covers the full setup: install, auto-start on boot, keep the model warm in memory.
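For the auto-start-on-boot piece, the usual macOS mechanism is a launchd agent. A sketch along those lines (the label and binary path are illustrative for a Homebrew install; `OLLAMA_KEEP_ALIVE=-1` is what keeps the last-used model resident in memory instead of unloading it after a request):

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN"
  "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
  <!-- Hypothetical label; save as ~/Library/LaunchAgents/local.ollama.plist -->
  <key>Label</key><string>local.ollama</string>
  <key>ProgramArguments</key>
  <array>
    <string>/opt/homebrew/bin/ollama</string>
    <string>serve</string>
  </array>
  <key>RunAtLoad</key><true/>
  <key>KeepAlive</key><true/>
  <key>EnvironmentVariables</key>
  <dict>
    <!-- -1 = never unload the model after it has been loaded -->
    <key>OLLAMA_KEEP_ALIVE</key><string>-1</string>
  </dict>
</dict>
</plist>
```

Load it with `launchctl load ~/Library/LaunchAgents/local.ollama.plist`, or just log out and back in.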
It runs on a 24GB Mac mini, which means the most expensive part of your local AI setup is still the desk you put it on.