frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I have written gemma3 inference in pure C

https://github.com/robitec97/gemma3.c
35•robitec97•2d ago

Comments

w4yai•1h ago
> It proves that modern LLMs can run without Python, PyTorch, or GPUs.

Did we need any proof of that ?

skybrian•1h ago
Knowing the performance is interesting. Apparently it's 1-3 tokens/second.
kgeist•1h ago
ikllama.cpp is a fork of llama.cpp which specializes on CPU inference, some benchmarks from 1 year ago: https://github.com/ikawrakow/ik_llama.cpp/discussions/164
jasonjmcghee•1h ago
I guess llama.cpp isn't quite as popular as I had assumed.
tolerance•1h ago
I imagine so regarding GPUs, right? Is this is a legitimate project then doesn’t it provide a proof of concept for performance constraints that relate to them? Couldn't the environmentally concerned take this as an indicator that the technology can progress without relying on as much energy is potentially spent now? Shouldn’t researchers in the industry be thinking of ways to prevent the future capabilities of the technology from outrunning the capacity of the infrastructure?

I know very little about AI but these are things that come to mind here for me.

yorwba•1h ago
GPUs are more efficient than CPUs for LLM inference, using less energy per token and being cheaper overall. Yes, a single data center GPU draws a lot of power and costs a fortune, but it can also serve a lot more people in the time your CPU or consumer GPU needs to respond to a single prompt.
tolerance•1h ago
I got you, thanks!
jdefr89•1h ago
Python and PyTorch all call out to C libraries… I don’t get what he means by “proving LLMs can run without Python and PyTorch” at all. Seems like they don’t understand basic fundamentals about things here…
austinvhuang•1h ago
My first implementation of gemma.cpp was kind of like this.

There's such a massive performance differential vs. SIMD though that I learned to appreciate SIMD (via highway) as one sweet spot of low-dependency portability that sits between C loops and the messy world of GPUs + their fat tree of dependencies.

If anyone want to learn the basics - whip out your favorite LLM pair programmer and ask it to help you study the kernels in the ops/ library of gemma.cpp:

https://github.com/google/gemma.cpp/tree/main/ops

janwas•1h ago
:D Your code was nicely written and it was a pleasure to port to SIMD because it was already very data-parallel.
behnamoh•1h ago
but why tho? next gemma is coming and no one uses gemma 3 in prod anyway.
NitpickLawyer•1h ago
> no one uses gemma 3 in prod anyway.

Umm, we do. It's still one of the best for eu countries support / help chatbot style. It's got good (best?) multilingual support ootb, it's very "safe" (won't swear, won't display chinese characters, etc) and it's pretty fast.

behnamoh•47m ago
but it lacks system prompt support.
gunalx•44m ago
Yep. Before gemma3 we where struggling with multilinguality on smaller European languages, and it is still one of the batter ones in that regard (even large open or closed models struggle with this to some extent). Gemma3 also is still pretty decent multi modal wise.
uncognic•1h ago
I think /* */ single-line comments is a pretty good indication.
data-ottawa•13m ago
Gemma3 is probably the best supported fine tunable model.

Mousefood – Build embedded terminal UIs for microcontrollers

https://github.com/ratatui/mousefood
104•orhunp_•4h ago•31 comments

Airfoil (2024)

https://ciechanow.ski/airfoil/
283•brk•6h ago•45 comments

Oban, the job processing framework from Elixir, has come to Python

https://www.dimamik.com/posts/oban_py/
123•dimamik•4h ago•59 comments

Android's desktop interface leaks

https://9to5google.com/2026/01/27/android-desktop-leak/
81•thunderbong•17h ago•160 comments

The Five Levels: From Spicy Autocomplete to the Software Factory

https://www.danshapiro.com/blog/2026/01/the-five-levels-from-spicy-autocomplete-to-the-software-f...
15•benwerd•4d ago•5 comments

Computer History Museum Launches Digital Portal to Its Collection

https://computerhistory.org/press-releases/computer-history-museum-launches-digital-portal-to-its...
57•ChrisArchitect•3h ago•11 comments

Microsoft forced me to switch to Linux

https://www.himthe.dev/blog/microsoft-to-linux
1375•bobsterlobster•7h ago•1072 comments

Trying to craft AI images that are worth displaying to end users

https://www.approachwithalacrity.com/how-to-turn-sfo-jfk-into-a-beautiful-photo/
8•bblcla•2h ago•2 comments

Who sets the Doomsday Clock?

https://www.popularmechanics.com/science/a70162364/setting-the-doomsday-clock/
7•littlexsparkee•1h ago•5 comments

Spinning around: Please don't – Common problems with spin locks

https://www.siliceum.com/en/blog/post/spinning-around/
45•bdash•4h ago•15 comments

Native Linux VST plugin directory

https://linuxmusic.rocks
36•Aldipower•1h ago•14 comments

Tuning Semantic Search on JFMM.net – Joint Fleet Maintenance Manual

https://carlkolon.com/2026/01/27/jfmm-semantic-search/
5•cckolon•6h ago•0 comments

Show HN: The HN Arcade

https://andrewgy8.github.io/hnarcade/
268•yuppiepuppie•10h ago•69 comments

I overengineered a spinning top [video]

https://www.youtube.com/watch?v=Wp5NodfvvF4
95•bane•5d ago•25 comments

Amazon cuts 16k jobs

https://www.reuters.com/legal/litigation/amazon-cuts-16000-jobs-globally-broader-restructuring-20...
365•DGAP•5h ago•488 comments

Show HN: Cursor for Userscripts

https://github.com/chebykinn/browser-code
10•mifydev•1h ago•0 comments

Trinity large: An open 400B sparse MoE model

https://www.arcee.ai/blog/trinity-large
4•linolevan•20h ago•3 comments

That's not how email works

https://danq.me/2026/01/28/hsbc-dont-understand-email/
169•HotGarbage•3h ago•104 comments

Amazon One palm authentication discontinued

https://amazonone.aws.com/help
38•KerryJones•4h ago•81 comments

I have written gemma3 inference in pure C

https://github.com/robitec97/gemma3.c
35•robitec97•2d ago•16 comments

What does Werner Herzog's nihilist penguin teach us?

https://lwlies.com/article/werner-herzog-penguin-encounters-at-the-end-of-the-world
13•Marceltan•48m ago•14 comments

Native Instruments enters into insolvency proceedings

https://www.engadget.com/audio/native-instruments-enters-into-insolvency-proceedings-leaving-its-...
53•elevaet•1h ago•10 comments

Show HN: SHDL – A minimal hardware description language built from logic gates

https://github.com/rafa-rrayes/SHDL
14•rafa_rrayes•9h ago•3 comments

Kairos: AI interns for everyone

https://www.kairos.computer/
7•bamitsmanas•13h ago•2 comments

Kyber (YC W23) Is Hiring a Staff Engineer

https://www.ycombinator.com/companies/kyber/jobs/GPJkv5v-staff-engineer-tech-lead
1•asontha•9h ago

How London became the rest of the world’s startup capital

https://www.economist.com/britain/2026/01/26/how-london-became-the-rest-of-the-worlds-startup-cap...
4•ellieh•21h ago•0 comments

Did a celebrated researcher obscure a baby's poisoning?

https://www.newyorker.com/magazine/2026/02/02/did-a-celebrated-researcher-obscure-a-fatal-poisoning
3•littlexsparkee•21h ago•0 comments

Show HN: A MitM proxy to see what your LLM tools are sending

https://github.com/jmuncor/sherlock
8•jmuncor•2h ago•2 comments

Some notes on starting to use Django

https://jvns.ca/blog/2026/01/27/some-notes-on-starting-to-use-django/
160•ingve•22h ago•94 comments

3D-Printed Mathematical Lampshades

https://hessammehr.github.io/blog/posts/2025-12-24-maths-to-lampshade.html
35•hessammehr•4d ago•13 comments