Show HN: Local LLM on a Pi 4 controlling hardware via tool calling

https://github.com/stfurkan/pi-llm
2•stfurkan•1h ago

Comments

stfurkan•1h ago
Hi HN,

I spent the weekend experimenting to see if I could get a proper LLM running locally on an old Raspberry Pi 4 (4GB), and more importantly, if I could get it to interact with the physical world.

I ended up using PrismML's new Bonsai models. Because they are genuinely 1-bit (trained at 1-bit from scratch, not quantized down to 4-bit after training), they actually fit in the Pi's 4GB of RAM. The 4B parameter model is ~570 MB, and the 1.7B is ~240 MB.
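For a rough sense of why the bit width matters here, a back-of-the-envelope sketch follows. It only computes the idealized size when every parameter takes exactly N bits; the function name is made up for illustration. The gap between the idealized 1-bit numbers and the ~570 MB / ~240 MB files is presumably overhead such as scales, embeddings, and metadata, but that is an assumption, not something measured.

```python
def ideal_size_mb(params: float, bits_per_weight: float) -> float:
    """Idealized model size in MB (10**6 bytes): params * bits / 8, no overhead."""
    return params * bits_per_weight / 8 / 1e6


if __name__ == "__main__":
    # Parameter counts are the nominal "4B" and "1.7B" from the post.
    for name, params in [("4B", 4e9), ("1.7B", 1.7e9)]:
        one_bit = ideal_size_mb(params, 1)
        four_bit = ideal_size_mb(params, 4)
        print(f"{name}: ~{one_bit:.0f} MB at 1-bit vs ~{four_bit:.0f} MB at 4-bit")
```

Even before the runtime's own memory use, the idealized 4-bit 4B model would take about half of the Pi's 4GB.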

I loaded them through llama.cpp's router mode. I get around 2 tok/s on the 4B model for better reasoning, and 4-5 tok/s on the 1.7B when I just need speed. I tried Gemma 4 E2B first, but it was just too slow on 4GB of RAM.

The fun part: I wired up a cheap TM1637 4-digit display to the GPIO pins. Since Bonsai supports native tool calling, I wrote a small Python proxy that injects an update_display function into requests. When the model decides to use the tool, the proxy catches the streaming call, extracts the text, and drives the display. You can tell it to "show 1453" and it physically lights up.
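A minimal sketch of the two proxy steps described above, not the code from the repo. It assumes the OpenAI-compatible chat-completions format that llama.cpp's server speaks (a `tools` list in the request, tool calls arriving as `delta.tool_calls` fragments in streamed chunks). The `update_display` name comes from the post; the single `text` parameter and the helper names are assumptions. Actually driving the TM1637 is left out because it depends on the GPIO library in use.

```python
import copy
import json

# Hypothetical schema: only the name "update_display" comes from the post;
# the "text" parameter is an assumption for this sketch.
UPDATE_DISPLAY_TOOL = {
    "type": "function",
    "function": {
        "name": "update_display",
        "description": "Show a short text on the 4-digit display.",
        "parameters": {
            "type": "object",
            "properties": {"text": {"type": "string"}},
            "required": ["text"],
        },
    },
}


def inject_tool(request_body):
    """Return a copy of the chat request with update_display added to its tools."""
    body = copy.deepcopy(request_body)
    tools = body.setdefault("tools", [])
    names = [(t.get("function") or {}).get("name") for t in tools]
    if "update_display" not in names:
        tools.append(UPDATE_DISPLAY_TOOL)
    return body


def collect_tool_calls(chunks):
    """Stitch streamed tool-call fragments together, keyed by their index."""
    calls = {}
    for chunk in chunks:
        for choice in chunk.get("choices", []):
            delta = choice.get("delta") or {}
            for frag in delta.get("tool_calls") or []:
                call = calls.setdefault(frag.get("index", 0),
                                        {"name": "", "arguments": ""})
                fn = frag.get("function") or {}
                call["name"] += fn.get("name") or ""
                call["arguments"] += fn.get("arguments") or ""
    return [calls[i] for i in sorted(calls)]


def display_text(chunks):
    """Return the text of the first update_display call, or None if there is none."""
    for call in collect_tool_calls(chunks):
        if call["name"] != "update_display":
            continue
        try:
            args = json.loads(call["arguments"])
        except json.JSONDecodeError:
            return None  # arguments were incomplete or malformed
        return args.get("text") if isinstance(args, dict) else None
    return None
```

The arguments arrive as JSON split across several chunks, so they are only parsed once the stream has ended. In this sketch, a "show 1453" request would end with `display_text(...)` returning `"1453"`, which is the value handed to the display driver.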

It's definitely just a weekend project with rough edges: a 7-segment display can't render W or M, the certs are self-signed, and so on. The code and setup scripts are all in the repo.
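One way to live with the W/M limitation is to sanitize the text before it reaches the display. This is a sketch under assumptions, not what the repo does: the function name, the "-" placeholder, and the padding to 4 characters are all made up for illustration, and only W and M are treated as unrenderable because those are the two letters mentioned above.

```python
# Only W and M come from the post; extend this set for other glyphs as needed.
UNRENDERABLE = set("WM")


def fit_for_display(text, width=4, placeholder="-"):
    """Replace unrenderable letters, then cut or pad the text to the display width."""
    cleaned = "".join(
        placeholder if ch.upper() in UNRENDERABLE else ch for ch in text
    )
    return cleaned[:width].ljust(width)
```

With these assumptions, "WOW" comes out as "-O- " and "HELLO" is cut to "HELL".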

I’m thinking about adding servos or sensors next. Would love to hear your thoughts or see if anyone else is building edge AI hardware projects!

trailheadsec•1h ago
What’s the quality of the model output at this RAM / model selection? Local models fascinate me; I run Ollama on an M1 Max MacBook Pro with 64GB of RAM, but I am a little bit inexperienced with the ins and outs. Thank you for sharing!
stfurkan•48m ago
I specifically chose PrismML's 1-bit models because their tiny size allows them to actually fit on smaller hardware like the Pi. The 1.7B model is great for basic tasks and tool triggers, while the 4B model seems reasonable for some daily tasks, though it's much slower on this setup. If you try these models on your M1 Max, I assume they'll run incredibly fast. I previously tried them on a VPS and the inference speed was really good for my experiment.
