frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Local LLM on a Pi 4 controlling hardware via tool calling

https://github.com/stfurkan/pi-llm
3•stfurkan•4h ago

Comments

stfurkan•4h ago
Hi HN,

I spent the weekend experimenting to see if I could get a proper LLM running locally on an old Raspberry Pi 4 (4GB), and more importantly, if I could get it to interact with the physical world.

I ended up using PrismML's new Bonsai models. Because they are genuinely 1-bit (trained from scratch at 1-bit, not quantized down to 4-bit), they actually fit. The 4B parameter model is ~570 MB, and the 1.7B is ~240 MB.

I loaded them through llama.cpp's router mode. I get around 2 tok/s on the 4B model for better reasoning, and 4-5 tok/s on the 1.7B when I just need speed. I tried Gemma 4 E2B first, but it was just too slow on 4GB of RAM.

The fun part: I wired up a cheap TM1637 4-digit display to the GPIO pins. Since Bonsai supports native tool calling, I wrote a small Python proxy that injects an update_display function into requests. When the model decides to use the tool, the proxy catches the streaming call, extracts the text, and drives the display. You can tell it to "show 1453" and it physically lights up.

It’s definitely just a weekend project (7-segment displays can't render W or M, self-signed certs, etc.). The code and setup scripts are all in the repo.

I’m thinking about adding servos or sensors next. Would love to hear your thoughts or see if anyone else is building edge AI hardware projects!

trailheadsec•3h ago
What’s the quality of the model output at this RAM / model selection? Local models fascinate me; I run Ollama on an M1 Max MacBook Pro with 64GB of RAM, but I am a little bit inexperienced with the ins and outs. Thank you for sharing!
stfurkan•3h ago
I specifically chose PrismML's 1-bit models because their tiny size allows them to actually fit on smaller hardware like the Pi. The 1.7B model is great for basic tasks and tool triggers, while the 4B model seems reasonable for some daily tasks, though it's much slower on this setup. If you try these models on your M1 Max, I assume they'll run incredibly fast. I previously tried them on a VPS and the inference speed was really good for my experiment.

Show HN: boringBar – a taskbar-style dock replacement for macOS

https://boringbar.app/
265•a-ve•10h ago•160 comments

Show HN: Oberon System 3 runs natively on Raspberry Pi 3 (with ready SD card)

https://github.com/rochus-keller/OberonSystem3Native/releases
173•Rochus•14h ago•36 comments

Show HN: Claudraband – Claude Code for the Power User

https://github.com/halfwhey/claudraband
97•halfwhey•10h ago•32 comments

Show HN: Local LLM on a Pi 4 controlling hardware via tool calling

https://github.com/stfurkan/pi-llm
3•stfurkan•4h ago•3 comments

Show HN: A social feed with no strangers

https://www.grateful.so/
3•rpatni•4h ago•0 comments

Show HN: Pardonned.com – A searchable database of US Pardons

483•vidluther•1d ago•262 comments

Show HN: Rekal – Long-term memory for LLMs in a single SQLite file

https://github.com/janbjorge/rekal
2•jeeybee•6h ago•1 comments

Show HN: Stork – MCP server so Claude/Cursor can search 14k MCP servers AI tools

https://www.stork.ai
2•usestork•7h ago•0 comments

Show HN: FluidCAD – Parametric CAD with JavaScript

https://fluidcad.io/
153•maouida•2d ago•36 comments

Show HN: T4 – a versioned datastore with branching and time-travel (S3-backed)

https://github.com/t4db/t4
6•amakhov•10h ago•0 comments

Show HN: Waffle – Native macOS terminal that auto-tiles sessions into a grid

https://waffle.baby
38•olleeolleeollee•1d ago•17 comments

Show HN: A WYSIWYG word processor in Python

https://codeberg.org/chrisecker/miniword
85•chrisecker•2d ago•37 comments

Show HN: Eve – Managed OpenClaw for work

https://eve.new/login
71•zachdive•2d ago•39 comments

Show HN: Bullseye2D – A Dart library for cross-platform 2D games

https://github.com/bullseye2d/bullseye2d
4•joemanaco•18h ago•0 comments

Show HN: ReverseYC

https://rocketplace.org/reverseyc
2•remarketme•13h ago•1 comments

Show HN: I built a Cargo-like build tool for C/C++

https://github.com/randerson112/craft
172•randerson_112•3d ago•166 comments

Show HN: Formal – Formal verification for AI-generated code using Lean 4

https://github.com/yamafaktory/formal
4•yamafaktory•13h ago•4 comments

Show HN: ApplePy – Embed and Call Swift from Python (Like PyO3, but for Swift)

https://github.com/jagtesh/ApplePy
2•sheepscreek•13h ago•2 comments

Show HN: Telegram feed reader using DNS TXT records for Iran's Internet shutdown

https://github.com/sartoopjj/thefeed
2•znano•13h ago•2 comments

Show HN: Marimo pair – Reactive Python notebooks as environments for agents

https://github.com/marimo-team/marimo-pair
139•manzt•5d ago•34 comments

Show HN: CSS Studio. Design by hand, code by agent

https://cssstudio.ai
170•SirHound•3d ago•107 comments

Show HN: Real-Time OLAP Infrastructure

https://modolap.com
4•ronfriedhaber•14h ago•0 comments

Show HN: Chunk – macOS menu bar time-blocking app with Claude AI integration

https://www.chunkapp.net
2•dudleyspence•14h ago•1 comments

Show HN: Android AI agent-assistant operating your apps (no adb,PC,root,etc.)

2•sshnaidm1•15h ago•2 comments

Show HN: Uncook, the Social Network for Food

https://uncook.xyz
4•moffers•15h ago•2 comments

Show HN: Minnow – minimal now pages via chat

https://minnow.social/
3•freshman_dev•16h ago•1 comments

Show HN: A Better Internet

https://tomclancy.info/pages/abi-a-better-internet.html
10•tclancy•1d ago•3 comments

Show HN: Keeper – embedded secret store for Go (help me break it)

https://github.com/agberohq/keeper
63•babawere•2d ago•33 comments

Show HN: Moon simulator game, ray-casting

https://mooncraft2000.com
114•JKCalhoun•6d ago•25 comments

Show HN: Toy Python Lisp interpreters based on the 1960 McCarthy paper

https://github.com/jhud/lisp
3•disconnection•17h ago•0 comments