frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

EdgeFoundry – Deploy and Monitor Local LLMs

https://github.com/TheDarkNight21/edge-foundry
1•allaffa•1h ago

Comments

allaffa•1h ago
Hey HN,

I’ve been working on EdgeFoundry, an open-source DevOps and observability toolkit that makes it easy to deploy, monitor, and manage local LLMs on your own machine or private server.

What it does EdgeFoundry helps you: • Run quantized LLMs locally (like TinyLlama or Phi-3) using LlamaCPP • Monitor telemetry such as latency, tokens per second, and memory usage • Use a simple CLI to deploy, start, stop, and view models • Store and visualize metrics in a local SQLite database and React dashboard • Keep everything offline-first and privacy-friendly

In short: Ollama runs your model — EdgeFoundry helps you deploy and observe it like a production system.

Key Features (MVP) • CLI: edgefoundry deploy/start/stop/status • Local agent (FastAPI + LlamaCPP) to run the model • Telemetry logging for latency, memory, and token throughput • Local dashboard (React) for visualizing metrics • SQLite backend for offline data storage • Support for TinyLlama and Phi-3 Mini out of the box

Why I built this While building local AI projects like offline RAG assistants, I realized there was no easy way to deploy and track local models with observability and lifecycle management like we have in the cloud. Developers want control, privacy, and insight — but tools like Ollama lack monitoring, telemetry, or multi-device orchestration.

EdgeFoundry fills that gap by offering the DevOps and observability layer for edge AI.

Who it’s for • Developers running quantized models locally • Teams building offline-first AI apps • Startups needing on-prem AI for compliance • Anyone who wants visibility into local LLM performance

Quick Start

# 1. Install pip install edgefoundry

# 2. Deploy a local model edgefoundry deploy --model tinyllama-1b-3bit.gguf

# 3. Start the agent edgefoundry start

# 4. Open the dashboard edgefoundry dashboard

You’ll see live metrics like latency, memory usage, and tokens per second for each inference.

Future Plans The next phase of EdgeFoundry is to enable mass deployment and testing of local AI models across devices. The goal is to make it possible for companies to: • Deploy local models at scale to phones, laptops, or IoT devices • Collect telemetry and performance data from real devices or simulations (for example, using Android Studio or local emulators) • Use this data to evaluate, tune, and monitor model performance before and after rollout

This would let teams building privacy-first or on-device AI systems manage fleets of local deployments with the same level of visibility and control they have in the cloud.

Feedback wanted This is an early MVP. I’d love feedback on: • What features you’d want for multi-device orchestration • Whether cloud sync or over-the-air updates would be useful • What matters most for large-scale local deployments on phones or computers

GitHub: https://github.com/TheDarkNight21/edge-foundry

If you try it, please share your experience or open an issue. I’m eager to hear from others building privacy-first AI tools or deploying LLMs locally.

Thanks for reading. I’ll be in the comments to answer questions and discuss next steps.

ESIM Coupons compare travel eSIM plans and find coupons

https://esim.coupons/
1•flippo1•2m ago•1 comments

Yann LeCun: Self-Supervised Learning, JEPA, World Models, and the Future of AI

https://www.youtube.com/watch?v=yUmDRxV0krg
1•twoodfin•8m ago•0 comments

Jane Goodall's legacy: three ways she changed science

https://www.nature.com/articles/d41586-025-03209-y
1•pseudolus•8m ago•1 comments

High energy density carbon–cement supercapacitors for energy storage

https://www.pnas.org/doi/10.1073/pnas.2511912122
1•gnabgib•12m ago•0 comments

China fields Golden Dome prototype before the US can come up with a plan

https://www.scmp.com/news/china/science/article/3327224/china-fields-golden-dome-prototype
1•wakawaka28•13m ago•0 comments

MIT's concrete battery just got 10 times more powerful

https://newatlas.com/energy/mit-concrete-battery-powerful-supercapacitor/
1•thelastgallon•17m ago•0 comments

PodRocket Podcast: Inside the Recent NPM Supply Chain Attacks

https://socket.dev/blog/podrocket-podcast-npm-supply-chain-attacks
1•feross•18m ago•0 comments

Basic Math Textbook: The Napkin Project

https://web.evanchen.cc/napkin.html
1•eapriv•19m ago•0 comments

Larry Krantz Digital Playground

https://www.larrykrantz.com/
1•dmbche•20m ago•0 comments

City Was Forced to Overhaul Its Police Department. Crime Plummeted

https://www.nytimes.com/2025/10/02/nyregion/newark-police-federal-oversight.html
4•defrost•21m ago•0 comments

Retrieval Embedding Benchmark

https://mteb-leaderboard.hf.space:443/?benchmark_name=RTEB(beta)
3•fzliu•25m ago•0 comments

InstaDeep delivers AI-powered decision-making systems for the Enterprise

https://instadeep.com/
2•doener•28m ago•0 comments

Peter Thiel Essay on the Antichrist, One Piece, and More

https://firstthings.com/voyages-to-the-end-of-the-world/
1•Balbus•28m ago•2 comments

BioNTech to Host Second AI Day

https://investors.biontech.de/news-releases/news-release-details/biontech-host-second-ai-day-edit...
2•doener•29m ago•0 comments

The Last Answer – Isaac Asimov

https://www.highexistence.com/the-last-answer-short-story/
1•htk•34m ago•0 comments

Linked Pay Attention

https://exple.tive.org/blarg/2025/10/02/linked-pay-attention/
2•pavel_lishin•34m ago•0 comments

A better GPT resume builder for job seekers

1•cvnomist•41m ago•1 comments

Shade Ball

https://en.wikipedia.org/wiki/Shade_ball
1•red369•41m ago•0 comments

Atomic Neighborhoods in Semiconductors Provide Avenue for Microelectronics

https://newscenter.lbl.gov/2025/09/25/atomic-neighborhoods-in-semiconductors-provide-new-avenue-f...
1•gnabgib•41m ago•0 comments

Math Academy

https://www.mathacademy.com
1•emrehan•42m ago•0 comments

Chinese Ship Accused of Looting Iconic WWII Wrecks

https://www.historynet.com/chinese-accused-of-looting-iconic-wwii-wrecks/
4•ChuckMcM•44m ago•2 comments

Now Arriving: A New Theory of In-Flight Turbulence

https://www.nytimes.com/2025/09/24/science/physics-airplanes-turbulence.html
3•bookofjoe•44m ago•1 comments

Show HN: Running QR codes for YouTube audio (share moments with a scan)

https://github.com/bwagner/self_ref_yt_vid
1•loopology•46m ago•0 comments

Shutdown Risks Leaving Millions with Costlier Health Insurance

https://www.bloomberg.com/news/articles/2025-10-02/obamacare-premiums-for-2026-left-in-limbo-by-g...
5•petethomas•46m ago•0 comments

Repeat Covid-19 Infections Could Double Your Risk of Long Covid

https://time.com/7322188/covid-reinfection-long-covid/
2•amichail•46m ago•0 comments

Fair 1.0: Decentralised WordPress packages

https://fair.pm/blog/2025/09/24/discover-trust-install-fair-1-0-is-here/
1•Malakun•48m ago•0 comments

Lex Fridman Podcast #482 – Pavel Durov

https://lexfridman.com/pavel-durov/
2•uyzstvqs•50m ago•0 comments

Recycled Components in 3D Concrete Printing Mixes: A Review

https://www.mdpi.com/1996-1944/18/19/4517
1•PaulHoule•51m ago•0 comments

How much RAM does your Linux PC need in 2025

https://www.zdnet.com/article/how-much-ram-does-your-linux-pc-really-need-in-2025-i-did-the-math-...
1•teleforce•53m ago•0 comments

US to open more public land for coal mining

https://apnews.com/article/trump-coal-mining-power-climate-burgum-electricity-eebec80c6060f37890d...
5•softwaredoug•54m ago•0 comments