I originally built this because I got tired of constantly SSHing into my server to edit a config just to try out a new model. It's grown a lot since then.
What it does:
- Web UI for creating and managing LLM instances from your browser
- Full llama.cpp model lifecycle: download from HuggingFace, create preset.ini configs with an in-browser editor, load/unload models via router mode
- Automatic idle timeout, LRU eviction, and instance limits
- llama.cpp, mlx_lm, and vllm backends
- OpenAI- and Anthropic-compatible API endpoints (backend-dependent)
- Multi-node support for distributing instances across hosts
- Inference API keys with per-instance access control
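The post doesn't show what a preset.ini looks like, so the fragment below is only a guess at the shape: a named section whose keys mirror common llama-server flags. The section name, path, and values are all invented for illustration.

```ini
; Hypothetical preset.ini sketch — the real key names depend on the project,
; but they would map roughly onto llama-server options.
[qwen-7b]
model = /models/qwen2.5-7b-instruct-q4_k_m.gguf
ctx-size = 8192
n-gpu-layers = 99
```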
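The idle-timeout / LRU-eviction / instance-limit behavior in the list above can be sketched as a small pool keyed by last-use time. This is a minimal illustration of the idea, not the project's actual code; names like `InstancePool`, `max_instances`, and `idle_seconds` are invented for the example.

```python
import time
from collections import OrderedDict

class InstancePool:
    """Toy model of instance management: an LRU-ordered pool with a
    hard instance limit and an idle timeout. All names are hypothetical."""

    def __init__(self, max_instances=2, idle_seconds=300.0):
        self.max_instances = max_instances
        self.idle_seconds = idle_seconds
        self._pool = OrderedDict()  # instance name -> last-used timestamp

    def touch(self, name, now=None):
        """Load an instance (or mark it used); evict the LRU entry if over the limit."""
        now = time.monotonic() if now is None else now
        if name in self._pool:
            self._pool.move_to_end(name)  # most recently used moves to the end
        self._pool[name] = now
        while len(self._pool) > self.max_instances:
            self._pool.popitem(last=False)  # drop the least recently used instance

    def sweep_idle(self, now=None):
        """Unload every instance that has been idle longer than idle_seconds."""
        now = time.monotonic() if now is None else now
        stale = [n for n, t in self._pool.items() if now - t > self.idle_seconds]
        for name in stale:
            del self._pool[name]

    def loaded(self):
        return list(self._pool)
```

A periodic `sweep_idle()` plus `touch()` on every request is enough to get all three behaviors from one data structure.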
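Per-instance access control for inference keys boils down to mapping each key to the set of instances it may call. A hypothetical sketch, assuming a simple in-memory mapping (the real project's storage and key format will differ):

```python
# Hypothetical per-instance API-key check; key names and instance
# names below are invented for the example.
KEY_SCOPES = {
    "sk-team-a": {"qwen-7b", "llama-8b"},  # key restricted to two instances
    "sk-admin": None,                      # None = access to every instance
}

def authorize(api_key, instance):
    """Return True if the key exists and may reach the given instance."""
    if api_key not in KEY_SCOPES:
        return False
    scope = KEY_SCOPES[api_key]
    return scope is None or instance in scope
```

The check runs before a request is routed, so a key scoped to one instance can't probe the others.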