Examples
Simple local usage:
ollama pull llama3.1:70b
ollama pull llama3.1:8b
python3 cli.py exec \
  --orchestrator ollama:llama3.1:70b \
  --worker ollama:llama3.1:8b \
  --task "Summarize 10 news articles"
This runs a planner + worker flow fully locally.
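Before running the exec command above, you can confirm that both models were pulled successfully using the standard Ollama CLI (a quick sanity check, not part of llm-use itself):

# List locally available models; both llama3.1:70b and llama3.1:8b should appear.
ollama list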
Hybrid cloud + local usage:
export ANTHROPIC_API_KEY="sk-ant-..."
ollama pull llama3.1:8b
python3 cli.py exec \
  --orchestrator anthropic:claude-3-7-sonnet-20250219 \
  --worker ollama:llama3.1:8b \
  --task "Compare 5 products"
This routes tasks between a cloud provider model and a local worker.
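The same provider:model pattern should extend to the other cloud backends listed under "Why it matters". The sketch below is illustrative only: the openai: prefix and the gpt-4o model name are assumptions, not confirmed options.

# Hypothetical OpenAI orchestrator; provider prefix and model name are assumed.
export OPENAI_API_KEY="sk-..."
python3 cli.py exec \
  --orchestrator openai:gpt-4o \
  --worker ollama:llama3.1:8b \
  --task "Compare 5 products"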
TUI chat mode:
python3 cli.py chat \
  --orchestrator anthropic:claude-3 \
  --worker ollama:llama3.1:8b
This opens an interactive terminal chat with live logs and a cost breakdown.
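Chat mode takes the same provider:model arguments as exec, so it should also be able to run fully locally; a sketch under that assumption:

# Fully local chat; assumes chat mode accepts Ollama orchestrators like exec does.
python3 cli.py chat \
  --orchestrator ollama:llama3.1:70b \
  --worker ollama:llama3.1:8b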
Why it matters
• Orchestrate multiple LLMs (OpenAI, Anthropic, Ollama/llama.cpp) without writing custom routing logic.
• Smart routing and fallback: pick a better model for each task and fall back heuristically or based on what is learned over time.
• Cost tracking and session logs: see the cost of each run and keep the history locally.
• Optional scraping and caching: enrich tasks with real web data when needed.
• Optional MCP server integration: serve llm-use workflows via PolyMCP.
llm-use makes it easier to build robust, multi-model LLM systems without tying yourself to a single API or writing the orchestration by hand.