Hi HN,
Built llm-use: a lightweight Python toolkit for cost-efficient agent workflows that mix multiple LLMs.
Core pattern: a strong model (Claude/GPT-4o/big local) handles planning + synthesis; cheap/local workers handle parallel subtasks (research, scraping, summarization, extraction, …).
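In plain Python terms, the pattern boils down to something like this (my generic sketch of the idea, not llm-use's actual API; orchestrator and worker stand in for any LLM callables):

from concurrent.futures import ThreadPoolExecutor

def run(task, orchestrator, worker, max_workers=4):
    # Strong model plans, cheap workers fan out in parallel,
    # then the strong model synthesizes the results.
    subtasks = orchestrator(f"Split into subtasks: {task}").splitlines()
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(worker, subtasks))
    return orchestrator(f"Synthesize an answer to {task!r} from: {results}")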
Features:
• Mix Anthropic, OpenAI, Ollama, llama.cpp
• Smart router: cheap/local first, escalate only if needed (learned + heuristic; rough sketch after this list)
• Parallel workers (--max-workers)
• Real scraping + cache (BS4 or Playwright)
• Offline-first (full Ollama support)
• Cost tracking ($ for cloud, $0 for local)
• TUI chat + MCP server mode
• Local session logs
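The escalation idea behind the router bullet is roughly this (a toy sketch of the heuristic half; the names and thresholds are made up, not the shipped code):

def looks_sufficient(answer: str) -> bool:
    # Cheap quality proxies: non-trivial length, no hedging markers.
    hedges = ("i don't know", "i can't", "as an ai")
    return len(answer) > 50 and not any(h in answer.lower() for h in hedges)

def route(task: str, cheap_llm, strong_llm) -> str:
    draft = cheap_llm(task)
    if looks_sufficient(draft):
        return draft
    return strong_llm(task)  # escalate only when the cheap draft looks weak

The "learned" half would presumably swap looks_sufficient for a scorer trained on past escalations.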
Quick example (hybrid):
python3 cli.py exec \
--orchestrator anthropic:claude-3-7-sonnet-20250219 \
--worker ollama:llama3.1:8b \
--enable-scrape \
--task "Summarize 6 recent sources on post-quantum crypto"
Or the routed version:
python3 cli.py exec \
--router ollama:llama3.1:8b \
--orchestrator openai:o1 \
--worker openai:gpt-4o-mini \
--task "Explain recent macOS security updates"
MIT licensed, minimal deps, embeddable.
Repo: https://github.com/llm-use/llm-use
Feedback welcome on:
• Routing heuristics you’d find useful
• Pain points with agent costs / local vs cloud
• Integrations you're missing
Thanks!