I kept running into the same pattern: calling an LLM API thousands of times with the same prompt template, just swapping in different text. Classify this contract clause. Route this support ticket. Categorize this log line.
For teams handling contracts, patient records, or internal logs, sending that data to a third-party API isn't always an option. And at scale, you're paying per-token for what's essentially pattern matching.
So I built a CLI that trains a small local classifier from labeled examples. Give it 50 input/output pairs and it trains a ~230KB model on your machine; after that, inference runs locally with no network calls. Everything runs on Node.js - no Python, no GPU, no Docker.
Under the hood it uses all-MiniLM-L6-v2 for sentence embeddings (a one-time ~80MB download; runs locally after that) and trains a small neural network on top of your labels.
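To make the "small neural network on top" concrete, here's a minimal sketch of the idea - not the actual implementation, and all names and hyperparameters are made up. It trains a softmax (multinomial logistic) classifier over fixed embedding vectors; the tiny 3-dim vectors below stand in for real 384-dim MiniLM embeddings:

```typescript
// Sketch: a softmax classifier head trained on top of fixed sentence
// embeddings. Hypothetical names; vectors are toy stand-ins for MiniLM output.

type Example = { embedding: number[]; label: number };

function softmax(logits: number[]): number[] {
  const max = Math.max(...logits);
  const exps = logits.map((z) => Math.exp(z - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

class SoftmaxClassifier {
  weights: number[][]; // weights[c][d]: class c, embedding dimension d
  biases: number[];

  constructor(numClasses: number, dim: number) {
    this.weights = Array.from({ length: numClasses }, () => new Array(dim).fill(0));
    this.biases = new Array(numClasses).fill(0);
  }

  predictProba(embedding: number[]): number[] {
    const logits = this.weights.map(
      (w, c) => w.reduce((s, wi, d) => s + wi * embedding[d], this.biases[c]),
    );
    return softmax(logits);
  }

  // Plain gradient descent on cross-entropy loss, one example at a time.
  train(data: Example[], epochs = 200, lr = 0.5): void {
    for (let e = 0; e < epochs; e++) {
      for (const { embedding, label } of data) {
        const p = this.predictProba(embedding);
        for (let c = 0; c < this.weights.length; c++) {
          const grad = p[c] - (c === label ? 1 : 0); // dLoss/dLogit_c
          this.biases[c] -= lr * grad;
          for (let d = 0; d < embedding.length; d++) {
            this.weights[c][d] -= lr * grad * embedding[d];
          }
        }
      }
    }
  }

  predict(embedding: number[]): number {
    const p = this.predictProba(embedding);
    return p.indexOf(Math.max(...p));
  }
}

// Toy usage: two classes that point in different embedding directions.
const clf = new SoftmaxClassifier(2, 3);
clf.train([
  { embedding: [1, 0, 0], label: 0 },
  { embedding: [0.9, 0.1, 0], label: 0 },
  { embedding: [0, 0, 1], label: 1 },
  { embedding: [0.1, 0, 0.9], label: 1 },
]);
console.log(clf.predict([0.95, 0.05, 0])); // class 0
```

The appeal of this shape is that the embedding model does all the heavy lifting, so the trainable part stays tiny - which is how the saved model can be a few hundred KB.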
For topic/domain classification - where the categories cover genuinely different subject matter - I'm seeing 80-95% accuracy with 50 examples. It struggles with sentiment and tone (44-50%), because "amazing camera" and "terrible camera" produce nearly identical embedding vectors. I've documented this openly in the benchmarks.
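That failure mode is easy to check yourself: compare cosine similarities between embeddings. A sketch below - the vectors are made-up 4-dim stand-ins chosen to illustrate the pattern (real MiniLM embeddings are 384-dim, and the actual numbers will differ), but the shape of the result is what the benchmarks show:

```typescript
// Cosine similarity between two embedding vectors. Topic classification works
// because different topics point in different directions; flipping sentiment
// barely moves the vector, so similarity stays close to 1.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Illustrative toy vectors (NOT real MiniLM output):
const amazingCamera = [0.8, 0.5, 0.2, 0.1];  // "amazing camera"
const terribleCamera = [0.8, 0.5, 0.1, 0.2]; // "terrible camera" - nearly same direction
const bankingQuery = [0.1, 0.2, 0.9, 0.4];   // different topic entirely

console.log(cosineSimilarity(amazingCamera, terribleCamera).toFixed(2)); // 0.99
console.log(cosineSimilarity(amazingCamera, bankingQuery).toFixed(2));   // much lower
```

A linear classifier over embeddings can only separate what the embedding space separates, so topic splits are easy and sentiment splits are not.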
The benchmarks use real text from AG News (127K articles) and 20 Newsgroups (18K posts), with only 50 training samples drawn from each. The test harness and all fixture data are in the repo - clone it and run npx tsx tests/harness/run.ts to reproduce.
This isn't trying to replace LLMs. It's specifically for the repetitive classification tasks where the same prompt structure processes different data every time.
Open source, Apache 2.0. Still early. Curious whether anyone has tried similar embedding+classifier approaches for their own workflows, or if there's demand for multi-label classification.