frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: CriteriaBot – A Universal Customizable Classifier

https://criteriabot.io/
3•RoyalTnetennba•2h ago
I needed a classifier for nuanced, subjective buckets that fell outside of typical ML use-cases (e.g., "is this a spoiler?", "is this factually correct?", "is this user being mean?"). I ended up really happy with the architecture I built to solve it, so I rolled it out as a standalone API and service called CriteriaBot.

WHAT IT DOES:

You give it content and plain-English criteria. It gives you a true/false verdict on whether the content meets those criteria.

HOW IT WORKS:

In addition to a traditional classifier, the classification request is routed through a pool of small, open-weight LLMs to achieve a consensus verdict.

I built a pre-vote factorization machine that selects a sub-pool of LLMs optimized for signal strength based on the embedding of the subject/category. A second factorization machine then reads the votes and the embedding to arrive at a single verdict. That verdict is dynamically modified based on the user's history of agreement/disagreement with the models in semantically similar evaluations.

The models are also hooked up to Wikipedia and Wolfram to support edge cases requiring current information or mathematical grounding.

FINDINGS:

* With the same harness and sample set, Gemma 4 26B's accuracy is only ~1 percentage point below Opus 4.8.

* Pure oracle is theoretically very good - currently ~98% accuracy for the datasets. I'm using the second factorization machine as a combiner as it can theoretically push past oracle results, but it's an interesting fallback.

* The single most useful LLM surprised me - LFM2 24B contributes the most to the consensus, despite being the worst individually (of the current pool of LLMs). It correlates the least with the other models (perhaps due to its unique architecture?) which makes it a useful signal for some of the problems.

* The legal obligations of handling user-submitted images are... involved. I've disabled image support for non-me users while I sort that out (in case you were hoping to try out "Hotdog, Not Hotdog").

* Rails singularizes "criteria" as "criterium" and I didn't realize that was incorrect until it was kind of a lot of work to fix.

WHY I'M POSTING: I’d been dealing with burnout for a while, and getting this running has been incredibly rewarding. The majority of people in my personal life are non-technical so it's been hard to get reactions to it beyond "what is it?".

Would be thrilled with whatever honest feedback you have.

Show HN: machine0 – Persistent NixOS VMs You Control from the CLI

https://machine0.io
15•bwm•1h ago•2 comments

Show HN: Exploiting Slack's video embeds to achieve E2EE communication

https://v1c.rocks/log/exploiting-slack-video/
17•victorio•1h ago•2 comments

Show HN: Fata – Spaced repetition to fight skill rot from AI coding

https://fata.dev
3•djoume•4d ago•0 comments

Show HN: Kage – Shadow any website to a single binary for offline viewing

https://github.com/tamnd/kage
652•tamnd•1d ago•127 comments

Show HN: Micro Coach – an AI workout planner built by a former personal trainer

https://microcoachapp.com/
2•coreysutton7•40m ago•0 comments

Show HN: I wrote a C++ ray tracer from scratch without AI

https://github.com/themartiano/luz
131•martiano•8h ago•54 comments

Show HN: Nxui – Copy-paste animated UI components for Vue

https://nxui.geoql.in/docs/
8•vinayakkulkarni•4h ago•0 comments

Show HN: Track trending GitHub repos on HN

https://hn-github-reader.vercel.app/
4•vipulag•2h ago•0 comments

Show HN: Trace – Offline Mac meeting transcripts you can flag mid-call

https://traceapp.info
194•AG342•1d ago•77 comments

Show HN: Mood

https://mood.lasagna.pizza/
3•kinduff•2h ago•0 comments

Show HN: CriteriaBot – A Universal Customizable Classifier

https://criteriabot.io/
3•RoyalTnetennba•2h ago•0 comments

Show HN: Can Europe train a frontier AI model on the compute it owns?

https://github.com/sammysltd/euromesh
48•smashini•4h ago•60 comments

Show HN: Steal-a-GIF – A browser tool to export GIFs from locked down platforms

https://vorpus.github.io/steal-a-gif/
6•lizhang•3h ago•0 comments

Show HN: Discover Wikipedia articles popular on Hacker News

https://www.orangecrumbs.com/
131•octopus143•23h ago•29 comments

Show HN: PIM – app blocker that inconveniences you instead of locking you out

https://pleaseinconvenienceme.com
2•SMV279438•4h ago•1 comments

Show HN: 3D print Z reinforcement via injected loops

https://mgunlogson.github.io/magma/
65•mgunlogson•6d ago•40 comments

Show HN: Memorypad – A note editor for your daily notes, in Markdown

https://memorypad.io/
6•Malfunction92•6h ago•0 comments

Show HN: I am building a map of people who lived in the Roman Empire

https://new.roman-names.com/
203•metiscus•4d ago•47 comments

Show HN: AwsmAudio – a WebAudio editor with native MCP

https://audio.awsm.fun
7•dakom•11h ago•0 comments

Show HN: Paca – Lightweight Jira alternative for human-AI collaboration

https://github.com/Paca-AI/paca
168•pikann22•2d ago•60 comments

Show HN: X – A programming language with switchable memory modes

https://github.com/xdotxxx/x
4•x-xxx•9h ago•1 comments

Show HN: Homebrew 6.0.0

https://brew.sh/2026/06/11/homebrew-6.0.0/
1463•mikemcquaid•4d ago•360 comments

Show HN: I used Claude Mythos to build my startup in 1 day

https://www.brandlm.ai/
8•trungnx2605•11h ago•4 comments

Show HN: Philosophy for Kids

https://philosophy.ocaho.com/
14•rahimnathwani•23h ago•9 comments

Show HN: Dual YOLOv8n UAV Detection on RK3588S at 42 FPS Using NPU

https://github.com/alebal123bal/khadas_yolov8n_multithread
71•alebal123bal•1d ago•9 comments

Show HN: Putt.day a daily mini golf game

https://putt.day/
314•ellg•2d ago•110 comments

Show HN: Lightweight Task queue on Erlang/OTP, SQLite-backed, no overengineering

https://github.com/entGriff/ezra
74•ent1c3d•5d ago•17 comments

Show HN: Prela – A Compositional and Controllable Query Language

https://prela-lang.org
4•remywang•13h ago•0 comments

Show HN: Bastion – isolated Linux VMs for background coding agents

https://bastion.computer/
28•almostlit•1d ago•2 comments

Show HN: Dream Server – Turn your PC, Mac, or Linux box into a private AI server

https://github.com/Light-Heart-Labs/DreamServer
7•dreamserver•14h ago•0 comments