frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: CriteriaBot – A Universal Customizable Classifier

https://criteriabot.io/
2•RoyalTnetennba•1h ago
I needed a classifier for nuanced, subjective buckets that fell outside of typical ML use-cases (e.g., "is this a spoiler?", "is this factually correct?", "is this user being mean?"). I ended up really happy with the architecture I built to solve it, so I rolled it out as a standalone API and service called CriteriaBot.

WHAT IT DOES:

You give it content and plain-English criteria. It gives you a true/false verdict on whether the content meets those criteria.

HOW IT WORKS:

In addition to a traditional classifier, the classification request is routed through a pool of small, open-weight LLMs to achieve a consensus verdict.

I built a pre-vote factorization machine that selects a sub-pool of LLMs optimized for signal strength based on the embedding of the subject/category. A second factorization machine then reads the votes and the embedding to arrive at a single verdict. That verdict is dynamically modified based on the user's history of agreement/disagreement with the models in semantically similar evaluations.

The models are also hooked up to Wikipedia and Wolfram to support edge cases requiring current information or mathematical grounding.

FINDINGS:

* With the same harness and sample set, Gemma 4 26B's accuracy is only ~1 percentage point below Opus 4.8.

* Pure oracle is theoretically very good - currently ~98% accuracy for the datasets. I'm using the second factorization machine as a combiner as it can theoretically push past oracle results, but it's an interesting fallback.

* The single most useful LLM surprised me - LFM2 24B contributes the most to the consensus, despite being the worst individually (of the current pool of LLMs). It correlates the least with the other models (perhaps due to its unique architecture?) which makes it a useful signal for some of the problems.

* The legal obligations of handling user-submitted images are... involved. I've disabled image support for non-me users while I sort that out (in case you were hoping to try out "Hotdog, Not Hotdog").

* Rails singularizes "criteria" as "criterium" and I didn't realize that was incorrect until it was kind of a lot of work to fix.

WHY I'M POSTING: I’d been dealing with burnout for a while, and getting this running has been incredibly rewarding. The majority of people in my personal life are non-technical so it's been hard to get reactions to it beyond "what is it?".

Would be thrilled with whatever honest feedback you have.

Nextcloud: Public link share of a folder inside a Team folder ignores permission

https://github.com/nextcloud/groupfolders/issues/4752
1•alternatetwo•46s ago•1 comments

PRC-linked spies hid inside medical and military networks for more than a year

https://www.theregister.com/research/2026/06/15/google-says-prc-linked-spies-hid-in-medical-resea...
1•Bender•56s ago•0 comments

Microsoft site throwing warnings after someone forgot to renew cert

https://www.theregister.com/security/2026/06/15/microsoft-site-throwing-warnings-after-someone-fo...
1•Bender•1m ago•0 comments

Good news, we have extra time before the Sun ends life on Earth

https://arstechnica.com/science/2026/06/good-news-we-have-extra-time-before-the-sun-ends-life-on-...
1•Bender•2m ago•0 comments

I built open child-support calculators for 7 states, then found my own bugs

https://csg.tcblaw.org/
1•tcbmem•2m ago•0 comments

German Air Force chief names Russian targets NATO would hit in a war

https://www.yacnews.com/german-air-force-chief-names-russian-targets-nato-would-hit-in-a-war/
1•ortr•3m ago•0 comments

Russian attacks set fire to the 1000 year old Dormition Cathedral

https://www.yacnews.com/russian-attacks-set-fire-to-the-1-000-year-old-dormition-cathedral-at-kyi...
1•ortr•4m ago•0 comments

Show HN: Offline AI assistant for Android (PDFs, Wikipedia, more)

https://github.com/geograms/eva
1•nunobrito•6m ago•0 comments

First Steps Toward Automated AI Research

https://www.recursive.com/articles/first-steps-toward-automated-ai-research
1•gmays•7m ago•0 comments

Show HN: AI traders you author, argue with and coach

https://degen.strayforge.com
1•litlig•8m ago•0 comments

IBM and Norway's sovereign fund CEO: Is AI a bubble?

https://www.youtube.com/watch?v=IxqrSXiRja4
1•pandoro•9m ago•1 comments

The APLR(1) Algorithm Is Simpler and More Capable Than IELR(1)

https://branchtaken.com/reports/aplr1/aplr1
2•jasone•9m ago•0 comments

Did a medieval flying monk spot Halley's comet, twice? It's complicated

https://arstechnica.com/science/2026/06/did-a-medieval-flying-monk-spot-halleys-comet-twice-its-c...
1•Brajeshwar•9m ago•0 comments

Detecting and Steering Sycophancy in Qwen

https://palashtaneja.com/blog/posts/detecting-and-steering-sycophancy-in-qwen.html
1•737max•10m ago•0 comments

How the UK social media ban could affect you

https://www.bbc.com/newsround/articles/c802ypdxm4po
1•DropDead•10m ago•1 comments

We are living in the dial-up era of AI

https://www.xydac.com/blog/dial-up-era-of-ai/
1•xydac•11m ago•0 comments

Immutability Changes Everything (2016) [pdf]

https://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper16.pdf
1•let_rec•11m ago•0 comments

Show HN: I turned the Lex Fridman podcast archive into a browsable idea map

https://tlexdr.com/discover
1•bwdey•11m ago•0 comments

US says Trump, Vance and Iran's parliament speaker have signed deal to end war

https://www.reuters.com/world/iran-war-live-trump-says-us-tehran-have-reached-peace-deal-2026-06-15/
1•ktm5j•13m ago•1 comments

How to Measure WWDC

https://asymco.com/2026/06/08/how-to-measure-wwdc/
1•ndr42•13m ago•0 comments

Marc Andreessen on X: "SpaceX and the Sentient Sun " / X

https://twitter.com/pmarca/status/2066523671456579828
1•bilsbie•14m ago•0 comments

Is This the End of Political Islam?

https://www.nytimes.com/2026/06/15/magazine/political-islam-middle-east.html
1•JumpCrisscross•15m ago•1 comments

Mathematicians use Lean to verify proofs, whats the equivalent for patent claims

https://fearn.ai/newsletter/building-formalization-infrastructure-for-patents
1•marclave•16m ago•0 comments

Protect an MCP Server with an Authorization Server

https://fusionauth.io/blog/mcp-authorization-server
1•mooreds•17m ago•0 comments

Terraform Registry Is Down

https://status.hashicorp.com/incidents/01KV60Z6KMP2TGHVJYC87MK4CM
2•tellnes•17m ago•0 comments

Show HN: 0-0.io – Multiplayer browser football with server-authoritative physics

https://0-0.io/
1•mgc8•18m ago•0 comments

Patched Claude Code, now 2–8× faster ultracode workflow execution

https://github.com/Functio-AI/claude-go-brr
2•guidotrev•18m ago•2 comments

OpenAI wins dismissal of trade secret lawsuit by Musk's xAI

https://www.reuters.com/legal/litigation/openai-wins-dismissal-trade-secret-lawsuit-by-musks-xai-...
1•JumpCrisscross•18m ago•0 comments

Building an AI skill marketplace for GTM teams

https://newsletter.gtmengineering.ai/p/why-every-gtm-org-will-need-ai-skill
2•alexjl1226•23m ago•0 comments

Show HN: I built an open-source financial research terminal (SEC data and SQL)

https://terminal.tesseractanalytics.ai/gate
1•tessbi•23m ago•0 comments