frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Autofit2 – End-to-end pipeline for multilingual text classification

https://github.com/neospe/autofit2
11•leschak•1d ago
Hi HN, Stefan here. autofit2 is a project I have been using at my previous company and is now opensourced. It has been used extensively in automated text moderation, but can be applied to any text/document classification task. We had success modeling offensive texts in 20+ languages (cf. github.com/neospe/dataload for all the datasets).

It's an integrated pipeline for lightweight multilingual text classification, covering preprocessing, training, and evaluation. It implements SetFit, a few-shot learning technique that works well for low-data regimes (down to a few dozen examples), and offers high throughput on CPUs, since it's based on Sentence Transformers. Dependencies are kept lean, but of course PyTorch itself isn't exactly small.

autofit2 takes a base model and a JSON config as input, and outputs a TorchServe model archive as well as a model card. The model card includes any benchmarks you have for your task, self-consistency tests, estimated CO2 emissions of the finetune, as well as an entropy-based bias analysis. For the bias eval, small test corpora for 50 languages are included. It works best with my EAR (Entropy-based Attention Regularization) fork of Sentence Transformers.

Feedback is welcome.

Show HN: Smart model routing directly in Claude, Codex and Cursor

https://github.com/workweave/router
131•adchurch•6h ago•83 comments

Show HN: Autofit2 – End-to-end pipeline for multilingual text classification

https://github.com/neospe/autofit2
11•leschak•1d ago•0 comments

Show HN: WebBase-III – dBASE III rebuilt in the browser with its own interpreter

https://github.com/DDecoene/WebBaseIII
76•ddecoene•2d ago•26 comments

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion

https://github.com/inkeep/open-knowledge
360•engomez•1d ago•168 comments

Show HN: I Derived a Steak

https://www.absurdlyoptimized.com/recipes/grilled-meats/
4•bkazez•2h ago•0 comments

Show HN: Mantis, A self-hosted LLM gateway

https://github.com/mantis-llm-gateway
5•rizsyed1•4h ago•0 comments

Show HN: Overfitted a 900KB Transformer to Compress a 100MB CSV into 7MB

97•spidy__•3d ago•60 comments

Show HN: Chess-Inspired Roguelike

https://princechazz.com
423•cowboy_henk•5d ago•146 comments

Show HN: I made Google Trends for Hacker News by indexing 18 years of comments

https://hackernewstrends.com
761•ytkimirti•1d ago•151 comments

Show HN: Closing the public-key authenticity gap in our E2EE social network

https://mosslet.com/blog/articles/19
2•mosspigletdev•5h ago•0 comments

Show HN: AgentBrush – Your coding agent's missing tool: image generation

https://agentbrush.dev/
4•Yan4300•5h ago•0 comments

Show HN: I built a hardware quantum RNG and wired it into a Magic 8-Ball

https://dnhkng.github.io/posts/building-the-beam-universe-splitter/
10•dnhkng•5h ago•0 comments

Show HN: Turn native language audio into flashcards and shadowing practice

https://lingochunk.com/try
87•alder•1d ago•36 comments

Show HN: MiniPCs.zip – Charting the Pareto frontier of Mini PCs

https://minipcs.zip
113•yathern•6d ago•46 comments

Show HN: A map of every UK railway, including stations that no longer exist

https://trainmap.co.uk/map.html
2•optionalltd•5h ago•0 comments

Show HN: TBD, a Mac-native CLI-forward coding agent multiplexer

https://github.com/cheapsteak/tbd
4•cheapsteak•6h ago•0 comments

Show HN: Puzzle with Strangers. A free multiplayer jigsaw

https://endtime-instruments.org/puzzle/
4•janoelze•6h ago•4 comments

Show HN: Brain Frog – Can you be random enough for 11 lines of JavaScript?

https://brainfrog.lol
48•AlexanderZ•1w ago•44 comments

Show HN: Bible as RAG Database

https://www.crosscanon.com/
152•jacksonastone•1d ago•90 comments

Show HN: Jargo – a Golang port of Pipecat for conversational-AI apps

https://github.com/gojargo/jargo
5•fallais•10h ago•0 comments

Show HN: `uvx ptn` and expose any system to agents (dangerously)

https://pypi.org/project/ptn/
3•yxl448•7h ago•0 comments

Show HN: StartupsBR – A map of Brazilian startups

https://www.startupsbr.com/sao-paulo
54•leonagano•1w ago•26 comments

Show HN: ZeroGate – API gateway to scale cloud GPUs to zero when idle

https://github.com/noah-garner/zerogate
5•ngarner•9h ago•0 comments

Show HN: Monolisa v3 – a typeface for developers and creatives

https://www.monolisa.dev/
189•bebraw•4d ago•92 comments

Show HN: I built a tool that matches Founders to VCs based on their pitch deck

https://investormatch.pro
3•MartinTobias•10h ago•0 comments

Show HN: Ponder – the best articles and essays on the internet

https://www.readponder.com/
3•wingdiction•10h ago•2 comments

Show HN: Motif Atlas – recurring patterns behind complex systems

https://nikitph.github.io/motifs/
8•loaderchips•15h ago•2 comments

Show HN: Nimic – Pure Python as a systems language with AOT compilation

https://github.com/dima-quant/nimic
42•dima-quant•3d ago•27 comments

Show HN: Wordit – Change One Letter, Keep the Chain Going

https://victorribeiro.com/wordit/
43•atum47•3d ago•28 comments

Show HN: Nub – A Bun-like all-in-one toolkit for Node.js

https://github.com/nubjs/nub
271•colinmcd•2d ago•80 comments