frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
539•klaussilveira•9h ago•150 comments

The Waymo World Model

https://waymo.com/blog/2026/02/the-waymo-world-model-a-new-frontier-for-autonomous-driving-simula...
865•xnx•15h ago•525 comments

How we made geo joins 400× faster with H3 indexes

https://floedb.ai/blog/how-we-made-geo-joins-400-faster-with-h3-indexes
73•matheusalmeida•1d ago•15 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
185•isitcontent•10h ago•21 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty
186•dmpetrov•10h ago•82 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
296•vecti•12h ago•132 comments

Dark Alley Mathematics

https://blog.szczepan.org/blog/three-points/
72•quibono•4d ago•15 comments

Microsoft open-sources LiteBox, a security-focused library OS

https://github.com/microsoft/litebox
346•aktau•16h ago•168 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/
341•ostacke•15h ago•90 comments

Hackers (1995) Animated Experience

https://hackers-1995.vercel.app/
437•todsacerdoti•17h ago•226 comments

Unseen Footage of Atari Battlezone Arcade Cabinet Production

https://arcadeblogger.com/2026/02/02/unseen-footage-of-atari-battlezone-cabinet-production/
8•videotopia•3d ago•0 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
240•eljojo•12h ago•147 comments

What Is Ruliology?

https://writings.stephenwolfram.com/2026/01/what-is-ruliology/
4•helloplanets•4d ago•0 comments

Delimited Continuations vs. Lwt for Threads

https://mirageos.org/blog/delimcc-vs-lwt
15•romes•4d ago•2 comments

PC Floppy Copy Protection: Vault Prolok

https://martypc.blogspot.com/2024/09/pc-floppy-copy-protection-vault-prolok.html
43•kmm•4d ago•3 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/
378•lstoll•16h ago•253 comments

How to effectively write quality code with AI

https://heidenstedt.org/posts/2026/how-to-effectively-write-quality-code-with-ai/
222•i5heu•12h ago•166 comments

Show HN: ARM64 Android Dev Kit

https://github.com/denuoweb/ARM64-ADK
14•denuoweb•1d ago•2 comments

Why I Joined OpenAI

https://www.brendangregg.com/blog/2026-02-07/why-i-joined-openai.html
94•SerCe•5h ago•77 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
62•phreda4•9h ago•11 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
162•limoce•3d ago•82 comments

I spent 5 years in DevOps – Solutions engineering gave me what I was missing

https://infisical.com/blog/devops-to-solutions-engineering
128•vmatsiiako•14h ago•55 comments

Introducing the Developer Knowledge API and MCP Server

https://developers.googleblog.com/introducing-the-developer-knowledge-api-and-mcp-server/
38•gfortaine•7h ago•11 comments

Zlob.h 100% POSIX and glibc compatible globbing lib that is faste and better

https://github.com/dmtrKovalenko/zlob
6•neogoose•2h ago•2 comments

Understanding Neural Network, Visually

https://visualrambling.space/neural-network/
261•surprisetalk•3d ago•35 comments

Female Asian Elephant Calf Born at the Smithsonian National Zoo

https://www.si.edu/newsdesk/releases/female-asian-elephant-calf-born-smithsonians-national-zoo-an...
18•gmays•5h ago•2 comments

I now assume that all ads on Apple news are scams

https://kirkville.com/i-now-assume-that-all-ads-on-apple-news-are-scams/
1030•cdrnsf•19h ago•428 comments

FORTH? Really!?

https://rescrv.net/w/2026/02/06/associative
55•rescrv•17h ago•19 comments

Show HN: Smooth CLI – Token-efficient browser for AI agents

https://docs.smooth.sh/cli/overview
84•antves•1d ago•60 comments

WebView performance significantly slower than PWA

https://issues.chromium.org/issues/40817676
19•denysonique•6h ago•2 comments
Open in hackernews

Adventures in Imbalanced Learning and Class Weight

http://andersource.dev/2025/05/05/imbalanced-learning.html
49•andersource•9mo ago

Comments

ipunchghosts•9mo ago
I read the article and the take away is that class weights and stratified sampling did not help for the OPs problem.
lamename•9mo ago
Nice writeup. F1, balanced accuracy, etc. In truth it depends on your problem and what a practical "best" solution is, especially in imbalanced scenarios, but Matthews Correlation Coefficient (MCC) is probably the best comprehensive and balanced blind go-to metric, because it guarantees that more portions of the confusion matrix are good [0,1].

I made a quick interactive, graphical exploration to demonstrate this in python [2].

[0]: https://biodatamining.biomedcentral.com/articles/10.1186/s13...

[1]: https://biodatamining.biomedcentral.com/articles/10.1186/s13...

[2]: https://www.glidergrid.xyz/post-archive/understanding-the-ro...

klysm•9mo ago
MCC also generalizes to multi-class well. I wish it had a better name though. It seems like F1 score has better marketing
andersource•9mo ago
Really neat visualization! And thanks for the tip on MCC.

Out of curiosity I plugged it to the same visualization (performance vs. class weight when optimized with BCE) and it behaves similar to F1, i.e. best without weighting.

gitroom•9mo ago
Insane how tricky imbalanced stuff gets- I always end up second guessing my metrics tbh. You think there's ever such thing as a "right" number for real world junk or is it just endless tradeoffs?
andersource•9mo ago
Yeah it gets tricky. I think eventually it has to be about tradeoffs - no ML system can be 100% correct. I do think there's a "right" decision (up to a point) in the context of the product or business.
zai_nabasif1234•9mo ago
[5/11, 1:41 PM] Meta AI: Imbalanced learning refers to the challenge of training machine learning models on datasets where the classes are not represented equally. This can lead to biased models that perform well on the majority class but poorly on the minority class.

Class Weight Class weight is a technique used to address class imbalance by assigning different weights to each class during training. The idea is to give more weight to the minority class and less weight to the majority class.

Key Considerations 1. Class weight calculation: Weights can be calculated based on class frequency or other heuristics. 2. Hyperparameter tuning: Class weights can be tuned as hyperparameters during model training. 3. Evaluation metrics: Metrics like F1-score, precision, and recall are often used to evaluate model performance on imbalanced datasets.

Techniques 1. Oversampling: Oversampling the minority class to balance the dataset. 2. Undersampling: Undersampling the majority class to balance the dataset. 3. SMOTE: Synthetic Minority Over-sampling Technique (SMOTE) generates synthetic samples of the minority class.

Applications 1. Fraud detection: Imbalanced learning is crucial in fraud detection, where the minority class (fraudulent transactions) is often much smaller than the majority class (legitimate transactions). 2. [5/11, 1:41 PM] Meta AI: Imbalanced learning refers to the challenge of training machine learning models on datasets where the classes are not represented equally. This can lead to biased models that perform well on the majority class but poorly on the minority class.

Class Weight Class weight is a technique used to address class imbalance by assigning different weights to each class during training. The idea is to give more weight to the minority class and less weight to the majority class.

Key Considerations 1. Class weight calculation: Weights can be calculated based on class frequency or other heuristics. 2. Hyperparameter tuning: Class weights can be tuned as hyperparameters during model training. 3. Evaluation metrics: Metrics like F1-score, precision, and recall are often used to evaluate model performance on imbalanced datasets.

Techniques 1. Oversampling: Oversampling the minority class to balance the dataset. 2. Undersampling: Undersampling the majority class to balance the dataset. 3. SMOTE: Synthetic Minority Over-sampling Technique (SMOTE) generates synthetic samples of the minority class.

Applications 1. Fraud detection: Imbalanced learning is crucial in fraud detection, where the minority class (fraudulent transactions) is often much smaller than the majority class (legitimate transactions). 2. Medical diagnosis: Imbalanced learning can be applied to medical diagnosis, where the minority class (diseased patients) may be much smaller than the majority class (healthy patients).

Would you like to know more about imbalanced learning or class weight?

bbstats•9mo ago
The only thing that matters is your estimation of how the balance will change out of distribution or with future data etc