frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Endgame – Production-aware ML under the sklearn API

1•cameronhamilton•1h ago
Most ML frameworks optimize for leaderboard accuracy. But in finance and healthcare, accuracy is often the least interesting part of the system. If you can’t explain a prediction, you can’t deploy it. If your probabilities aren’t calibrated, you can’t trust them. If your pipeline doesn’t enforce constraints, you can’t ship it. I built Endgame after repeatedly running into that gap in production.

Anti-money laundering (banks) Early in my career, I was hired to improve an anti-money laundering system. The incumbent model was 28 hard-coded rules. If enough thresholds fired (e.g., $3,000 ATM withdrawals over 30 days), the account was flagged. No one knew where the thresholds came from. There was no modeling of the underlying behavior. Just rule accumulation. I convinced the bank to provide the raw financial features behind those rule firings. We trained an interpretable ML model directly on the underlying activity patterns. The result: ~200% more true positives (accounts actually involved in fraud or laundering). But what leadership cared about most wasn’t the metric. It was this: “Why is this account suspicious?” That theme repeated across industries.

Insurance claim adjudication I later built a claim adjudication model for a major health insurer. The legacy system was massive, brittle, and effectively a black box. It would frequently deny claims incorrectly, and no one fully understood how it worked. We built a new ML system that brought claim-level adjudication accuracy to ~95%. Again, the metric wasn’t the headline internally. The headline was: “Why did this claim get denied?” In regulated environments, interpretability isn’t optional.

Stock forecasting and calibration I also learned this lesson personally. I built stock-forecasting models that performed well in historical backtests. Some predictions showed 80% probability of a price increase. Then the market regime shifted. The probabilities were overconfident. Some trades went the opposite direction. I lost money. Accuracy ≠ trustworthy probabilities. Calibration and drift awareness matter far more in deployment than most tutorials suggest. That experience fundamentally changed how I think about ML systems.

The core idea Endgame is my attempt to encode those lessons into a framework. It’s not trying to replace scikit-learn. Every estimator implements fit / predict / transform. But it extends the ecosystem with: Glass-box models (EBM, GAM, CORELS, SLIM, GOSDT, etc.) SOTA deep tabular models (FT-Transformer, TabPFN, SAINT, etc.) Conformal prediction and Venn-ABERS calibration Deployment guardrails (leakage detection, latency constraints, drift checks) 42 self-contained HTML visualizations Super Learner, BMA, cascade ensembles A full AutoML pipeline that respects deployment constraints All under a unified sklearn-compatible API.

Agent-native ML (MCP) We’re in the agentic AI era. You can ask an LLM to build a pipeline for you, but it often requires multiple prompts and manual corrections. Endgame ships with a native MCP server. This lets agents: load data train models compare results generate reports export reproducible scripts Through structured tool calls, not fragile prompt chains. My belief is that ML pipelines will increasingly become conversational infrastructure.

A small contrarian view The ML community is underestimating the problems left to solve in tabular data and overestimating the demand for accuracy-optimized models. Most real-world data in business, healthcare, and finance is tabular (often multimodal). And most real-world systems need to be interpretable, calibrated, and deployable — not just accurate.

Endgame v1.0.0 is open source (Apache 2.0). Python 3.10+. If you work on production ML systems, especially in regulated domains, I’d genuinely value feedback. GitHub: https://github.com/allianceai/endgame Install: pip install endgame-ml Happy to answer technical questions.

Show HN: I built a human rights evaluator for HN (content vs. site behavior)

https://observatory.unratified.org
1•9wzYQbTYsAIc•42s ago•0 comments

The Supreme Court doesn't care if you want to copyright your AI-generated art

https://www.engadget.com/ai/the-supreme-court-doesnt-care-if-you-want-to-copyright-your-ai-genera...
1•latexr•2m ago•0 comments

Google Chrome switches to two-week release cycle

https://developer.chrome.com/blog/chrome-two-week-release
1•mkurz•4m ago•0 comments

ChatGPT Health 'under-triaged' half of medical emergencies in a new study

https://www.nbcnews.com/health/health-news/chatgpt-health-under-triaged-half-medical-emergencies-...
1•0in•5m ago•0 comments

Universal-3 Pro Streaming

https://www.assemblyai.com/universal-3-pro-streaming
1•handfuloflight•9m ago•0 comments

Show HN: Dracula-AI – A lightweight, async SQLite-backed Gemini wrapper

https://github.com/suleymanibis0/dracula
1•suleymanibis•10m ago•0 comments

Show HN: Sovereign Trace Stamp – Frozen triple-time cryptographic timestamp

https://github.com/AionSystem/AION-BRAIN/tree/main/projects/sovereign-trace
1•sheldonksalmon•13m ago•0 comments

Cancel ChatGPT AI boycott surges after OpenAI pentagon military deal

https://www.euronews.com/next/2026/03/02/cancel-chatgpt-ai-boycott-surges-after-openai-pentagon-m...
6•nothrowaways•16m ago•0 comments

Show HN: Demarkus – De-centralized Markup for Us:memory for AI agents and humans

https://github.com/latebit-io/demarkus
1•ontehfritz•18m ago•0 comments

Quantifying the Swiss Marriage Tax

https://gendx.dev/blog/2026/03/02/swiss-marriage-tax.html
1•birdculture•21m ago•0 comments

Migrating 11,000 JavaScript files to TypeScript over 7 years at Patreon

https://www.patreon.com/posts/seven-years-to-typescript-152144830
2•satvikpendem•22m ago•0 comments

Engineering over Enforcement (2023)

https://www.contraption.co/engineering-over-enforcement/
1•mooreds•23m ago•0 comments

The jellyfish knows how to survive uncertain times

https://herbertlui.net/the-jellyfish-knows-how-to-survive-uncertain-times/
1•herbertl•25m ago•0 comments

Obama is right about aliens

https://www.doomsdayscenario.co/p/obama-is-right-about-aliens
1•mooreds•28m ago•0 comments

Life in the Endless Scroll: What We're Losing

https://www.medscape.com/viewarticle/life-endless-scroll-what-were-losing-2026a10005od
2•wjb3•28m ago•0 comments

Zen of AI Coding

https://nonstructured.com/zen-of-ai-coding/
2•vinhnx•28m ago•0 comments

Intent-Based Access Control (IBAC) – FGA for AI Agent Permissions

https://ibac.dev
1•ERROR_0x06•29m ago•0 comments

Moving to 199-day validity for public TLS certificates

https://knowledge.digicert.com/alerts/public-tls-certificates-199-day-validity
1•thread_id•32m ago•0 comments

The Wealth of Wall Street with Oren Cass [video]

https://www.youtube.com/watch?v=SL2aA8cgIB8
1•mooreds•33m ago•0 comments

Eurosky.social accounts – launching early February

https://www.eurosky.tech/register
2•doener•34m ago•0 comments

Spain says we have the necessary resources to contain US trade embargo

https://www.marketscreener.com/news/spanish-government-on-trump-threat-to-cut-trade-we-have-the-n...
3•rguiscard•34m ago•2 comments

Four Decades of Inquiry into the Genetic Bases of Specific Reading Disability

https://pubs.asha.org/doi/epdf/10.1044/2025_JSLHR-25-00050
2•wjb3•34m ago•0 comments

Show HN: Finqual – Free SEC-based API for fundamentals, insider and 13F data

https://finqual.app/
1•myztika•35m ago•0 comments

Reverse engineering "Hello World" in QuickBASIC 3.0

https://marnetto.net/2026/03/01/brun-hello-world
1•avadodin•35m ago•0 comments

Ask HN: Are you running a free product (pre-revenue)?

1•LeanVibe•36m ago•0 comments

Interactive Fiction Theory and Criticism

https://the-rosebush.com/
1•agnishom•36m ago•0 comments

The evolution of background job frameworks in Ruby

https://riverqueue.com/blog/ruby-queue-history
2•thunderbong•36m ago•0 comments

Tunesia authoritative nameservers for .tn are down

https://www.google.com/
2•NoahZuniga•37m ago•1 comments

"We have made the decision to permanently shut down Highguard."

https://twitter.com/PlayHighguard/status/2028923492125819287
1•minimaxir•37m ago•0 comments

The missing piece for AI coding agents

https://www.buildbuddy.io/blog/remote-bazel-with-agents/
2•jshchnz•37m ago•0 comments