frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: I got Stability AI's small audio model into a consumer iOS app

1•kilojules•57s ago•0 comments

RFS: Cheap AI lawyer for company footer policies

1•lizagub•1m ago•0 comments

New Book:Forward Deployed Engineer Rule Book

https://www.amazon.com/dp/B0FXSZW5HB
2•STEM-AI-FDE•4m ago•0 comments

CyberSlop – meet the new threat actor, MIT and Safe Security

https://doublepulsar.com/cyberslop-meet-the-new-threat-actor-mit-and-safe-security-d250d19d02a4
2•chillax•7m ago•0 comments

Theory of Self-Reproducing Automata [pdf]

https://cba.mit.edu/events/03.11.ASE/docs/VonNeumann.pdf
2•andsoitis•8m ago•0 comments

Show HN: Bet on Auction Results with PolyBets

https://polybets.fun/
1•h100ker•8m ago•0 comments

Live Translation on AirPods Expands to the EU (IE)

https://www.apple.com/ie/newsroom/2025/11/live-translation-on-airpods-expands-to-the-eu/
1•rbanffy•9m ago•0 comments

BBC on Gaza-Israel: One Story, Double Standards

https://cfmm.org.uk/bbc-on-gaza-israel-one-story-double-standards/
1•stuaxo•10m ago•0 comments

Nvidia's H100 GPU Takes AI Processing to Space

https://spectrum.ieee.org/nvidia-h100-space
1•rbanffy•12m ago•1 comments

UK outperforms US in creating unicorns from early stage VC investment

https://www.cityam.com/uk-outperforms-us-in-creating-unicorns-from-early-stage-vc-investment/
4•mmarian•12m ago•1 comments

ClusterMAX 2.0: The Industry Standard GPU Cloud Rating System

https://newsletter.semianalysis.com/p/clustermax-20-the-industry-standard
2•MasterScrat•12m ago•0 comments

How to lead products through layoff fear

https://www.mindtheproduct.com/scary-times-how-to-lead-through-layoff-fear/
1•mooreds•13m ago•0 comments

Self-Replicating Probes Could Be Operating in the Solar System

https://www.universetoday.com/articles/self-replicating-probes-could-be-operating-right-now-in-th...
3•rbanffy•18m ago•0 comments

Antropocene

https://feralatlas.org
1•RoyBean•20m ago•0 comments

II. Leaflet of the White Rose

https://www.weisse-rose-stiftung.de/white-rose-resistance-group/leaflets-of-the-white-rose/ii-lea...
1•breppp•20m ago•0 comments

One porn platform made millions suing its viewers

https://www.theguardian.com/society/ng-interactive/2025/nov/04/strike-3-porn-copyright-lawsuits
3•belter•20m ago•0 comments

NASA instrument arrives at ISS to demonstrate quantum entanglement

https://spaceandtelescope.com/nasa-instrument-arrives-at-iss-to-demonstrate-quantum-entanglement/
1•belter•21m ago•0 comments

How Tiles Works – Tiles Privacy

https://www.blog.tiles.run/p/how-tiles-works
1•_feynon•23m ago•0 comments

Ask HN: Can people please stop commenting on whether a submission is AI?

1•AndrewDucker•23m ago•3 comments

Creating a New Embedded Rust Projects for NXP LPC55S69

https://mcuoneclipse.com/2025/10/19/creating-an-new-embedded-rust-projects-for-nxp-lpc55s69/
1•hasheddan•23m ago•0 comments

Satisfying Bazel's relative paths requirement in C++ toolchains

https://pigweed.dev/blog/09-bazel-relative-toolchain-paths.html
1•kaycebasques•24m ago•0 comments

OpenAI Wants Federal Backstop for New Investments [video]

https://www.wsj.com/video/openai-wants-federal-backstop-for-new-investments/4F6C864C-7332-448B-A9...
2•mkhattab•25m ago•0 comments

Bombshell report exposes how Meta relied on scam ad profits to fund AI

https://arstechnica.com/tech-policy/2025/11/bombshell-report-exposes-how-meta-relied-on-scam-ad-p...
3•duxup•25m ago•1 comments

What Did Medieval Peasants Know? (2022)

https://www.theatlantic.com/health/archive/2022/05/medieval-history-peasant-life-work/629783/
1•thinkingemote•26m ago•0 comments

Cursor – Sixty days with the AI coding startup

https://joincolossus.com/article/inside-cursor/
1•davidgomes•26m ago•0 comments

JanitorBench: A new LLM benchmark for multi-turn chats

https://about.janitorai.com/
17•shep101•26m ago•2 comments

Lightstep is shutting down March 1, 2026

https://docs.lightstep.com/changelog/eol-notice
4•acid__•29m ago•1 comments

Ford Considers Scrapping Electric Version of F-150 Truck

https://www.wsj.com/business/autos/ford-150-lightning-ev-decision-89dc0d84
8•JumpCrisscross•31m ago•0 comments

Show HN: Deepcon – Get the most accurate context for coding agents

https://deepcon.ai
3•ethanpark•33m ago•1 comments

New court docs put Sam Altman's honesty in spotlight again

https://www.fastcompany.com/91436157/court-filing-sam-altman-openai-anthropic
2•teachrdan•33m ago•0 comments
Open in hackernews

Show HN: TabPFN-2.5 – SOTA foundation model for tabular data

https://priorlabs.ai/technical-reports/tabpfn-2-5-model-report
39•onasta•1h ago
I am excited to announce the release of TabPFN-2.5, our tabular foundation model that now scales to datasets of up to 50,000 samples and 2,000 features - a 5x increase from TabPFN v2, published in the Nature journal earlier this year. TabPFN-2.5 delivers state-of-the-art predictions in one forward pass without hyperparameter tuning across classification and regression tasks.

What’s new in 2.5: TabPFN-2.5 maintains the core approach of v2 - a pretrained transformer trained on more than hundred million synthetic datasets to perform in-context learning and output a predictive distribution for the test data. It natively supports missing values, cateogrical features, text and numerical features is robust to outliers and uninformative features.

The major improvements:

- 5x scale increase: Now handles 50,000 samples × 2,000 features (up from 10,000 × 500 in v2)

- SOTA performance: TabPFN-2.5 outperforms tuned tree-based methods and matches the performance of a complex ensemble (AutoGluon 1.4), that itself includes TabPFN v2, tuned for 4 hours. Tuning the model improves performance, outperforming AutoGluon 1.4 for regression tasks.

- Rebuilt API: New REST interface along with Python SDK with dedicated fit & predict endpoints, making deployment and integration more developer-friendly

- A distillation engine that converts TabPFN-2.5 into a compact MLP or tree ensemble while preserving accuracy and offer low latency inference.

There are still some limitations. The model is designed for datasets up to 50K samples. It can handle larger datasets but that hasn’t been our focus with TabPFN-2.5. The distillation engine is not yet available through the API but only through licenses (though we do show the performance in the model report).

We’re actively working on removing these limitations and intend to release newer models focused on context reasoning, causal inference, graph networks, larger data and time-series. TabPFN-2.5 is available via API and a package on Hugging Face. Would love for you to try it and give us your feedback!

Model report: https://priorlabs.ai/technical-reports/tabpfn-2-5-model-repo...

Package: https://github.com/PriorLabs/TabPFN

Client: https://github.com/PriorLabs/tabpfn-client

Docs: https://docs.priorlabs.ai/quickstart

Comments

klemens_floege•1h ago
Good stuff!
zurfer•1h ago
The current go to solution for the kinds of problems that TabPFN is solving would be something like XGBoost. In general it's a good baseline, but the challenge is always that you need to spend a lot of time feature engineering and tweaking the data representation before something like XGBoost can deliver good performance on your regression or classification problems.

For me the promise of foundation models for tabular data is that there are enough generalizable patterns, so that you need less manual feature engineering and data cleaning.

And kudos to the team, I think it's a really creative application of neural networks. I was always frustrated with neural networks, since they were hard to tune on "structured" data and always under-performed (for me), but we also never had real foundational models for structured data.

noahho•3m ago
Less feature engineering is definitely something we are aiming for. The current version is actually only based on statistics, the real world connections between features is something we're working on right now and hope to show results for soon. That's the next step
dill_1•48m ago
Tabular data is still underrated!
noahho•6m ago
When we released TabPFNv1 over three years ago, I didn’t expect at all the hundreds of comments and reposts we would see. Tabular data had been a field getting little love from AI research—but we immediately felt that this was a topic that data scientists, scientists, financial analysts, and enterprise users deeply cared about. Glad its useful to people!
abracos•44m ago
how does it compare to automl tools?
noahho•34m ago
TabPFN-2.5 default (one forward pass) matches AutoGluon 1.4 tuned for four-hours. Autogluon is the strongest AutoML including stacking of XGB and cat boost and even includes the previous TabPFNv2.
TheTaytay•2m ago
Looks really cool. In reading through the FAQ, it says this: Q: "How are text features handled?" A: "In the local package version text features are encoded as categoricals without considering their semantic meaning. Our API automatically detects text features and includes their semantic meaning into our prediction. The local package version encodes text as numerical categories and does not include semantic meaning."

So that means that automatic embedding/semantic meaning is reserved for API use of TabPFN, right? Otherwise, if I use it locally, it's going to assign each of my distinct text values an arbitrary int, right?