Tabular Foundation Models Still Need One Thing: Multi-Table Aggregation

https://wesmadrigal.github.io/GraphReduce/tutorial_auto_feature_engineering/
1•madman2890•2h ago

Comments

madman2890•2h ago
Tabular foundation models like TabPFN and related work are extremely promising. They’re starting to show strong results on many classical tabular ML benchmarks and can reduce the amount of manual modeling work required from data scientists.

However, there is a structural reality of enterprise data that these models don’t remove. The data behind most real-world machine learning problems is not stored in a single clean table. Instead it lives across dozens or hundreds of relational tables: orders, customers, events, transactions, shipments, products, logs, etc. Each table captures part of the signal, often with one-to-many relationships, time dependencies, and high-cardinality entities.

Before any tabular model can be trained, those signals have to be integrated. In practice this means:

- Traversing relational graphs of tables
- Aggregating child tables to parent entities
- Handling time windows and temporal leakage
- Collapsing many-to-many relationships into meaningful features
- Producing a single wide training dataset

This step is usually the most time-consuming part of the entire ML workflow. Even if the model itself becomes automated via a tabular foundation model, the data still has to be prepared.

This is where GraphReduce comes in. GraphReduce treats the relational database as a graph of entities and relationships. Instead of manually writing large SQL pipelines, the user defines the nodes (tables) and their relationships. GraphReduce then walks the graph and performs the required aggregations automatically, generating a single training dataset.
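To make the aggregation step in the comment above concrete, here is a minimal sketch in plain Python of rolling a child table (orders) up to a parent entity (customers) inside a time window that ends at a cutoff date. It is not GraphReduce's API. The table contents, the column names, and the `aggregate_children` helper are all hypothetical and exist only for illustration.

```python
from collections import defaultdict
from datetime import date, timedelta

# Hypothetical toy data: a parent table (customers) and a child table (orders).
customers = [
    {"customer_id": 1, "signup_date": date(2024, 1, 5)},
    {"customer_id": 2, "signup_date": date(2024, 2, 10)},
]
orders = [
    {"order_id": 10, "customer_id": 1, "amount": 20.0, "ts": date(2024, 3, 1)},
    {"order_id": 11, "customer_id": 1, "amount": 35.0, "ts": date(2024, 3, 20)},
    {"order_id": 12, "customer_id": 1, "amount": 50.0, "ts": date(2024, 4, 15)},  # after the cutoff
    {"order_id": 13, "customer_id": 2, "amount": 15.0, "ts": date(2023, 11, 1)},  # before the window
]


def aggregate_children(parents, children, key, cutoff, window_days):
    """Roll child rows up to one feature row per parent (illustrative helper).

    Only child rows with cutoff - window_days <= ts < cutoff are used, so
    nothing dated on or after the cutoff leaks into the features.
    """
    window_start = cutoff - timedelta(days=window_days)
    stats = defaultdict(lambda: {"count": 0, "total": 0.0})
    for row in children:
        if window_start <= row["ts"] < cutoff:
            s = stats[row[key]]
            s["count"] += 1
            s["total"] += row["amount"]

    # Left-join the aggregates back onto the parent rows; parents with no
    # child rows in the window get zero-valued features.
    wide = []
    for parent in parents:
        s = stats.get(parent[key], {"count": 0, "total": 0.0})
        wide.append({**parent, "order_count": s["count"], "order_total": s["total"]})
    return wide


features = aggregate_children(
    customers, orders, key="customer_id", cutoff=date(2024, 4, 1), window_days=90
)
for row in features:
    print(row)
```

With a real schema, the same pattern would repeat for every child table and every level of the table graph. That repetition is the manual work the comment says GraphReduce is meant to automate.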

Why a Robotics Startup's Failure Proves the Robot Starts with the Actuator

https://www.firgelli.com/blogs/news/k-scale-labs-failure-proves-robot-starts-with-actuator
1•RobbieDickson•2m ago•0 comments

Relicensing with AI-Assisted Rewrite

https://tuananh.net/2026/03/05/relicensing-with-ai-assisted-rewrite/
1•tuananh•8m ago•0 comments

NHS official pushed to add patient data to Palantir while advising company

https://www.ft.com/content/6c548670-0f3e-45f1-ba08-8bb6dd152af5
2•Jimmc414•11m ago•0 comments

Large-Scale Agentic RL for CUDA Kernel Generation

https://cuda-agent.github.io/
1•gmays•16m ago•0 comments

Ask HN: Seeing More Techcrunch on Frontpage?

1•par•16m ago•0 comments

Website showing LLM solutions to Knuth's Problems?

1•daly•16m ago•0 comments

Unified In-Process Agent Interface for Claude Code, Codex, Kimi

https://github.com/odysa/one-agent-sdk
2•agentforce•17m ago•0 comments

DOJ proposes policy aimed at limiting state bar ethics probes into its attorneys

https://abcnews.com/US/doj-proposes-policy-aimed-limiting-state-bar-ethics/story?id=130755238
4•petethomas•17m ago•1 comment

Global Consciousness Live Data

https://gcp2.net/#home_page_live_data
1•avonmach•18m ago•0 comments

RoundsKeeper – Score tracking for board/card games (Swift/SwiftUI, iCloud sync)

https://apps.apple.com/us/app/roundskeeper/id6754217431
1•justbustr•19m ago•1 comment

Show HN: The hardware isn't changing, why not get AI to build custom drivers?

https://github.com/eli7vh/signal-chain
1•elijahlucian•22m ago•0 comments

Show HN: Scape – One-click worktrees and orchestrators for Claude Code

https://www.scape.work/
3•bgnm2000•23m ago•0 comments

The Global Coherence Initiative [video]

https://www.youtube.com/watch?v=QFqsY-DT6rg
1•avonmach•23m ago•0 comments

Show HN: Fast Chladni figure simulation in Python with NumPy vectorization

https://github.com/ratwolfzero/Chladni_Figures
2•ratwolf•31m ago•1 comment

Show HN: Session-safe uploads and artifacts for remote MCP servers

https://aakashh242.github.io/remote-mcp-adapter/
1•aakashh242•34m ago•0 comments

You Just Reveived

https://dylan.gr/1772520728
3•djnaraps•38m ago•0 comments

Nike Is Moving Jobs to Low-Wage Regions of Indonesia

https://www.propublica.org/article/nike-jobs-indonesia-living-wages
4•petethomas•41m ago•0 comments

First Open-Source PR

https://duanehilton.com/notes/your-first-open-source-pr
1•rem_one•43m ago•0 comments

Signalbase – Real-time business intelligence API for agents (x402, USDC on Base)

https://github.com/brandontan/signalbase
1•brtan881972•48m ago•0 comments

Show HN: OctoFlow–GPU-native lang, vibe-coded with human at every decision gate

https://github.com/octoflow-lang/octoflow
1•mr_octopus•48m ago•3 comments

Low data gravity for fast retrieval on K8s (2021)

https://ra-mos.medium.com/get-up-an-running-with-local-ssds-on-kubernetes-gke-p1-the-code-c6cf5ac...
2•ramoz•49m ago•0 comments

Dear Meta Smart Glasses Wearers: You're Being Watched, Too

https://gizmodo.com/dear-meta-smart-glasses-wearers-youre-being-watched-too-2000728928
4•pabs3•50m ago•1 comment

Bayesian teaching enables probabilistic reasoning in large language models

https://www.nature.com/articles/s41467-025-67998-6
2•paraschopra•51m ago•0 comments

GitHub Copilot Goldeneye model preview

https://docs.github.com/en/copilot/reference/ai-models/model-hosting
1•amusingimpala75•59m ago•1 comment

*

1•kaungsetlin•1h ago•0 comments

LexisNexis confirms data breach as hackers leak stolen files

https://www.bleepingcomputer.com/news/security/lexisnexis-confirms-data-breach-as-hackers-leak-st...
4•arkadiyt•1h ago•0 comments

Morgan Stanley Lays Off 2,500 Employees Across All Divisions

https://www.wsj.com/finance/banking/morgan-stanley-lays-off-2-500-employees-across-all-divisions-...
5•LostMyLogin•1h ago•1 comment

Learn Fundamentals, Not Frameworks

https://newsletter.techworld-with-milan.com/p/learn-fundamentals-not-frameworks
2•stosssik•1h ago•0 comments

Brainworm – Hiding in Your Context Window

https://www.originhq.com/blog/brainworm
1•dsr12•1h ago•0 comments

How does AI change Software Engineering?

https://dlants.me/ai-se.html
1•todsacerdoti•1h ago•0 comments