The real time sink is everything before that. Most real-world predictive problems live across many relational tables. So the majority of the work ends up being:
• Discovering which tables are actually relevant
• Understanding foreign keys and entity relationships
• Figuring out cardinality (1:1, 1:N, N:M)
• Aggregating child tables into meaningful features
• Handling time windows and leakage
• Integrating everything into a single training table
Only after all of that can you actually train the model. In many projects, 80–90% of the effort is spent on data discovery and multi-table aggregation, while the modeling step itself takes minutes.
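To make the list above concrete, here is a minimal pandas sketch of the collapse-to-one-table workflow (this is not GraphReduce's API — the table names, columns, and cutoff date are all made up for illustration): aggregate a 1:N child table into per-entity features, with a time cutoff so post-prediction-date rows can't leak in, then join back to the parent.

```python
import pandas as pd

# Hypothetical parent "customers" table and child "orders" table.
customers = pd.DataFrame({
    "customer_id": [1, 2],
    "signup_date": pd.to_datetime(["2023-01-01", "2023-02-01"]),
})
orders = pd.DataFrame({
    "customer_id": [1, 1, 2, 2],
    "order_date": pd.to_datetime(
        ["2023-03-01", "2023-06-15", "2023-03-10", "2023-07-01"]),
    "amount": [50.0, 20.0, 75.0, 10.0],
})

# Leakage guard: only aggregate child rows observed before the cutoff.
cutoff = pd.Timestamp("2023-06-01")
visible = orders[orders["order_date"] < cutoff]

# Collapse the 1:N child table into per-customer features.
feats = visible.groupby("customer_id").agg(
    order_count=("amount", "size"),
    total_spend=("amount", "sum"),
).reset_index()

# Integrate into a single training table; customers with no visible
# orders keep zeros rather than being dropped.
train = customers.merge(feats, on="customer_id", how="left").fillna(
    {"order_count": 0, "total_spend": 0.0}
)
```

With an N:M relationship you would hop through the junction table first, and in a real project each of these joins is where the cardinality and leakage questions from the list actually bite.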
Tabular foundation models reduce the amount of tuning required, but they don’t remove the fundamental need to collapse relational data into a single learning table. The bottleneck in tabular AI has always been the data graph, not the model.
GraphReduce is a project I've been incrementally building for a few years that addresses the real problem in tabular predictive AI: data prep.
https://wesmadrigal.github.io/GraphReduce/