Tabular data is the frontier – graphs can help

5•madman2890•1h ago
Even in 2026, most of the work in tabular predictive AI still has very little to do with the model itself. Whether you're using CatBoost, XGBoost, or newer tabular foundation models like TabPFN, the .fit() step is usually the smallest part of the workflow.

The real time sink is everything before that. Most real-world predictive problems live across many relational tables. So the majority of the work ends up being:

• Discovering which tables are actually relevant

• Understanding foreign keys and entity relationships

• Figuring out cardinality (1:1, 1:N, N:M)

• Aggregating child tables into meaningful features

• Handling time windows and leakage

• Integrating everything into a single training table

Only after all of that can you actually train the model. In many projects, 80–90% of the effort is spent on data discovery and multi-table aggregation, while the modeling step itself takes minutes.
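To make the steps listed above concrete, here is a minimal, dependency-free sketch in plain Python. It is not GraphReduce's API. The `customers` and `orders` tables, their column names, the `cardinality` and `build_training_table` helpers, and the 90-day lookback window are all hypothetical, chosen only for illustration. It checks the cardinality of a parent/child relationship, then aggregates child rows into per-parent features using only events before each parent's cutoff date, so nothing from the future leaks into the training table.

```python
from collections import defaultdict
from datetime import date, timedelta

# Hypothetical parent table: one row per customer, with a prediction cutoff date.
customers = [
    {"customer_id": 1, "cutoff": date(2024, 3, 1)},
    {"customer_id": 2, "cutoff": date(2024, 3, 1)},
    {"customer_id": 3, "cutoff": date(2024, 3, 1)},
]

# Hypothetical child table: zero or more orders per customer.
orders = [
    {"order_id": 10, "customer_id": 1, "order_date": date(2024, 1, 15), "amount": 50.0},
    {"order_id": 11, "customer_id": 1, "order_date": date(2024, 3, 10), "amount": 999.0},  # after cutoff
    {"order_id": 12, "customer_id": 2, "order_date": date(2024, 2, 1), "amount": 20.0},
    {"order_id": 13, "customer_id": 2, "order_date": date(2024, 2, 20), "amount": 30.0},
    {"order_id": 14, "customer_id": 2, "order_date": date(2023, 6, 1), "amount": 500.0},  # before window
]


def cardinality(parent_rows, child_rows, key):
    """Classify the parent:child relationship on `key` from key uniqueness."""
    parent_keys = [row[key] for row in parent_rows]
    child_keys = [row[key] for row in child_rows]
    parent_unique = len(parent_keys) == len(set(parent_keys))
    child_unique = len(child_keys) == len(set(child_keys))
    if parent_unique and child_unique:
        return "1:1"
    if parent_unique:
        return "1:N"
    if child_unique:
        return "N:1"
    return "N:M"


def build_training_table(parent_rows, child_rows, window_days=90):
    """Collapse child rows into one feature row per parent.

    Only events inside [cutoff - window_days, cutoff) are used, so data
    recorded at or after the cutoff cannot leak into the features.
    """
    children_by_key = defaultdict(list)
    for row in child_rows:
        children_by_key[row["customer_id"]].append(row)

    table = []
    for parent in parent_rows:
        window_start = parent["cutoff"] - timedelta(days=window_days)
        in_window = [
            child
            for child in children_by_key.get(parent["customer_id"], [])
            if window_start <= child["order_date"] < parent["cutoff"]
        ]
        table.append({
            "customer_id": parent["customer_id"],
            "order_count": len(in_window),
            "amount_sum": sum(child["amount"] for child in in_window),
        })
    return table


training_table = build_training_table(customers, orders)
```

Each parent row keeps exactly one output row, including customers with no qualifying orders, which is what a single training table requires. In a real project the same logic would more likely be written in SQL or a dataframe library and repeated for every child table in the graph.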

Tabular foundation models reduce the amount of tuning required, but they don’t remove the fundamental need to collapse relational data into a single learning table. The bottleneck in tabular AI has always been the data graph, not the model.

GraphReduce is a project I've been building incrementally for a few years. It addresses the real problem in tabular predictive AI: data prep.

https://wesmadrigal.github.io/GraphReduce/

Comments

amazonbezos•1h ago
This is definitely where most of the time is spent - very cool!
madman2890•1h ago
Glad you find it useful.
