frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

KumoRFM: A Foundation Model for In-Context Learning on Relational Data

https://kumo.ai/company/news/kumo-relational-foundation-model/
97•cliffly•11h ago

Comments

simplesort•10h ago
Jure Leskovec was my Professor at Stanford a few years back, cool to see he's behind this.

He seemed like a good guy and got the sense that he was destined to do something big

stuartjohnson12•7h ago
Vid is a good friend of mine and he's wicked smart and also a very solid guy I adore.

I'm also guessing at some point he will probably read this comment, so hey Vid! See you at the next VRSA meetup!

andraz•3h ago
Wickedly smart team indeed!
Rohitcss•10h ago
A real-time in-context label generator. Nice...
bookworm123•9h ago
I feel like this is the next big thing for AI, having the ability to interact with any sort of structured dataset out of the box. Very cool project!
perbu•7h ago
I'll suspect it'll be more like the next little thing. Most of don't interact that much with structured data, so the applications will be very specific.

However, the algo-trading crowd, will likely be very interested in this. They deal with structured data all day and it would surprise me if most of them don't already have things like this working in their networks. They seem to be very secretive, though, so we're not gonna hear much.

cliffly•6h ago
We all interact with structured data models constantly, like literally thousands of times each day, just indirectly.

Every single credit card purchase gets classified by a model as fraud or ok. When you go to Netflix and see recommended movies, it's all predictions on structured data. Every single post in every social media feed is there because a model predicted you'd like it.

Realistically, it might be more like 10s of thousands or even hundreds of thousands of predictions that we engage with in a day.

If reality matches the benchmarks for this model, it can kick off a whole new category of models that can potentially be bigger than LLMs

gk1•6h ago
Structured data = relational data

This has more applications than you might first think.

hbarka•5h ago
Does AI for relational data work the same way as token predictions does for LLM AI?
tinyoli•8h ago
Strange that they do not compare it against TabFN, which is another foundation model for tabular data. (https://github.com/PriorLabs/TabPFN)
profjure•1h ago
TabPFN is an amazing innovation. But there are some crucial differences in model capabilities that make it hard for a fair comparison.

TabPFN can only operate on a single small table. But real-world datasets are actually multi-table and to make accurate prediction you need to capture signal from multiple tables (for example, customers, products, purchases).

So, the comparison to TabPFN would be unfair as it would only use data from a single table and that would lead to bad performance of TabPFN.

SubiculumCode•3h ago
So suppose I've got a database of behavioral and neuroimaging data from a research study on autism. Is this something that can be used to predict diagnosis from the other data fields?
profjure•58m ago
Yes, I think this would work. For example, you'd organize the data into 3 tables: patients, behaviors and images. The patients table would have a partially filled-out "diagnosis" column. The model would then predict diagnosis of not-yet-diagnosed patients based on the patterns in data fields of previously diagnosed patients.
EGreg•3h ago
So can this be used to predict patterns for traffic, restaurant table availability, and your customers’ demand for things based on other customers?
autorinalagist•48m ago
Hey! I'm one of the engineers who worked on this project.

These are all problems that KumoRFM is able to solve given that you have the right relational data of course! So e.g. for predicting restaurant table availability you would need at least an occupancy table which records how many seats were available historically and you can predict its future entries.

But you can also add more relevant data without joining into a single table, so you can add a restaurants table, a holiday-calendar table, weather patterns, etc. and KumoRFM should take it all into account when predicting.

nsbk•3h ago
Interesting timing, they have recently reached out to my $dayjob. We will be probably be running a workshop on our (massive) dataset with them. I'd like to evaluate the performance of a couple of analytical models we've manually built against whatever this model can do based on some prompts. Exciting times!
dcrimp•2h ago
interesting! Super cool idea to augment software built with traditional DBs

I had some thoughts [1] around a concept similar to this a while ago, although it was much less refined. My thinking was around whether or not we could have a neural net remember a relational database schema, and be able to be queried for facts it knows, and facts it might predict.

This seems like a much more sensical (and actualised) stab at this kinda concept.

[1]: dancrimp.nz/2024/11/01/semantic-db/

Postgres IDE in VS Code

https://techcommunity.microsoft.com/blog/adforpostgresql/announcing-a-new-ide-for-postgresql-in-vs-code-from-microsoft/4414648
322•Dowwie•3h ago•149 comments

Find Your People

https://foundersatwork.posthaven.com/find-your-people
112•jl•2h ago•45 comments

Beyond Semantics: Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
47•nyrikki•2h ago•13 comments

Into The Tunnel: The secret life of wind tunnels

https://jordanwtaylor2.substack.com/p/into-the-tunnel
18•iamwil•1h ago•2 comments

Caesar's Last Breath

https://charliesabino.com/caesars-last-breath/
86•charliesabino•4h ago•38 comments

The metre originated in the French Revolution

https://www.abc.net.au/news/science/2025-05-20/metre-treaty-anniversary-metric-system-measurement-metrology/105302024
37•Tomte•2h ago•59 comments

Types of optical systems in a lens designer's toolbox (2020)

https://www.pencilofrays.com/lens-design-forms/
3•picture•5m ago•0 comments

Positional preferences, order effects, prompt sensitivity undermine AI judgments

https://www.cip.org/blog/llm-judges-are-unreliable
43•joalstein•1h ago•34 comments

You Don't Need Re-Ranking: Understanding the Superlinked Vector Layer

https://superlinked.com/vectorhub/articles/why-do-not-need-re-ranking
13•softwaredoug•1h ago•7 comments

Show HN: Samchika – A Java Library for Fast, Multithreaded File Processing

https://github.com/MayankPratap/Samchika
39•mprataps•4h ago•23 comments

Sesame Scheme: Unintended Consequences of Allergen Food Labeling

https://www.choicesmagazine.org/choices-magazine/submitted-articles/unintended-consequences-of-allergen-food-labeling
15•hilux•1h ago•3 comments

Why I no longer have an old-school cert on my HTTPS site

https://rachelbythebay.com/w/2025/05/22/ssl/
140•mcbain•7h ago•117 comments

Designing type inference for high quality type errors

https://blog.polybdenum.com/2025/02/14/designing-type-inference-for-high-quality-type-errors.html
36•PaulHoule•4d ago•6 comments

Remembering Alasdair MacIntyre

https://www.wordonfire.org/articles/remembering-alasdair-macintyre-1929-2025/
100•danielam•7h ago•41 comments

MCP is the coming of Web 2.0 2.0

https://www.anildash.com//2025/05/20/mcp-web20-20/
91•freediver•3h ago•87 comments

Slime (2021)

https://granta.com/slime/
15•Tomte•2h ago•0 comments

Writing A Job Runner (In Elixir) (Again) (10 years later)

https://github.com/notactuallytreyanastasio/genstage_tutorial_2025/blob/main/README.md
89•rhgraysonii•7h ago•24 comments

The Curious Case of the Pygmy Nuthatch

https://slate.com/culture/2025/05/birds-movies-charlies-angels-2000-pygmy-nuthatch.html
103•prawn•2d ago•13 comments

John Carmack talk at Upper Bound 2025

https://twitter.com/ID_AA_Carmack/status/1925710474366034326
395•tosh•13h ago•259 comments

How to live on $432 a month in America

https://shagbark.substack.com/p/how-to-live-on-432-a-month-in-america
23•cactusplant7374•1h ago•5 comments

Tallest Wooden Wind Turbine

https://modvion.com/
145•Bluestein•4d ago•105 comments

Alberta separatism push roils Canada

https://www.nytimes.com/2025/05/22/world/canada/alberta-separatism-referendum.html
6•paulpauper•1h ago•13 comments

Satellites Spotting Depth

https://tech.marksblogg.com/depth-anything-v2-maxar-ai-detection.html
82•marklit•2d ago•20 comments

'Turbocharged' Mitochondria Power Birds' Epic Migratory Journeys

https://www.quantamagazine.org/turbocharged-mitochondria-power-birds-epic-migratory-journeys-20250519/
69•pseudolus•4d ago•16 comments

Show HN: Defuddle, an HTML-to-Markdown alternative to Readability

https://github.com/kepano/defuddle
365•kepano•20h ago•61 comments

Bits with Soul

https://www.darwin.cam.ac.uk/lectures/entry/bits-with-soul/
23•mrkeen•4d ago•5 comments

Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking

https://arxiv.org/abs/2504.05652
40•favoboa•2d ago•35 comments

Measuring Lunar North and South Polar Regions

https://iopscience.iop.org/article/10.3847/PSJ/adbc9d
7•bryanrasmussen•3d ago•2 comments

KumoRFM: A Foundation Model for In-Context Learning on Relational Data

https://kumo.ai/company/news/kumo-relational-foundation-model/
97•cliffly•11h ago•17 comments

That fractal that's been up on my wall for years

https://chriskw.xyz/2025/05/21/Fractal/
526•chriskw•1d ago•37 comments