frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

How LLM agents solve the table merging problem

https://futuresearch.ai/deep-merge-tutorial/
17•ddp26•1h ago

Comments

mckennameyer•1h ago
Interesting approach with the cascade. How do you decide when to escalate from fuzzy matching to LLM?
parad0x0n•1h ago
So fuzzy matching only makes sense if you expect two columns having the same data more or less, otherwise you can skip that step.

And then you have to pick a threshold -> if similarity of strings is above that threshold, it's a match, otherwise, not. Threshold should be high to prevent false positives. LLM will take care of the non-matches

jackfranklyn•45m ago
Been working on this exact problem in the financial/accounting space - matching bank statement rows to accounting records. Real-world messiness makes it interesting:

The fuzzy threshold question is tricky because false positives are worse than false negatives. A user seeing a wrong match erodes trust fast. We ended up with a tiered approach: high-confidence matches go through automatically, medium-confidence gets surfaced for human review, low-confidence stays unmatched rather than guessing.

One thing we found: the hardest cases aren't the ones where strings are slightly different - they're the ones where the same transaction appears with completely different descriptions on each side. "PAYPAL *ACME" vs "Invoice 1234 - Acme Ltd". No amount of fuzzy matching helps there. That's where learning from historical patterns (how did the user match these before?) beats trying to infer semantic similarity from scratch every time.

ddp26•33m ago
Yep! We have lots of examples like that where two vendors, or two customers, are completely non-matching. With LLMs and LLM web agents, you also can associate things that are not the same entity.

One example we have is merging a table of companies to a table of company websites. You get things like "Acme Corp" matching "my-logicistics.com" that no LLM has memorized, so you have to look them up using the web. ReAct web agents work really well here, but it can be very expensive, so it's all about doing this cost efficiently.

Windows 11 update KB5074109 is breaking systems – Microsoft says uninstall it

https://www.windowscentral.com/microsoft/windows-11/microsoft-urges-uninstalling-the-update-kb507...
3•game_the0ry•2m ago•0 comments

Apple's Siri Chatbot May Run on Google Servers

https://www.macrumors.com/2026/01/22/apples-siri-chatbot-may-run-on-google-servers/
1•jonbaer•3m ago•0 comments

Claude Code TUI Runs on React

https://twitter.com/trq212/status/2014051501786931427
1•redox99•4m ago•0 comments

Habitable worlds may be more common than thought

https://www.jpost.com/science/article-884256
2•wjb3•4m ago•0 comments

Show HN: We tested AI agents with 214 attacks that don't require jailbreaking

1•exordex•4m ago•0 comments

Digital Gentrification: Building a future we remember instead of the one we want

https://kraa.io/306942411031387136
1•ieuanking•5m ago•1 comments

Spectrogram Art with Webcam Images [video]

https://www.youtube.com/watch?v=1wc6l5TQjfs
1•phantomshelby•6m ago•0 comments

Mistral CEO:China lagging in AI is a 'fairy tale'

https://www.msn.com/en-us/money/other/china-lagging-in-ai-is-a-fairy-tale-mistral-ceo-says/ar-AA1...
2•ekm2•8m ago•0 comments

Salesforce can't even get its Settings menu right

https://www.mildlyangry.com/2026/salesforce-cant-even-get-its-settings-menu-right/
1•reddalo•8m ago•0 comments

Drowning in AI slop, cURL ends bug bounties

https://thenewstack.io/drowning-in-ai-slop-reports-curl-ends-bug-bounties/
3•CrankyBear•8m ago•0 comments

The Moral Education of an Alien Mind

https://www.lawfaremedia.org/article/the-moral-education-of-an-alien-mind
1•ano-ther•9m ago•0 comments

Service degradation on Microsoft 365 (Business or Enterprise)

https://status.cloud.microsoft/m365/referrer=serviceStatusRedirect
3•caminanteblanco•10m ago•0 comments

'Organized syndicates' fraudulently access health records, lawsuit says

https://www.washingtonpost.com/health/2026/01/22/electronic-health-record-fraud-lawsuit/
3•reaperducer•10m ago•0 comments

Single Executable Applications with Node.js

https://nodejs.org/api/single-executable-applications.html
1•strogonoff•11m ago•0 comments

Judge rejects DOJ's initial attempt to bring charges against Don Lemon

https://www.cnn.com/2026/01/22/politics/don-lemon-justice-department-minnesota
4•FireBeyond•13m ago•1 comments

Cybersecurity companies can now track brand visibility in ChatGPT, perplexity

https://www.usatoday.com/press-release/story/23810/san-francisco-startup-launches-first-platform-...
1•guptadeepak•14m ago•1 comments

Joan Didion: Only Disconnect

https://www.writing.upenn.edu/~afilreis/103/didion-per-harrison.html
2•ownlife•14m ago•0 comments

Call for Volunteers: Debian Data Protection Team

https://lists.debian.org/debian-devel-announce/2026/01/msg00001.html
2•ImJamal•16m ago•0 comments

Show HN: Meepr – A quiet, self building social platform

https://meepr.co/
1•neom•17m ago•0 comments

Fail Productively

https://dontbreakprod.com/posts/fail-productively
2•dorkrawk•17m ago•0 comments

AI Global: Global Sector Trends on Generative AI (1/16/26) [pdf]

https://www.similarweb.com/corp/wp-content/uploads/2026/01/attachment-Global-AI-Tracker-7.pdf
1•frozenseven•19m ago•0 comments

Betting on Prediction Markets Is Their Job. They Make Millions.

https://www.nytimes.com/2026/01/22/business/prediction-markets-polymarket-kalshi.html
1•bookofjoe•19m ago•1 comments

The Tragedy of the Agentic Commons

https://aleximas.substack.com/p/the-tragedy-of-the-agentic-commons
2•alphabetatango•20m ago•0 comments

General Fusion to Become First Publicly Traded Pure-Play Fusion Company

https://generalfusion.com/post/general-fusion-business-combination-announcement/
1•Element_•22m ago•0 comments

Show HN: TalkCAD – AI agent to generate CAD models using OpenSCAD code

https://github.com/outerreaches/talkcad
1•alex_maz•24m ago•0 comments

A New Single-Stage Four-Switch Common-Ground-Type Buck-Boost Inverter

https://www.mdpi.com/1996-1073/19/1/64
1•PaulHoule•24m ago•0 comments

Ask HN: I'm sure more than just Microsoft is down rn

5•koconder•25m ago•2 comments

Seasoning a Kid: A Search for a Practice of Place

https://emergencemagazine.org/feature/seasoning-a-kid/
1•gmays•25m ago•0 comments

Why Catalonia Failed (2022)

https://www.palladiummag.com/2022/05/11/why-catalonia-failed/
1•icwtyjj•25m ago•0 comments

Lululemon – you're wearing them wrong

https://financialpost.com/pmn/business-pmn/lululemon-now-asks-leggings-buyers-to-wear-skin-toned-...
2•canucker2016•26m ago•1 comments