frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: I built a clawdbot that texts like your crush

https://14.israelfirew.co
1•IsruAlpha•25s ago•0 comments

Scientists reverse Alzheimer's in mice and restore memory (2025)

https://www.sciencedaily.com/releases/2025/12/251224032354.htm
1•walterbell•3m ago•0 comments

Compiling Prolog to Forth [pdf]

https://vfxforth.com/flag/jfar/vol4/no4/article4.pdf
1•todsacerdoti•4m ago•0 comments

Show HN: Cymatica – an experimental, meditative audiovisual app

https://apps.apple.com/us/app/cymatica-sounds-visualizer/id6748863721
1•_august•6m ago•0 comments

GitBlack: Tracing America's Foundation

https://gitblack.vercel.app/
1•martialg•6m ago•0 comments

Horizon-LM: A RAM-Centric Architecture for LLM Training

https://arxiv.org/abs/2602.04816
1•chrsw•6m ago•0 comments

We just ordered shawarma and fries from Cursor [video]

https://www.youtube.com/shorts/WALQOiugbWc
1•jeffreyjin•7m ago•1 comments

Correctio

https://rhetoric.byu.edu/Figures/C/correctio.htm
1•grantpitt•7m ago•0 comments

Trying to make an Automated Ecologist: A first pass through the Biotime dataset

https://chillphysicsenjoyer.substack.com/p/trying-to-make-an-automated-ecologist
1•crescit_eundo•11m ago•0 comments

Watch Ukraine's Minigun-Firing, Drone-Hunting Turboprop in Action

https://www.twz.com/air/watch-ukraines-minigun-firing-drone-hunting-turboprop-in-action
1•breve•12m ago•0 comments

Free Trial: AI Interviewer

https://ai-interviewer.nuvoice.ai/
1•sijain2•12m ago•0 comments

FDA Intends to Take Action Against Non-FDA-Approved GLP-1 Drugs

https://www.fda.gov/news-events/press-announcements/fda-intends-take-action-against-non-fda-appro...
7•randycupertino•14m ago•2 comments

Supernote e-ink devices for writing like paper

https://supernote.eu/choose-your-product/
3•janandonly•16m ago•0 comments

We are QA Engineers now

https://serce.me/posts/2026-02-05-we-are-qa-engineers-now
1•SerCe•16m ago•0 comments

Show HN: Measuring how AI agent teams improve issue resolution on SWE-Verified

https://arxiv.org/abs/2602.01465
2•NBenkovich•16m ago•0 comments

Adversarial Reasoning: Multiagent World Models for Closing the Simulation Gap

https://www.latent.space/p/adversarial-reasoning
1•swyx•17m ago•0 comments

Show HN: Poddley.com – Follow people, not podcasts

https://poddley.com/guests/ana-kasparian/episodes
1•onesandofgrain•25m ago•0 comments

Layoffs Surge 118% in January – The Highest Since 2009

https://www.cnbc.com/2026/02/05/layoff-and-hiring-announcements-hit-their-worst-january-levels-si...
8•karakoram•25m ago•0 comments

Papyrus 114: Homer's Iliad

https://p114.homemade.systems/
1•mwenge•25m ago•1 comments

DicePit – Real-time multiplayer Knucklebones in the browser

https://dicepit.pages.dev/
1•r1z4•25m ago•1 comments

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

https://arxiv.org/abs/2601.14340
2•PaulHoule•27m ago•0 comments

Show HN: AI Agent Tool That Keeps You in the Loop

https://github.com/dshearer/misatay
2•dshearer•28m ago•0 comments

Why Every R Package Wrapping External Tools Needs a Sitrep() Function

https://drmowinckels.io/blog/2026/sitrep-functions/
1•todsacerdoti•28m ago•0 comments

Achieving Ultra-Fast AI Chat Widgets

https://www.cjroth.com/blog/2026-02-06-chat-widgets
1•thoughtfulchris•30m ago•0 comments

Show HN: Runtime Fence – Kill switch for AI agents

https://github.com/RunTimeAdmin/ai-agent-killswitch
1•ccie14019•33m ago•1 comments

Researchers surprised by the brain benefits of cannabis usage in adults over 40

https://nypost.com/2026/02/07/health/cannabis-may-benefit-aging-brains-study-finds/
2•SirLJ•34m ago•0 comments

Peter Thiel warns the Antichrist, apocalypse linked to the 'end of modernity'

https://fortune.com/2026/02/04/peter-thiel-antichrist-greta-thunberg-end-of-modernity-billionaires/
4•randycupertino•35m ago•2 comments

USS Preble Used Helios Laser to Zap Four Drones in Expanding Testing

https://www.twz.com/sea/uss-preble-used-helios-laser-to-zap-four-drones-in-expanding-testing
3•breve•41m ago•0 comments

Show HN: Animated beach scene, made with CSS

https://ahmed-machine.github.io/beach-scene/
1•ahmedoo•41m ago•0 comments

An update on unredacting select Epstein files – DBC12.pdf liberated

https://neosmart.net/blog/efta00400459-has-been-cracked-dbc12-pdf-liberated/
3•ks2048•41m ago•0 comments
Open in hackernews

Parse complex documents in LangChain with new provider UndatasIO

https://docs.langchain.com/oss/python/integrations/document_loaders/undatasio
1•jojogh•4mo ago

Comments

jojogh•4mo ago
Hey HN, Alex here, founder of undatas.io.

Huge news: We just launched as a LangChain Core Provider—and we’re here to kill the #1 pain point of RAG: garbage document parsing.

Let’s cut to it: Building reliable AI used to feel like rolling the dice. Existing loaders mangle tables, drop critical data, and give zero way to verify outputs. You’d blindy feed messy text into embeddings, waste compute on garbage, and wonder why your app failed. I started Undatasio because this frustration broke more of my projects than I can count.

Our fix? Two non-negotiables: absolute parsing precision and total transparency—wrapped in a model no one else offers: pay only for the parses you accept. Bad output? It’s free. No excuses, no gotchas.

This isn’t "another loader" for LangChain. As a Core Provider, `UndatasioLoader` bakes quality control into the start of your chain: - Programmatically check parsed JSON before it hits embeddings - Reject docs that miss key fields (e.g., no `invoice_total`, wrong table columns) - See exactly where data came from with positional `bbox` coordinates (build your own validation UI in minutes)

No more wasting time or money on downstream garbage. Data prep should be the reliable part of your stack—not the scary one.

We’ve been grinding to make this integration feel native to LangChain, and partnering with their team to push it live has been a blast.

If you’re tired of RAG failing because your inputs are broken, give it a spin. We’re here all day to answer questions, and we need your feedback to make this even better.

Links to get started are in the comments—fire away!

Here’s how to get started:

1. Install the Package: pip install langchain-undatasio (PyPI Link: https://pypi.org/project/langchain-undatasio/)

2. Check out the Official Docs: (LangChain Provider Page: https://docs.langchain.com/oss/python/integrations/providers...)

3. Try the Live Demo: We've set up a Colab notebook with examples. (Google Colab Notebook: https://colab.research.google.com/drive/1k_UhPjNoiUXC7mkMOEI...)

I'll be here all day to answer any questions. Let me know what you think.