Parse complex documents in LangChain with new provider UndatasIO

https://docs.langchain.com/oss/python/integrations/document_loaders/undatasio
1•jojogh•4mo ago

Comments

jojogh•4mo ago
Hey HN, Alex here, founder of undatas.io.

Huge news: We just launched as a LangChain Core Provider—and we’re here to kill the #1 pain point of RAG: garbage document parsing.

Let’s cut to it: Building reliable AI used to feel like rolling the dice. Existing loaders mangle tables, drop critical data, and give you no way to verify their output. You’d blindly feed messy text into embeddings, waste compute on garbage, and wonder why your app failed. I started Undatasio because this frustration broke more of my projects than I can count.

Our fix? Two non-negotiables: absolute parsing precision and total transparency—wrapped in a model no one else offers: pay only for the parses you accept. Bad output? It’s free. No excuses, no gotchas.

This isn’t "another loader" for LangChain. As a Core Provider, `UndatasioLoader` bakes quality control into the start of your chain:

- Programmatically check parsed JSON before it hits embeddings
- Reject docs that miss key fields (e.g., no `invoice_total`, wrong table columns)
- See exactly where data came from with positional `bbox` coordinates (build your own validation UI in minutes)
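
To make the "check before embedding" idea concrete, here is a rough sketch of an acceptance gate. It does not call `UndatasioLoader` at all; it works on plain dicts. The dict layout (`fields`, `tables`, `columns`), the `[x0, y0, x1, y1]` bbox convention, the required column names, and both function names are assumptions made up for illustration, so map them onto whatever the loader actually returns.

```python
from typing import Any

# Hypothetical validation rules; only `invoice_total` and `bbox` are names
# taken from the post, everything else here is an illustrative assumption.
REQUIRED_FIELDS = ("invoice_total",)
REQUIRED_TABLE_COLUMNS = {"description", "quantity", "amount"}


def validation_errors(parsed: dict[str, Any]) -> list[str]:
    """Return the problems found in one parsed document (empty list = accept)."""
    errors = []
    fields = parsed.get("fields", {})
    for name in REQUIRED_FIELDS:
        if name not in fields:
            errors.append(f"missing field: {name}")
            continue
        bbox = fields[name].get("bbox")
        # Assumed bbox convention: [x0, y0, x1, y1] with x0 < x1 and y0 < y1.
        well_formed = (
            isinstance(bbox, list)
            and len(bbox) == 4
            and bbox[0] < bbox[2]
            and bbox[1] < bbox[3]
        )
        if not well_formed:
            errors.append(f"bad bbox for field: {name}")
    for i, table in enumerate(parsed.get("tables", [])):
        missing = REQUIRED_TABLE_COLUMNS - set(table.get("columns", []))
        if missing:
            errors.append(f"table {i} missing columns: {sorted(missing)}")
    return errors


def split_parses(docs: list[dict[str, Any]]):
    """Separate parses that pass validation from those that should be rejected."""
    accepted, rejected = [], []
    for doc in docs:
        errs = validation_errors(doc)
        if errs:
            rejected.append((doc, errs))
        else:
            accepted.append(doc)
    return accepted, rejected
```

Only the `accepted` list would go on to chunking and embedding; the `rejected` list keeps each document next to its reasons, which is the part you would surface in a review UI.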

No more wasting time or money on downstream garbage. Data prep should be the reliable part of your stack—not the scary one.

We’ve been grinding to make this integration feel native to LangChain, and partnering with their team to push it live has been a blast.

If you’re tired of RAG failing because your inputs are broken, give it a spin. We’re here all day to answer questions, and we need your feedback to make this even better.

Links to get started are in the comments—fire away!

Here’s how to get started:

1. Install the Package: pip install langchain-undatasio (PyPI Link: https://pypi.org/project/langchain-undatasio/)

2. Check out the Official Docs: (LangChain Provider Page: https://docs.langchain.com/oss/python/integrations/providers...)

3. Try the Live Demo: We've set up a Colab notebook with examples. (Google Colab Notebook: https://colab.research.google.com/drive/1k_UhPjNoiUXC7mkMOEI...)

I'll be here all day to answer any questions. Let me know what you think.