frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Incorporator, Turn any API/File into typed Python graph with pipeline

https://github.com/PyPlumber/Incorporator/
1•PyPlumber•1h ago
When landing data I prefer to keep it as close to the original source as possible. Most of the Python data ingestion programs I saw treated Python more like SQL instead of harnessing object orientation. This was my attempt at translating my object-orented columnar approach to Python. I originally did it with Requests and Pandas but the overhead costs did not seem worth it. Claude helped refactor for async and Pydantic.

Now HTTPX’s async capabilities and Pydantic’s class building took this project over the top. By harnessing their abilities I shifted the codebase from data mapper to pipeline orchestrator. I added every format I could that seemed to have an established Python library. Right now I believe I support : JSON, NDJSON, XML, CSV, TSV, PSV, SQLite, and HTML out of the box. Optional extras (~30 MB pyarrow) unlock Parquet, Feather, ORC; Avro and XLSX have their own extras. I also added every compression I could find. Benchmarks at least for a windows machine are on par with other elt packages.

By focusing on function wrappers to make the developer’s syntax as easy as possible for the original data mapping calls, I established simple automated pipelines with one cli command and one JSON reference file. The JSON is basically the same syntax you would use in Python.

Both stream and fjord accept inflow and outflow Python code. Inflow code allows you to set custom conversion functions and mappings for the incoming data. The outflow code allows you to manipulate the exporting data into a new object new entirely.

Also, because your pipeline is basically created by a JSON file. You should eventually be able to automate the creation of the entire pipeline. Enjoy.

https://github.com/PyPlumber/Incorporator/

How you use it: Declare a subclass with no fields, point it at a URL, and it infers a Pydantic model from the response at runtime — with full strict typing, dot-notation, and an optional registry lookup by any key. class Launch(Incorporator): pass launches = await Launch.incorp( inc_url="https://ll.thespacedevs.com/2.2.0/launch/upcoming/" )

These functions handle the rest of your data mapping and export format needs: - test() lets the framework write the call kwargs for you - refresh() re-fetches with the seed call's params auto-replayed - export() serialises to any of the 13 formats

Then these functions create a pipeline. - stream() runs a chunked daemon with bounded memory. Can be used in two modes: pass-through or stateful (in RAM) updates to be manipulated in real-time. - fjord() fans out N sources and fuses them through a user reducer. This accepts multiple sources and exports.

After that all works copy the parameters into pipeline.json and the command can be as simple: incorporator validate pipeline.json incorporator fjord pipeline.json –logs

Comments

PyPlumber•10m ago
The docs and examples folders have: Tutorials 1-7 with matching code files. Should be a nice progression on using the tool.

Appendices have more advanced examples. There's a fantasy racing league example with 6-api calls & 1-file source with 3 outflows all in the form of an automated fjord pipeline cli call.

Claude FM

https://www.youtube.com/watch?v=YmQ7jRgf4f0
1•arishi•4m ago•0 comments

Judge Bars Kars4Kids from Broadcasting 'Misleading' Ads in California

https://www.nytimes.com/2026/05/15/us/kars4kids-advertising-banned-california.html
4•xnx•5m ago•0 comments

Hotel check-in system exposed 1M passports and driver's licenses

https://techcrunch.com/2026/05/15/a-hotel-check-in-system-left-a-million-passports-and-drivers-li...
1•p_stuart82•6m ago•0 comments

Ask an Astronaut: 333 hours of Q&A footage with astronauts

https://askanastronaut.issinrealtime.org/
1•gaws•7m ago•0 comments

Obsidian Radar Plugin

https://github.com/lfcipriani/obsidian-radar
1•lfcipriani•7m ago•0 comments

The founder's playbook: Building an AI-native startup

https://claude.com/blog/the-founders-playbook
1•shenli3514•7m ago•0 comments

Fedora Hummingbird: Taking the Hummingbird model to the full operating system

https://fedoramagazine.org/fedora-hummingbird-linux-taking-the-hummingbird-model-to-the-full-os/
1•abdelhousni•8m ago•1 comments

Google's Gemini Omni video model surfaces ahead of I/O debut

https://www.testingcatalog.com/googles-gemini-omni-video-model-surfaces-ahead-of-i-o-debut/
1•gmays•8m ago•0 comments

Analysis of 70 years of "Eurovision" lyrics

https://puntofisso.net/eurovision/
1•gaws•9m ago•0 comments

WinCE64 – Windows CE 2.11 for N64

https://github.com/ThroatyMumbo/WinCE64
3•xyru•9m ago•0 comments

The Magic of Meetups

https://elijahpotter.dev/articles/product-product-product
1•chilipepperhott•11m ago•0 comments

OpenAI Models in OpenClaw, Done Right

https://openclaw.ai/blog/openai-models-in-openclaw-done-right
1•sansuk•13m ago•0 comments

Show HN: Claude Code vs. Codex Global Usage Leaderboard

https://costhawk.ai/leaderboard
2•optimizethis•14m ago•0 comments

Stop state surveillance in Canada. Stop Bill C-22

https://www.jccf.ca/stop-bill-c-22-stop-surveillance-in-canada/
2•YoctoYARN•14m ago•0 comments

Learning, Fast and Slow: Towards LLMs That Adapt Continually

https://gepa-ai.github.io/gepa/blog/2026/05/11/learning-fast-and-slow/
2•LakshyAAAgrawal•18m ago•0 comments

A New Kind of Family-Separation Crisis

https://www.theatlantic.com/politics/2026/05/honduras-deportations-without-children/687153/
2•paulpauper•19m ago•0 comments

I Was Drowning Running 14 Markets Alone. So I Built a $0.41/Day AI Employee

https://medium.com/@alanscottencinas/i-was-drowning-running-14-markets-alone-so-i-built-a-0-41-da...
1•encinas88•19m ago•0 comments

State media control influences large language models

https://www.nature.com/articles/s41586-026-10506-7
1•paulpauper•19m ago•0 comments

What I've Been Reading

https://marginalrevolution.com/marginalrevolution/2026/05/what-ive-been-reading-288.html
2•paulpauper•20m ago•0 comments

THORChain exploit hits Bitcoin, Ethereum, and BSC: Hackers steal over $10M

https://ambcrypto.com/thorchain-exploit-hits-bitcoin-ethereum-and-bsc-hackers-steal-over-10-mln/
5•wslh•21m ago•1 comments

Tell HN: Audible app used 19.8GB of data while not being used

2•jimnotgym•21m ago•1 comments

Show HN: X open sourced their algorithm

https://www.xalgorithm.xyz/en
2•hsnrique•22m ago•0 comments

Does Trump Mobile know how many stripes are on the American flag?

https://www.theverge.com/gadgets/931347/trump-mobile-t1-phone-logo-flag-stars-stripes
4•droidjj•24m ago•0 comments

Show HN: Emergence World: World building as a way to evaluate LLMs

https://world.emergence.ai/
2•deepakakkil•25m ago•0 comments

ABC News has taken all FiveThirtyEight articles offline

https://twitter.com/baseballot/status/2055309076209492208
27•cmsparks•25m ago•2 comments

Dual Intel Arc Pro B60(48G) Inference, Virtualization, and Gaming Testing

https://www.lttlabs.com/articles/2026/05/15/maxsun-intel-arc-pro-b60-dual-48g-turbo-review
3•LabsLucas•26m ago•0 comments

Ask HN: Reviewing Plan from Plan Mode

3•dennis3124•27m ago•0 comments

Cvl: A C++26 library for mutating consteval state

https://github.com/friedkeenan/cvl
2•friedkeenan•28m ago•0 comments

Don't Design Your Emails (2016)

https://www.gkogan.co/dont-design-emails/
1•downbad_•29m ago•0 comments

PSVL 1.0 – The most comprehensive source-visible license (276 clauses)

https://github.com/BMBOMICH/PSVL
1•BMBOMICH•29m ago•0 comments