Show HN: Pontoon, an open-source data export platform

3•kalanm•21h ago
Hi HN,

We’re Alex and Kalan, the creators of Pontoon (https://github.com/pontoon-data/Pontoon). Pontoon is an open-source data export platform that makes it easy to create data syncs and send data to your enterprise customers. Check out our demo here: https://app.storylane.io/share/onova7c23ai6 or try it out with Docker: https://pontoon-data.github.io/Pontoon/getting-started/quick...

In our prior roles as data engineers, we both felt the pain of data APIs. We either had to spend weeks building out data pipelines in-house or spend a lot on ETL tools like Fivetran (https://www.fivetran.com/). However, a few companies offered data syncs that wrote directly to our data warehouse (e.g. Redshift, Snowflake), and when that was an option, we always chose it. This led us to wonder, “Why don’t more companies offer data syncs?” It turns out that building reliable cross-cloud data syncs is difficult. That’s why we built Pontoon.

We designed Pontoon to be:

- Easily deployed: we provide a single, self-contained Docker image for easy deployment and Docker Compose for larger workloads (https://pontoon-data.github.io/Pontoon/getting-started/quick...)

- Compatible with modern data warehouses: we support syncing to/from Snowflake, BigQuery, Redshift, and Postgres.

- Cross-cloud: sync from BigQuery to Redshift, Snowflake to BigQuery, Postgres to Redshift, etc.

- Developer friendly: data syncs can also be built via the API

- Open source: Pontoon is free to use by anyone

Under the hood, we use Apache Arrow (https://arrow.apache.org/) to move data between sources and destinations. Arrow is very performant, and we wanted a library that could handle moving millions of records per minute.
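To make the source-to-destination flow concrete, here is a minimal sketch of a batch-oriented sync. It is not Pontoon's code and it does not use Arrow itself: it uses only Python's standard library, with in-memory SQLite databases standing in for the source and destination warehouses. The `customers` table and the names `read_batches`, `sync_table`, `make_source`, and `batch_size` are all hypothetical, chosen for illustration.

```python
import sqlite3
from typing import Iterator, List, Tuple

Row = Tuple[int, str]


def read_batches(conn: sqlite3.Connection, batch_size: int) -> Iterator[List[Row]]:
    """Yield rows from the (hypothetical) customers table in fixed-size batches."""
    cursor = conn.execute("SELECT id, name FROM customers ORDER BY id")
    while True:
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        yield batch


def sync_table(source: sqlite3.Connection, dest: sqlite3.Connection,
               batch_size: int = 1000) -> int:
    """Copy every row from source to dest one batch at a time; return rows copied."""
    dest.execute(
        "CREATE TABLE IF NOT EXISTS customers (id INTEGER PRIMARY KEY, name TEXT)"
    )
    total = 0
    for batch in read_batches(source, batch_size):
        # INSERT OR REPLACE keeps a re-run from duplicating rows with the same id.
        dest.executemany(
            "INSERT OR REPLACE INTO customers (id, name) VALUES (?, ?)", batch
        )
        dest.commit()
        total += len(batch)
    return total


def make_source(n_rows: int) -> sqlite3.Connection:
    """Build an in-memory stand-in for a source warehouse with n_rows rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany(
        "INSERT INTO customers VALUES (?, ?)",
        [(i, f"customer-{i}") for i in range(n_rows)],
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    source = make_source(2500)
    dest = sqlite3.connect(":memory:")
    copied = sync_table(source, dest, batch_size=1000)
    print(f"copied {copied} rows")
```

Reading and writing in bounded batches keeps memory use flat regardless of table size. A columnar format like Arrow builds on the same idea, representing each batch as typed columns rather than as a list of row tuples.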

In the short term, there are several improvements we want to make, like:

- Adding support for dbt models to make adding data models easier

- UX improvements like better error messaging and monitoring of data syncs

- More sources and destinations (S3, GCS, Databricks, etc.)

- Improving the API for a more developer-friendly experience (it’s currently tied pretty closely to the front end)

In the long term, we want to make data sharing as easy as possible. As data engineers, we sometimes felt like second-class citizens because of how we were told to get the data we needed: “just loop through this api 1000 times”, “you probably won’t get rate limited” (we did), “we can schedule an email to send you a csv every day”. We want to change how modern data sharing is done and make it simple for everyone.

Give it a try: https://github.com/pontoon-data/Pontoon. Cheers!
