frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
494•klaussilveira•8h ago•135 comments

The Waymo World Model

https://waymo.com/blog/2026/02/the-waymo-world-model-a-new-frontier-for-autonomous-driving-simula...
835•xnx•13h ago•500 comments

How we made geo joins 400× faster with H3 indexes

https://floedb.ai/blog/how-we-made-geo-joins-400-faster-with-h3-indexes
52•matheusalmeida•1d ago•9 comments

A century of hair samples proves leaded gas ban worked

https://arstechnica.com/science/2026/02/a-century-of-hair-samples-proves-leaded-gas-ban-worked/
108•jnord•4d ago•17 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty
162•dmpetrov•8h ago•75 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
165•isitcontent•8h ago•18 comments

Dark Alley Mathematics

https://blog.szczepan.org/blog/three-points/
59•quibono•4d ago•10 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
274•vecti•10h ago•127 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
221•eljojo•11h ago•138 comments

Microsoft open-sources LiteBox, a security-focused library OS

https://github.com/microsoft/litebox
337•aktau•14h ago•163 comments

Show HN: ARM64 Android Dev Kit

https://github.com/denuoweb/ARM64-ADK
11•denuoweb•1d ago•0 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/
332•ostacke•14h ago•89 comments

PC Floppy Copy Protection: Vault Prolok

https://martypc.blogspot.com/2024/09/pc-floppy-copy-protection-vault-prolok.html
34•kmm•4d ago•2 comments

Hackers (1995) Animated Experience

https://hackers-1995.vercel.app/
420•todsacerdoti•16h ago•221 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/
355•lstoll•14h ago•246 comments

Female Asian Elephant Calf Born at the Smithsonian National Zoo

https://www.si.edu/newsdesk/releases/female-asian-elephant-calf-born-smithsonians-national-zoo-an...
15•gmays•3h ago•2 comments

Delimited Continuations vs. Lwt for Threads

https://mirageos.org/blog/delimcc-vs-lwt
9•romes•4d ago•1 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
56•phreda4•7h ago•9 comments

How to effectively write quality code with AI

https://heidenstedt.org/posts/2026/how-to-effectively-write-quality-code-with-ai/
209•i5heu•11h ago•152 comments

I spent 5 years in DevOps – Solutions engineering gave me what I was missing

https://infisical.com/blog/devops-to-solutions-engineering
121•vmatsiiako•13h ago•47 comments

Introducing the Developer Knowledge API and MCP Server

https://developers.googleblog.com/introducing-the-developer-knowledge-api-and-mcp-server/
32•gfortaine•5h ago•6 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
157•limoce•3d ago•79 comments

Understanding Neural Network, Visually

https://visualrambling.space/neural-network/
257•surprisetalk•3d ago•33 comments

I now assume that all ads on Apple news are scams

https://kirkville.com/i-now-assume-that-all-ads-on-apple-news-are-scams/
1011•cdrnsf•17h ago•421 comments

FORTH? Really!?

https://rescrv.net/w/2026/02/06/associative
51•rescrv•16h ago•17 comments

I'm going to cure my girlfriend's brain tumor

https://andrewjrod.substack.com/p/im-going-to-cure-my-girlfriends-brain
90•ray__•4h ago•41 comments

Evaluating and mitigating the growing risk of LLM-discovered 0-days

https://red.anthropic.com/2026/zero-days/
43•lebovic•1d ago•12 comments

How virtual textures work

https://www.shlom.dev/articles/how-virtual-textures-really-work/
34•betamark•15h ago•29 comments

Show HN: Smooth CLI – Token-efficient browser for AI agents

https://docs.smooth.sh/cli/overview
78•antves•1d ago•59 comments

Show HN: Slack CLI for Agents

https://github.com/stablyai/agent-slack
43•nwparker•1d ago•11 comments
Open in hackernews

Fivetran to acquire Census

https://www.fivetran.com/blog/why-fivetran-and-census-are-joining-forces
82•njaremko•9mo ago

Comments

_dark_matter_•9mo ago
Seems like a no-brainer. I wonder if they ever started to build these capabilities in house; I'm sure they already had so much of the tooling available.
mritchie712•9mo ago
Yeah, I was always curious why Fivetran didn't build this themselves when reverse-ETL started to take off.

I built a company[0], SeekWell, in this space (launched before Census), but was mostly focused on Sheets and Slack as destinations. SeekWell was acquired a few years ago too.

0 - https://seekwell.io/

skadamat•9mo ago
The challenge of syncing from stubborn SaaS tools to your data warehouse / database I suspect is different than syncing data from your data warehouse / database back to SaaS tools. Specifically, reverse ETL has to incorporate more context from the business I guess so the data that lands in the 3rd party tools is actually solid.

Once you have customers and a good network of integrations with a large number of tools, I suspect it's easier to just buy that company than build it all yourself?

throwaway7783•9mo ago
The data is only as solid as you make it to be. Ultimately reverse ETL is just a technology (basically from SQL to APIs). The quality/correctness of data is someone else's headache. I've been there and done that, and reverse ETL is a feature-product with huge churn. See how Hightouch pivoted hard from that into CDP.
georgewfraser•9mo ago
This is exactly right. We even went so far as to build a proof of concept internally, and the technical challenges are just very different. The simplest way to explain it is that Fivetran connects a skinny pipe (APIs) to a fat pipe (databases) while Census connects a fat pipe to a skinny pipe.
tschellenbach•9mo ago
not a big fan of fivetran's pricing
throwaway7783•9mo ago
Can you say more? Is it the unpredictability, or just too expensive?
rawgabbit•9mo ago
They charge based on the highest number of unique rows transferred for the month. It gets expensive quickly.
arjie•9mo ago
I run a professional services org that helps you switch to an open source alternative. We'll host the solution for you if you want and aim to be drop-in Fivetran compatible in your workflows with a transition plan so you can run the thing if you'd like. Pricing is flexible. Personal email in profile.
orangechairs•9mo ago
Anyone hear rumors of how much they were acquired for?
mritchie712•9mo ago
I'd guess a little north of $500 million.

- Census last raised $60M Series B at a $630M valuation (upper bound)

- Census’s estimated annual revenue is $31.6 million with ~200 employees.

- Median private-SaaS EV/ARR multiple is 7× (7 * 31 = 217 = lower bound)

- Hightouch raises $80M on a $1.2B valuation(at ~60× ARR)

- Twilio completes $3.2B acquisition of Segment at ~21× ARR (upper multiple bound)

DropkickM16•9mo ago
lol
throwaway7783•9mo ago
Still missing a critical piece - ETL. If they acquire it instead of just asking people to use dbt, they have (kinda) the full stack solution.
mritchie712•9mo ago
there's going to be more consolidation in data tooling this year. Many of the stand alone tools raised too much money and no one wants to buy 5 really expensive tools to assemble a "data stack" anymore.

if you want a data platform that's built to work as one cohesive unit, we got you: https://www.definite.app/

Definite has a data lake, ETL, and BI in one app.

davidu•9mo ago
Congrats to the Census team and the Fivetran team!
bradleybuda•9mo ago
Thanks DU! You've had our back since the early days, thanks for taking a chance on us!
barrrrald•9mo ago
Congrats to everyone. Some of the smartest and kindest people in data coming together!
tqi•9mo ago
What does this actually mean for customers? Is are we going to have to rebuild our Census syncs in Fivetran or will the product continue to run as-is? Will plans / pricing change?
buremba•9mo ago
This indeed sounds like closing the loop, congrats to the team! Boris’s announcement is pretty interesting: https://www.getcensus.com/blog/census-joins-forces-with-five...
educasean•9mo ago
Congrats to both Census and Fivetran. Census has an amazing product and very good people. Excited to see what's coming next from y'all
stalluri•9mo ago
Always wondered FT and Census might come into the each other's territory. Good to see both are merging forces together now!
zoogeny•9mo ago
All of these tools are insanely expensive (from my own experience at companies that have used them). I understand it, since building your own pipeline to handle the kind of throughput analytics takes is expensive and time consuming. Business leaders want the visibility but don't want to redirect dev resources to build and maintain these creaky data pipelines. It is the perfect market of high-value and low tolerance for build (on the build or buy spectrum).

But I am not going to pay $1000/month as a bootstrap startup. What open source alternatives exist that can be run on basic hardware?

loginx•9mo ago
Haven't used it personally, but I would suggest looking into Apache Hudi
zoogeny•9mo ago
Good to know about but looks more like an open source snowflake (e.g. data lake). Fivetran and Census are the in/out process layers that bookend the data lake. Although, Hudi does look like it has some of that functionality baked in.
paxys•9mo ago
A bootstrapped startup needs a MySQL database and a bunch of SELECT queries. Everything else is overkill.
zoogeny•9mo ago
Sure, SQL + something like metabase is a decent starting point (ideally running on a read-only replica). However, there is room to improve over that.

It's like logging. Yeah, there is sentry, papertrail, splunk, datadog and the like. But something better than greping sys logs is nice and totally reasonable for a startup to standup with Kibana/Elastic running on a tiny instance. That can provide significantly higher value.

There is a middle ground between stone tools and jet aircrafts. I was asking: what are the middle ground tools in this space.

themanmaran•9mo ago
Airbyte is probably the best opensource tool in this space.
zoogeny•9mo ago
Cheers, that is what I was thinking must exist but didn't know about.
iflores12•9mo ago
Airbyte gave us more headaches than it was worth. But if you can get it to work for you, it's probably the closest you'll get to Fivetran in the open-source tool space
ssharp•9mo ago
I'm not sure about Census but Fivetran's free plan has met my needs to sync data from different ad platforms to BigQuery pretty well.

One of their pitfalls is charging by the row. If you're cost-conscious, you really need to watch what data you're syncing and you need to pare it down quite a bit during the 2-week period they give you when setting up a new connector. If you do all that though, you can get a lot of mileage out of the free plan for some use cases.

tomrod•9mo ago
Or batch massive rows? JSON structures in-database go a long way...
morkalork•9mo ago
Ok if you're bootstrap it probably doesn't make sense but otherwise fivetran is fantastic for not having to deal with a boatload of third parties constant API updates and changes. If your core competency is something else entirely and not doing ETL, then it's worth paying for so you're not wasting time on doing that ETL work.
zoogeny•9mo ago
Yes, I've used Fivetran at VC funded startups that I worked at and I understand the value of not having to build this piece of common infrastructure. Although we did experience regular (probably once every couple of months) issues with our ETL getting out of sync. We even had to do a full re-sync on a couple of occasions (which to their credit they did for no charge).

As I said, I totally understand this market and why these companies are valuable. I respect the work they do. But while I am a tiny, tiny startup I don't want to lock in to anything and I know I can handle the amount of data myself with little effort if I have a basic open source alternative I can manage myself.

caust1c•9mo ago
Check out redpanda connect / warpstream bento (depending on your license needs). Both came out of what was benthos.

https://github.com/redpanda-data/connect

https://github.com/warpstreamlabs/bento

zoogeny•9mo ago
Interesting, it looks like redpanda is a Kafka replacement and redpanda connect is a Kafka connect replacement but with a supported set of connectors (sources and sinks). I (once upon a time) had to write a Kafka connector myself so I get the general idea.

To be honest, I hadn't really given much thought about what event streaming I would use anyway. So I imagine using redpanda along with redpanda connect could be that layer (I was considering just using Redis streams or even PostgreSQL) and then there is just another redpanda connector for the db to add into that mix. If someone is starting from scratch that might be a good path. But I agree the MIT license of warpstream is a bit nicer if all you need is the connectors.

mritchie712•9mo ago
The best open source options are Airbyte and Meltano / Singer. But it's hard to keep them running. If you self-host them, you'll hit issues at least a few times a month which can each take a few hours to solve.

It's not like running Postgres which "just works". When you self-host Airbyte, you're still building a good bit.

I felt the same way about the cost of data tools. Paying $1,000 for Fivetran, $2,000 for Snowflake, $2,000 for Looker seemed crazy. We bundle all three for $500 / month at https://www.definite.app

fblp•9mo ago
Your comment reads like a pitch but I checked definite and I've been looking for something like this. What we you using it for? Did you evaluate any other Ai analytics tools?
mritchie712•9mo ago
Yes, I've looked at them all. Most AI analytics tools are doing "text to SQL", but writing SQL is a small percentage of data work.

We built an entire stack so the agent can operate across that whole stack (e.g. create pipelines, model data, build reports, etc.)

banditelol•9mo ago
Hi, I've been loking something like this! Any of your custumer has success story migrating off bigquery to your platform? And how do you compare to motherduck? (Looks like you built some of ypur stack on top of duckdb)
mritchie712•9mo ago
Yes, we've had many bigquery / snowflake converts. The reality is, most companies don't have 100tb of data (which is what those platforms are optimized for). Motherduck has a good post[0] on this:

> There were many thousands of customers who paid less than $10 a month for storage, which is half a terabyte. Among customers who were using the service heavily, the median data storage size was much less than 100 GB.

I'm a fan of what motherduck is doing. We're building something different (opinionated, instant data stack), but yes, we both use duckdb under the hood.

0 - https://motherduck.com/blog/big-data-is-dead/

empireofdust•9mo ago
Airbyte’s not an alternative for reverse-etl though. Also your pricing page also says $1k per month.
mritchie712•9mo ago
we have a discount for startups, which would qualify many on HN
doctorpangloss•9mo ago
Palantir's market cap is $274b and they make glorified dashboards. There's just too much money in it to spend cycles doing it for free.
banditelol•9mo ago
I've tried airbyte, sling, and dlt (besides building several tools from scratch)

My best bet for now will be dlt if you have dedicated DE team, but sling will get you a long way for moving data around your warehouse

film42•9mo ago
Congrats to the teams! Like others have said, your pricing ends up killing adoption for my company. We ended up self-hosting Airbyte. It ain't perfect but at least we're not paying $10/GB to replicate data within our own VPC.
tpoacher•9mo ago
Is Fivetran to Fortran like what C++ is to C?
bicx•9mo ago
It’s like what Cantonese is to cantaloupe
r1290•9mo ago
Fivetran has been great. But in this new ai world. Something like dragster + dlt and sling. You can have your own fivetran developed in house. I haven’t dove too much into reverse etl- but it would be awesome to see a dtl like open source tool for reverse etl.
r1290•9mo ago
Fivetran should’ve done this a long time ago. I think that both etl and reverse etl is going open source route. With this ai world we live in now. You just need dagster or temporal - and a few lines of python.
danscan•9mo ago
This page is a great example of why FCP perf is important. It took a scenery long for any content to appear, and I bounced off the page pretty quickly a few times thinking it was down.

I can’t be the only one