frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Forge – Automate 3NF Schema Generation from Nested JSON in BigQuery/Snowflake

1•brady_bastian•1h ago
I've built a product that completely parses highly nested JSON data in cloud data warehouses. Forge works by methodically dissecting each subcollection and each field of Json data, one by one, and creates 3NF tables for each json sub-object. This completely flattens Json data of any complexity and depth and fully accounts for any schema changes in the entire dataset.

While hand crafted scripts work once and ok for a quick look, a systematic deconstruction and rebuild of the entire Json object is required to truly understand the structure. Some companies have Json data coming from MongoDb or Firestore which has undergone hundreds of even thousands of changes from changing data types to abstract manipulations such as changing Json object to array. A simple parsing script won't cut it. You will either sacrifice some data in order to get something out of it or spend weeks writing dozens of scripts and manipulations to correctly process it. Repeat this for each API and each schema that your company utilizes.

Forge doesn't stop at just unnesting. With the included AI schema classifier, Excalibur, we automatically identify which API your data is coming from based upon tens of thousands of examples. From Stripe to hubspot to segment, we detect it, classify it, and automatically apply field mappings. Additionally, Forge uses advanced AI and ML techniques to document and identify PII fields in your data. No more painstaking scrubbing and parsing of your data, just quick and ready analytics.

How does Forge handle schema changes? Automatic detection and adaptation. When new fields appear, Forge regenerates models while maintaining backward compatibility. Zero downtime.

Does my data leave my warehouse? SaaS: Forge connects via service account to process data in-place. Only schema fingerprints (not actual data) sent for AI classification. Enterprise: Everything runs in YOUR VPC. Zero data egress.

What warehouses do you support? BigQuery, Snowflake, Databricks, and Redshift. One parse generates native models for all four simultaneously.

How accurate is PII detection? Pridwen uses a 3-layer hybrid system (rules + ML + crowd) with 95%+ accuracy. Context-aware and supports 20+ languages.

Do you replace Fivetran/Airbyte? No, we're complementary. Use Fivetran/Airbyte to load raw JSON → Use Forge to transform it into analytics tables.

How much engineering time does this save? Conservative estimate: 2-4 weeks initial build + 10 hours/month maintenance = $50,000-100,000/year for mid-size teams.

Comments

redwood•1h ago
Is this a service, a github repo, something else? did you forget to include a link? have you considered a service that does the opposite?

Functional Programming in M4

https://minnie.tuhs.org/pipermail/tuhs/2020-August/022108.html
1•fanf2•35s ago•0 comments

AI makes it easier to build the wrong thing faster

https://newsletter.masilotti.com/p/ai-makes-it-easier-to-build-the-wrong
1•joemasilotti•47s ago•1 comments

Show HN: I built a macOS desktop toy that patrols while you work

https://airwolfspace.com/tinytanks
1•kailuo•1m ago•0 comments

Poison at Play: Unsafe lead levels found in half of New Orleans playgrounds

https://veritenews.org/2026/02/05/poison-at-play-playgrounds-lead-levels/
1•hn_acker•1m ago•0 comments

Unresponsive Buttons on My Fastest Hardware

https://blog.jim-nielsen.com/2026/unresponsive-buttons/
1•speckx•1m ago•0 comments

AI-First Company Memos

https://the-ai-native.company/
1•bobismyuncle•1m ago•0 comments

How to Test ProxySQL Read/Write Split with Sysbench

https://rendiment.io/mysql/proxysql/2026/02/03/sysbench-proxysql.html
1•nethalo•2m ago•0 comments

The singularity won't be gentle – by Nate Silver

https://www.natesilver.net/p/the-singularity-wont-be-gentle
1•rbanffy•3m ago•0 comments

A New Computer Could Replace Electricity with Light

https://www.popularmechanics.com/science/a70223544/computer-could-replace-electricity-with-light/
1•falcor84•4m ago•0 comments

Show HN: Health.md - Apple Health → Markdown

https://healthmd.isolated.tech/
1•codybontecou•4m ago•0 comments

PicoClaw: Ultra-Efficient AI Assistant in Go

https://github.com/sipeed/picoclaw
1•wicket•5m ago•0 comments

AITools.coffee – GitHub metrics observatory tracking 27K+ open-source AI repos

https://aitools.coffee
1•alexela84•5m ago•1 comments

AI Agents 101: From Concept to Code (No Frameworks Required)

https://medium.com/@kamil.tustanowski/ai-agents-101-from-concept-to-code-no-frameworks-required-2...
1•semerkchet•5m ago•0 comments

Databases should contain their own Metadata – Use SQL Everywhere

https://floedb.ai/blog/databases-should-contain-their-own-metadata-instrumentation-in-floe
3•matheusalmeida•6m ago•0 comments

Seeking Order in Chaos

https://garrit.xyz/posts/2026-02-11-on-seeking-order-in-chaos
3•garritfra•6m ago•0 comments

Show HN: Funxy – A typed scripting language that embeds into Go apps

https://github.com/funvibe/funxy
1•funbitty•6m ago•0 comments

The jarring experience of developing today

https://its.beer/thoughts/the-jarring-experience-of-developing-today
1•beerd•7m ago•0 comments

Kiro: DeepSeek, MiniMax, and Qwen now available as open weight model options

https://kiro.dev/changelog/models/deepseek-minimax-and-qwen-now-available-as-open-weight-model-op...
2•siegers•7m ago•0 comments

Terence Tao: Why I Co-Founded SAIR

https://www.youtube.com/watch?v=Z5GKnb4H_bM
1•nyc111•9m ago•0 comments

Maia 200: The AI accelerator built for inference

https://blogs.microsoft.com/blog/2026/01/26/maia-200-the-ai-accelerator-built-for-inference/
1•MarlonPro•12m ago•0 comments

Gravity: Dynamically typed, embeddable programming language written in C

https://www.gravity-lang.org
1•klaussilveira•13m ago•0 comments

Power-User Utility to Recover, Export, Merge, Audit, and Sort Chrome Extensions

https://github.com/ZulfekarAliAgha/REMAS
1•zulali•13m ago•1 comments

Show HN: A compiled programming language for LLM-to-LLM communication [pdf]

https://sifsystemsmcrd.com/KL_White_Paper.pdf
1•tmbird•13m ago•1 comments

Show HN: See what your AI agents do under the hood

https://pingpulsehq.com
1•shafeeq2207•14m ago•0 comments

EPA to repeal its own conclusion that greenhouse gases warm the planet

https://www.nbcnews.com/science/climate-change/epa-to-repeal-endangerment-finding-climate-change-...
8•geox•15m ago•2 comments

Can you trust LastPass in 2026? Inside the quest to rebuild its security culture

https://www.zdnet.com/article/lastpass-2026-rebuilding-trust-ceo-interview/
3•arusahni•19m ago•0 comments

Show HN: Z-Image Base – Fast AI Image Generator (Open-Source, Free Tier)

https://z-imagebase.com/
1•chengai1106•19m ago•0 comments

When the Competition Is Down the Hall

https://k2xl.substack.com/p/when-the-competition-is-down-the
1•k2xl•19m ago•0 comments

The Banality of MAGA Evil

https://paulkrugman.substack.com/p/the-banality-of-maga-evil
5•rbanffy•20m ago•0 comments

Show HN: Onlybots.cam

https://onlybots.cam
1•m0rtyn•20m ago•0 comments