frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Synth Data Studio Open-source synthetic data with differential privacy

https://github.com/Urz1/synthetic-data-studio
1•Sadam_H•2w ago

Comments

Sadam_H•2w ago
Hey HN! I'm a student who built this over the past 5 months. Why I built this:Every project I worked on hit the same wall: I couldn't use real data due to HIPAA/GDPR, public datasets were too generic, and mocking data manually was painful. Existing tools like Gretel or Tonic are enterprise-priced and closed-source.

So I built an open-source alternative that does two things: Schema mode: Define columns and generate upto 1M rows (no training data needed). ML mode: Upload a CSV to train CTGAN/TVAE/Copula and generate high-fidelity synthetic data.

Tech stack: Frontend: Next.js 15, TypeScript, Tailwind

Backend: FastAPI, PostgreSQL, Redis

ML: SDV library (CTGAN, TVAE, GaussianCopula)

Privacy: Differential privacy using $(\epsilon, \delta)$-probabilistic guarantees.

Auth: Better Auth (self-hosted) Deployment: Docker Compose

Hardest technical challenge:Getting differential privacy parameters right. The $\epsilon$ (epsilon) budget directly trades off between privacy and utility. Too strict makes the data useless; too loose causes privacy leaks. I ended up exposing this as a configurable slider with sensible defaults and documentation.

Pricing/Openness:100% MIT licensed (fork it, host it, modify it). Self-host: docker-compose up and you're running. No tracking or data collection on self-hosted instances.

Try it out:Live playground (no signup): https://www.synthdata.studio/playground

GitHub:https://github.com/Urz1/synthetic-data-studio

I’d love to hear your feedback on the architecture, privacy implementation, or what features would make this useful for your workflow!

AI Agent Automates Google Stock Analysis from Financial Reports

https://pardusai.org/view/54c6646b9e273bbe103b76256a91a7f30da624062a8a6eeb16febfe403efd078
1•JasonHEIN•3m ago•0 comments

Voxtral Realtime 4B Pure C Implementation

https://github.com/antirez/voxtral.c
1•andreabat•5m ago•0 comments

I Was Trapped in Chinese Mafia Crypto Slavery [video]

https://www.youtube.com/watch?v=zOcNaWmmn0A
1•mgh2•11m ago•0 comments

U.S. CBP Reported Employee Arrests (FY2020 – FYTD)

https://www.cbp.gov/newsroom/stats/reported-employee-arrests
1•ludicrousdispla•13m ago•0 comments

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
2•vladeta•18m ago•1 comments

Show HN: SVGV – A Real-Time Vector Video Format for Budget Hardware

https://github.com/thealidev/VectorVision-SVGV
1•thealidev•20m ago•0 comments

Study of 150 developers shows AI generated code no harder to maintain long term

https://www.youtube.com/watch?v=b9EbCb5A408
1•lifeisstillgood•20m ago•0 comments

Spotify now requires premium accounts for developer mode API access

https://www.neowin.net/news/spotify-now-requires-premium-accounts-for-developer-mode-api-access/
1•bundie•23m ago•0 comments

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•24m ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
2•birdculture•26m ago•0 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•28m ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
1•ramenbytes•30m ago•0 comments

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•32m ago•0 comments

Ed Zitron: The Hater's Guide to Microsoft

https://bsky.app/profile/edzitron.com/post/3me7ibeym2c2n
2•vintagedave•35m ago•1 comments

UK infants ill after drinking contaminated baby formula of Nestle and Danone

https://www.bbc.com/news/articles/c931rxnwn3lo
1•__natty__•35m ago•0 comments

Show HN: Android-based audio player for seniors – Homer Audio Player

https://homeraudioplayer.app
3•cinusek•36m ago•0 comments

Starter Template for Ory Kratos

https://github.com/Samuelk0nrad/docker-ory
1•samuel_0xK•37m ago•0 comments

LLMs are powerful, but enterprises are deterministic by nature

2•prateekdalal•41m ago•0 comments

Make your iPad 3 a touchscreen for your computer

https://github.com/lemonjesus/ipad-touch-screen
2•0y•46m ago•1 comments

Internationalization and Localization in the Age of Agents

https://myblog.ru/internationalization-and-localization-in-the-age-of-agents
1•xenator•46m ago•0 comments

Building a Custom Clawdbot Workflow to Automate Website Creation

https://seedance2api.org/
1•pekingzcc•49m ago•1 comments

Why the "Taiwan Dome" won't survive a Chinese attack

https://www.lowyinstitute.org/the-interpreter/why-taiwan-dome-won-t-survive-chinese-attack
2•ryan_j_naughton•49m ago•0 comments

Xkcd: Game AIs

https://xkcd.com/1002/
2•ravenical•51m ago•0 comments

Windows 11 is finally killing off legacy printer drivers in 2026

https://www.windowscentral.com/microsoft/windows-11/windows-11-finally-pulls-the-plug-on-legacy-p...
1•ValdikSS•51m ago•0 comments

From Offloading to Engagement (Study on Generative AI)

https://www.mdpi.com/2306-5729/10/11/172
1•boshomi•53m ago•1 comments

AI for People

https://justsitandgrin.im/posts/ai-for-people/
1•dive•54m ago•0 comments

Rome is studded with cannon balls (2022)

https://essenceofrome.com/rome-is-studded-with-cannon-balls
1•thomassmith65•59m ago•0 comments

8-piece tablebase development on Lichess (op1 partial)

https://lichess.org/@/Lichess/blog/op1-partial-8-piece-tablebase-available/1ptPBDpC
2•somethingp•1h ago•0 comments

US to bankroll far-right think tanks in Europe against digital laws

https://www.brusselstimes.com/1957195/us-to-fund-far-right-forces-in-europe-tbtb
5•saubeidl•1h ago•0 comments

Ask HN: Have AI companies replaced their own SaaS usage with agents?

1•tuxpenguine•1h ago•0 comments