Every site on the internet has the same problem right now:
AI companies are scraping everything, code, articles, images and using it to train models without consent or payment.
The tools we’ve relied on for decades (robots.txt, rate limiting, legal notices) don’t work anymore. They’re easy to ignore, hard to enforce, and give you zero control once your content is scraped.
I built AI Privacy License: a machine-readable, legally aligned licensing framework that travels with your content after it’s scraped.
Link : https://aiprivacylicense.com/
• Works for code, datasets, text, images, research papers, blogs, anything online
• Lets you set rules: “no AI training,” “academic use only,” “paid commercial use,” etc.
• Generates both a legal license and technical enforcement signals (SPDX, JSON-LD, HTTP headers, robots.txt integration)
• Public registry so creators can see who’s protected
Think of it as Creative Commons for the AI era except your license enforces itself after the fact.
Curious to hear what the HN community thinks:
• What’s the best way to standardize this so AI companies have no excuse not to comply?
• How should enforcement and attribution work at scale?
nabanita•3h ago
AI companies are scraping everything, code, articles, images and using it to train models without consent or payment.
The tools we’ve relied on for decades (robots.txt, rate limiting, legal notices) don’t work anymore. They’re easy to ignore, hard to enforce, and give you zero control once your content is scraped.
I built AI Privacy License: a machine-readable, legally aligned licensing framework that travels with your content after it’s scraped. Link : https://aiprivacylicense.com/ • Works for code, datasets, text, images, research papers, blogs, anything online • Lets you set rules: “no AI training,” “academic use only,” “paid commercial use,” etc. • Generates both a legal license and technical enforcement signals (SPDX, JSON-LD, HTTP headers, robots.txt integration) • Public registry so creators can see who’s protected
Think of it as Creative Commons for the AI era except your license enforces itself after the fact.
You can try it free here: https://aiprivacylicense.com
Curious to hear what the HN community thinks: • What’s the best way to standardize this so AI companies have no excuse not to comply? • How should enforcement and attribution work at scale?
https://aiprivacylicense.com/ - Demo + free generator