Today we're releasing Crovia Spider v1: an open-core forensic tool that digs into existing public AI datasets (2024–2026) for license hints, provenance signals, and compliance holes – no new crawls, no private data touched. Just verifiable clarity on what's already out there.

We ran it on LAION-5B (the backbone of Stable Diffusion, etc.) and found:

Unverified CC-BY 4.0 / 3.0 licenses

Tens of thousands of "unknown" entries

Mixed variants with zero audit trace

First-ever Compliance Score: 14/100 (every model trained on it inherits the risk)

Real receipts (e.g., cid:url_sha256:c7cc5b0acf8330e51ffd1ed02f108e6a9649e13ed3547a14255dad6bdf7f01c5 → cc-by-4.0 unverified).

Why? Most EU AI Act obligations start applying in 2026: models will need reproducible evidence, transparent licensing, and Annex IV technical-documentation bundles. Spider outputs audit packs that plug straight into Crovia Trust (offline Merkle proofs in under 30s). Everything is Apache 2.0 and CLI-ready.
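For readers unfamiliar with Merkle proofs, here is a minimal sketch of the general idea in Python, using receipt IDs as leaves. It is not Crovia Trust's actual scheme: the leaf/node prefixes, the duplicate-last-node padding, and the function names (build_levels, prove, verify) are all assumptions made for illustration.

```python
import hashlib

def _h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def build_levels(leaves):
    """Return every level of the tree, from hashed leaves up to the root."""
    if not leaves:
        raise ValueError("need at least one leaf")
    # Assumed convention: 0x00 prefix for leaves, 0x01 for internal nodes.
    levels = [[_h(b"\x00" + s.encode("utf-8")) for s in leaves]]
    while len(levels[-1]) > 1:
        cur = levels[-1]
        if len(cur) % 2 == 1:
            # Assumed padding rule: duplicate the last node on odd levels.
            cur = cur + [cur[-1]]
            levels[-1] = cur  # keep the padded level so proofs can find siblings
        levels.append([_h(b"\x01" + cur[i] + cur[i + 1])
                       for i in range(0, len(cur), 2)])
    return levels

def prove(levels, index):
    """Collect (sibling_hash, sibling_is_left) pairs from leaf to root."""
    proof = []
    for level in levels[:-1]:
        sibling = index ^ 1
        proof.append((level[sibling], sibling < index))
        index //= 2
    return proof

def verify(leaf, proof, root):
    """Recompute the root from one leaf and its proof; needs no network access."""
    node = _h(b"\x00" + leaf.encode("utf-8"))
    for sibling, sibling_is_left in proof:
        pair = sibling + node if sibling_is_left else node + sibling
        node = _h(b"\x01" + pair)
    return node == root

# The three receipt IDs quoted in this post, used as leaves.
receipts = [
    "cid:url_sha256:c7cc5b0acf8330e51ffd1ed02f108e6a9649e13ed3547a14255dad6bdf7f01c5",
    "cid:url_sha256:267ad746f168458aa6aca730d82dd565ba0dbada0107317d2252d3b60d57fade",
    "cid:url_sha256:8bad9a02f5b4b1e08e19a6417bd6fb03576c80a80deef4f4a1ca868eb9265e71",
]
levels = build_levels(receipts)
root = levels[-1][0]
proof = prove(levels, 2)
print(verify(receipts[2], proof, root))
```

The point of the structure is that a verifier only needs the root, one receipt, and a short list of sibling hashes to check inclusion, which is what makes offline verification practical.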

Reproduce it: run crovia-spider from-laion --output receipts.ndjson on your dataset. Brutal feedback is welcome. Would integrations with HF/FAISS be useful?
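Once you have an NDJSON file, a quick tally of license values is a useful first look. The sketch below assumes each line is a JSON object with a "license" key; that field name (and "cid" in the sample) is a guess for illustration, so check docs/CROVIA_SPIDER_RECEIPT_v1.md for the real schema.

```python
import json
from collections import Counter

def tally_licenses(ndjson_lines, field="license"):
    """Count license values across NDJSON lines (one JSON object per line)."""
    counts = Counter()
    for line in ndjson_lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        record = json.loads(line)
        # "license" is an assumed field name; missing values count as "unknown".
        counts[record.get(field, "unknown")] += 1
    return counts

# Sample lines built from the three receipts quoted in this post;
# the key names are hypothetical.
sample = [
    '{"cid": "cid:url_sha256:c7cc5b0acf8330e51ffd1ed02f108e6a9649e13ed3547a14255dad6bdf7f01c5", "license": "cc-by-4.0"}',
    '{"cid": "cid:url_sha256:267ad746f168458aa6aca730d82dd565ba0dbada0107317d2252d3b60d57fade", "license": "cc-by-sa-3.0"}',
    '{"cid": "cid:url_sha256:8bad9a02f5b4b1e08e19a6417bd6fb03576c80a80deef4f4a1ca868eb9265e71", "license": "unknown"}',
]

for license_name, count in tally_licenses(sample).most_common():
    print(license_name, count)
```

For a real file, pass the open file handle instead of the sample list, e.g. tally_licenses(open("receipts.ndjson", encoding="utf-8")).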

Let's build the governance layer AI deserves.

Repo: https://github.com/croviatrust/crovia-core-engine

(Real receipts extracted via Crovia Spider)

cid:url_sha256:c7cc5b0acf8330e51ffd1ed02f108e6a9649e13ed3547a14255dad6bdf7f01c5

License: cc-by-4.0 (unverified)

cid:url_sha256:267ad746f168458aa6aca730d82dd565ba0dbada0107317d2252d3b60d57fade

License: cc-by-sa-3.0 (unverified)

cid:url_sha256:8bad9a02f5b4b1e08e19a6417bd6fb03576c80a80deef4f4a1ca868eb9265e71

License: unknown

Docs/Spec: docs/CROVIA_SPIDER_RECEIPT_v1.md
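The receipt IDs above follow the pattern cid:url_sha256: followed by 64 hex characters. A minimal Python sketch of how such an ID could be derived and sanity-checked is below. It assumes the digest is SHA-256 over the UTF-8 bytes of the URL string with no normalization; the spec may define it differently, and the function names are hypothetical.

```python
import hashlib

PREFIX = "cid:url_sha256:"

def url_cid(url: str) -> str:
    """Build a receipt-style ID (assumption: SHA-256 of the raw UTF-8 URL)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return PREFIX + digest

def looks_like_url_cid(cid: str) -> bool:
    """Check the shape only: expected prefix plus 64 lowercase hex characters."""
    if not cid.startswith(PREFIX):
        return False
    hex_part = cid[len(PREFIX):]
    return len(hex_part) == 64 and all(c in "0123456789abcdef" for c in hex_part)

# Shape check on a receipt ID quoted in this post.
print(looks_like_url_cid(
    "cid:url_sha256:c7cc5b0acf8330e51ffd1ed02f108e6a9649e13ed3547a14255dad6bdf7f01c5"
))
```

Hashing the URL rather than storing it lets a receipt reference an entry without republishing the link, while anyone holding the original URL can recompute the ID.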

#AIGovernance
