frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I scanned 2,500 Hugging Face models for malware/issues. Here is the data

https://github.com/ArseniiBrazhnyk/Veritensor
3•arseniibr•1h ago

Comments

arseniibr•1h ago
Hi HN,

I built a CLI tool called Veritensor for scunning AI models, because I found out that downloading model weights from 3rd party websites and loading them with torch.load() can lead to RCE. At the same time, simple regex scanners are easy to bypass.

To test my tool, I ran it against 2500 new and trending models on Hugging Face.

Here is what I found — 86 failed models: Broken files — 16 models were actually Git LFS text pointers (several hundred bytes), not binaries. If you try to load them, your code crashes. Hidden Licenses — 5 models. I found models with Non-Commercial licenses hidden inside the .safetensors headers, even if the repo looked open source. Shadow Dependencies — 49 models. Many models tried to import libraries I didn't have (like ultralytics or deepspeed). My tool blocked them because I use a strict allowlist of libraries. Suspicious Code — 11 files used STACK_GLOBAL to build function names dynamically. This is a common way how RCE malware hides, though in my case, it was mostly old numpy files. Scan Errors — 5 models failed because of missing local dependencies (like h5py for old Keras files).

I was able to detect some threats because under the hood, Veritensor works differently from common regex scanners. Instead of searching for suspicious text, it simulates how Pickle loads data, which helps it find hidden payloads without running any code. It also checks that the model file is real by hashing it and comparing it with the version from Hugging Face, so fake or changed models can be detected. Veritensor also looks at model metadata in formats like Safetensors and GGUF to spot license restrictions. If everything looks safe, it can sign the container using Sigstore Cosign.

It supports PyTorch, Keras, and GGUF. Free to use — Apache 2.0.

Repo: https://github.com/ArseniiBrazhnyk/Veritensor Data of the scan [CSV/JSON]: https://drive.google.com/drive/folders/1G-Bq063zk8szx9fAQ3NN... PyPI: pip install veritensor

Let me know if you have any feedback, have you ever faced similar threats and whether this tool could be useful for you.

patrakov•1h ago
The single --force flag is not a good design decision. Please break it up (EDIT: I see you already did it partially in veritensor.yaml). Right now, according to the description, it suppresses detection of both genuinely non-commercial/AGPL models and models with inconsistent licensing data. Also, I might accept AGPL but not CC-BY-NC.

Probably, it would be better to split it into --accept-model-license=AGPL --accept-inconsistent-licensing --ignore-layer-license-metadata --ignore-rce-vector=os.system and so on.

Sage: AI-powered Git commit message and branch name generator

https://github.com/thanipro/sage
1•FPurchess•1m ago•1 comments

Let AI catalog your house for insurance

https://mattsayar.com/let-ai-catalog-your-house-for-insurance/
1•MattSayar•1m ago•0 comments

Show HN: FeedOwn – Self-hosted RSS reader running on free tiers ($0/month)

https://github.com/kiyohken2000/feedown
1•kiyohken2000•2m ago•0 comments

Claude's New Constitution

https://www.anthropic.com/news/claude-new-constitution
1•meetpateltech•3m ago•0 comments

chrome://crash is the best home page

https://blog.thomasorlita.com/chrome-crash-home-page/
1•thomascz•3m ago•0 comments

Anthropic's CEO stuns Davos with Nvidia criticism

https://techcrunch.com/2026/01/20/anthropics-ceo-stuns-davos-with-nvidia-criticism/
1•pseudolus•6m ago•1 comments

Book Notes – What Is Existentialism

https://arpitbhayani.me/blogs/book-notes-what-is-existentialism/
1•vbanurag•7m ago•0 comments

DOGE improperly shared sensitive social security data, DOJ court filing reveals

https://www.theguardian.com/us-news/2026/jan/21/doge-social-security-data
5•GuinansEyebrows•8m ago•1 comments

Show HN: iOS app I made to track my anxiety

https://mudoapp.com
1•adictonator•9m ago•0 comments

Designing a Programming Language for the Desert

https://futhark-lang.org/blog/2018-06-18-designing-a-programming-language-for-the-desert.html
2•pcfwik•10m ago•0 comments

Show HN: I built a chess explorer that explains strategy instead of just stats

https://www.atlaschess.me/
1•Ahmad_shuja•11m ago•0 comments

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model

https://huggingface.co/FlashLabs/Chroma-4B
1•pretext•11m ago•0 comments

Architecture as Data Is the Missing Unlock for Generative Code

https://spynejs.com/blog/frontend-architecture-has-met-its-reasoning-moment#architecture-as-data
1•nybatista•11m ago•0 comments

Show HN: A multiplayer browser-based RPG

https://delvethedepths.online/
1•xazzzzzzz•14m ago•0 comments

Show HN: SpeechOS – Wispr Flow-inspired voice input for any web app

https://www.speechos.ai/
1•gangster_dave•15m ago•0 comments

Ask HN: Are you going to meetups/conferences?

1•carimura•15m ago•1 comments

BBC announces landmark deal to make bespoke content for YouTube

https://www.theguardian.com/media/2026/jan/21/bbc-announces-landmark-deal-to-make-bespoke-content...
1•bookofjoe•15m ago•0 comments

Everyone Deserves a Better Computer

https://www.aheadcomputing.com/blog/everyone-deserves-a-better-computer
1•turoczy•16m ago•0 comments

Winter Storm to Target over 180M from Texas to New England

https://weather.com/storms/winter/news/2026-01-21-winter-storm-fern-ice-snow-forecast-south-north...
1•washedup•17m ago•0 comments

Show HN: DynamoLens – native, open-source DynamoDB desktop client (Go and Wails)

https://github.com/rasjonell/dynamo-lens
1•rasjonell•18m ago•0 comments

Show HN: Loomind – Local-first workspace with RAG and eternal memory

https://loomind.me/
1•redhotcookerr•18m ago•1 comments

Show HN: Neboo – A minimalist digital circuit breaker for doomscrolling

https://www.neboo.me/
1•jaylisches•19m ago•0 comments

The Work of Art in the Age of Mechanical Reproduction

https://en.wikipedia.org/wiki/The_Work_of_Art_in_the_Age_of_Mechanical_Reproduction
1•tokai•19m ago•0 comments

A Centralised Approach to AI / LLM Agent Instruction Using Git Submodules

https://www.appsoftware.com/blog/a-centralised-approach-to-ai-llm-agent-instruction-using-git-sub...
1•appsoftware•19m ago•1 comments

Show HN: Semantic map and analysis of Hacker News stories in 2025

https://lincolnmaxwell.com/p/clustering-hackernews-2025/
1•youngermax•20m ago•0 comments

Can Everyone Please Stop Being Stupid About Who Pays Tariffs?

https://benthams.substack.com/p/can-everyone-please-stop-being-stupid
2•yoble•21m ago•1 comments

Nintendo demanded Sega to put Mario's foot in front of Sonic's at Olympic Games

https://nintendoeverything.com/nintendo-demanded-sega-to-put-marios-foot-in-front-of-sonics-for-m...
2•randycupertino•21m ago•0 comments

Show HN: Shared Device Group – GPU sharing at the Kubernetes scheduler level

https://github.com/sceneryback/shared-device-group
1•ChuangLabs•22m ago•1 comments

Psilocybin could treat depression via a non-hallucinogenic receptor

https://medicalxpress.com/news/2026-01-psilocybin-depression-hallucinogenic-receptor.html
1•pseudolus•23m ago•0 comments

Rho-alpha (ρα) Microsoft first robotics model derived from Phi

https://www.microsoft.com/en-us/research/story/advancing-ai-for-the-physical-world/
2•robotswantdata•24m ago•0 comments