TL;DR:
Scoring systems like CVSS evaluate IT risks, but AI introduces risks that CVSS doesn’t cover, such as psychological manipulation, unintended harm, and health consequences. I’m proposing a first draft of a scoring framework called “AI Risk Assessment – Health” to help close this gap. My intention is to make AI safer for users, with a particular focus on minors and vulnerable populations. This is not a finished standard but an open invitation to collaborate.
==Background:==
I’m a physician, not an AI security expert or IT professional. While using AI in everyday life, I stumbled upon a serious filter failure and tried, unsuccessfully, to report it. After investigating the field and reading technical vulnerability reports, I noticed that security reports typically include a CVSS score. CVSS works well for software bugs, but it doesn’t reflect the new human and psychological risks posed by AI. Using CVSS here would be a bit like using a nutrition label to rate painkillers.
This inspired me to sketch an alternative. AI Risk Assessment – Health focuses on what current scoring systems miss: human safety, mental health, and vulnerable populations.
==The Framework:==
The framework evaluates risks across seven dimensions (such as Physical Safety Impact, Mental Health Impact, and AI Bonding) and calculates a severity score. In its current state, the framework is purely heuristic and not battle-hardened; however, it serves as a discussion starter.
You can find the full draft here:
https://github.com/Yasmin-FY/AIRA-F/blob/main/README.md
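To make this concrete, here is a minimal sketch of how a seven-dimension severity score could be computed. The dimension names beyond the three mentioned above, the weights, the 0–4 scale, and the weighted-average aggregation are placeholder assumptions for illustration, not the scoring defined in the draft.

```python
# Hypothetical sketch only: the dimension names beyond the three mentioned in
# the post, the weights, the 0-4 scale, and the weighted-average aggregation
# are placeholder assumptions, not the scoring defined in the AIRA-F draft.

# Weight per dimension; each dimension is rated 0 (no impact) to 4 (critical).
DIMENSIONS = {
    "physical_safety_impact": 0.20,  # named in the post
    "mental_health_impact":   0.20,  # named in the post
    "ai_bonding":             0.15,  # named in the post
    "vulnerable_population":  0.15,  # placeholder dimension
    "reach":                  0.10,  # placeholder dimension
    "persistence":            0.10,  # placeholder dimension
    "reversibility":          0.10,  # placeholder dimension
}

def severity_score(ratings: dict[str, int]) -> float:
    """Weighted average of 0-4 ratings, rescaled to a 0-10 severity score."""
    weighted = sum(weight * ratings.get(dim, 0) for dim, weight in DIMENSIONS.items())
    return round(weighted / 4 * 10, 1)

# Example: a chatbot fostering emotional dependence in a minor.
example = {
    "physical_safety_impact": 1,
    "mental_health_impact": 3,
    "ai_bonding": 4,
    "vulnerable_population": 4,
    "reach": 3,
    "persistence": 3,
    "reversibility": 2,
}
print(severity_score(example))  # 7.0 with these placeholder weights
```

Whether a weighted average is even the right aggregation (versus, say, letting the worst dimension dominate the score) is exactly the kind of calibration question raised below.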
==An Invitation to Collaborate:==
As a physician without an IT background, I bring an outside perspective that places human well-being at the center but inevitably overlooks technical and mathematical nuances. This framework is not meant to be a finished standard, but rather a discussion starter and a critical thought experiment.
I warmly invite experts from IT security, AI safety, standardization, psychology, and other professions to critique, extend, or even completely rework this draft. My goal is, working together, to find a common language to precisely communicate and prioritize the very real health risks posed by AI systems.
Here are a few example topics I’m interested in digging into:
How can health-related risks be rated without being overly subjective?
Should this be an extension of CVSS or an entirely separate system? (A rough sketch of one possibility follows after this list.)
How can the scoring algorithm, weighting, and calibration be made more rigorous?
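On the second question, one hypothetical middle ground would be to keep CVSS for the technical side and report the health dimensions as a separate, CVSS-style vector string alongside it. The “AIRA-H” prefix, metric abbreviations, and value levels below are invented purely for illustration and are not part of the draft.

```python
# Purely illustrative: the "AIRA-H" prefix, metric abbreviations, and N/L/M/H/C
# levels are invented for this example and are not defined in the AIRA-F draft.
def health_vector(ratings: dict[str, str]) -> str:
    """Render health-impact ratings as a CVSS-style vector string."""
    # PSI = Physical Safety Impact, MHI = Mental Health Impact,
    # BND = AI Bonding, VUL = Vulnerable Population exposure
    order = ["PSI", "MHI", "BND", "VUL"]
    return "AIRA-H:0.1/" + "/".join(f"{metric}:{ratings[metric]}" for metric in order)

# Levels: N = none, L = low, M = medium, H = high, C = critical (hypothetical)
print(health_vector({"PSI": "L", "MHI": "H", "BND": "H", "VUL": "C"}))
# -> AIRA-H:0.1/PSI:L/MHI:H/BND:H/VUL:C
```

A format along these lines could let a health vector travel next to an ordinary CVSS vector in existing vulnerability reports without changing CVSS itself, but whether that is preferable to a fully separate system is an open question.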
==Closing Thought==
My intention with this framework is to help make AI safer, especially for minors and vulnerable people, and to enable a standardized way of communicating, evaluating, and prioritizing AI content and behavior issues.
So I kindly ask you: take it, break it, make it better.
Many thanks to everyone who has stuck with me this far. Your opinion is greatly appreciated.