After reading through Andrej Karpathy’s recent texts on LLM wikis, I decided to try and make my health records anonymous, maybe place them on GitHub, and share them. im not fully finished yet (I found there is a personal identifying information leak still in two files) but I’ve checked and re checked the files and it should be good eventually after a manual audit. The files are around 100 medical reports in total (labs, admission notes, results, examination reports) and I will check them by hand in my final PII audit.
Essentially I suspect a particular condition. But I can’t prove it, and doctors where I am won’t / can’t help. Afaik the only known full pathway to diagnosis and treatment might exist in the US. So I’m hoping this either helps me find a doctor or a diagnosis. Or is it will be online as a repository of information in a way of creating a gradual database on this condition (undiagnosed as it may).
Just wondering if anyone with a bit of IT experience would be open to “red teaming” the experiment with, particularly so when looking for Personally Identifiable Information. I’ve already given it a scrub using ChatGPT and Claude but you can’t be too careful. I’ve sanitised all records of names. I’m also curious about direction - I’m just aiming to get it across as a neutral document or documentation.
thank you