Show HN: We're inviting Anthropic to put the real Mythos 5 on our open benchmark

https://realvuln.com

3•jfaganel99•1h ago

Comments

jfaganel99•1h ago

Question, because I can't answer it myself...

I created an open-source benchmark for code security scanners and ran a number of them, along with LLMs, on real vulnerable code. Fable 5 is also on there as of yesterday, but that's the gated public model. The one we all want to see is Mythos 5, and it's locked to a handful of vetted orgs.

So, does anyone here have access to Mythos 5 and could run it against the benchmark?

Would genuinely like to see what it scores and at what cost.

jfaganel99•24m ago

For the sceptics... The benchmark is research-based, with a published arXiv paper on the methodology:

https://arxiv.org/abs/2604.13764

Cerberus, an Open-Source USB protection device

https://github.com/Lab217MX/Cerberus-A-USB-Watchdog

1•glitchboi•1m ago•0 comments

Why the 2026 World Cup Ball Has Deeper Seams

http://liveatthewitchtrials.blogspot.com/2026/06/the-2026-world-cup-football-is-big.html

1•speckx•1m ago•0 comments

What the Fuck Happened to Nerds

https://mrmarket.bearblog.dev/what-the-fuck-happened-to-nerds/

1•mrmarket•2m ago•0 comments

Philtrum – It Started with a Prompt

https://philtrum.app/

1•pencilcheck•4m ago•1 comment

The Token Value of $200/mo Plans

https://twitter.com/SemiAnalysis_/status/2064815044085318040

1•thedebuglife•4m ago•0 comments

The Token Value of $200/mo Plans

https://link.mail.beehiiv.com/ss/c/u001.LDkxbMa7NCxUGG7E2Yh3ABiuUAE5LTRLvOwLxg7TbRtWwRuK02qKlX8wK...

1•thedebuglife•4m ago•0 comments

AI is about to get fast, and it's never going to slow down

https://medium.com/@NMitchem/ai-is-about-to-get-fast-and-its-never-going-to-slow-down-78e13e794375

2•Mitchem•4m ago•0 comments

Bio input based, instead of vision based, physical AI for industrial bio

https://diggest.substack.com/p/creating-a-benchmark-for-physical

1•digvijay0401•5m ago•0 comments

Merman: headless Mermaid.js in Rust

https://github.com/Latias94/merman

1•nateb2022•5m ago•0 comments

Forget Zune. Forget Vista. Copilot Is Microsoft's Biggest Failure

https://www.youtube.com/watch?v=ER0jRB3nhK4

3•valeg•7m ago•0 comments

Understanding the rationale behind a rule when trying to circumvent it

https://devblogs.microsoft.com/oldnewthing/20260611-00/?p=112415

1•ibobev•7m ago•0 comments

Why do you say that a COM STA thread must pump messages?

https://devblogs.microsoft.com/oldnewthing/20260522-00/?p=112348

1•ibobev•8m ago•0 comments

Quantity leads to quality (the origin of a parable) (2020)

https://austinkleon.com/2020/12/10/quantity-leads-to-quality-the-origin-of-a-parable/

1•crescit_eundo•8m ago•0 comments

Learning to be a Tech Lead (2024)

https://miryeh.medium.com/learning-to-be-a-tech-lead-e22a0b4f01d5

1•mooreds•8m ago•0 comments

The tanks in Cushing, Oklahoma, are hitting bottom

https://www.cnn.com/2026/06/12/business/cushing-oil-inventory

4•mooreds•10m ago•0 comments

Why Artists Are Running Their Own Data Centers

https://southpole.blog/artists-running-their-own-data-centers/

1•berlianta•11m ago•0 comments

Can smartphones help explain the drop in birth rates?

https://text.npr.org/nx-s1-5851795

1•mooreds•11m ago•0 comments

India says it is working to stop water flowing into Pakistan

https://www.channelnewsasia.com/asia/india-pakistan-conflict-water-treaty-disagreement-6173811

1•vrganj•12m ago•0 comments

Verizon sent man a refurbished phone with MDM, then deleted his data remotely

https://arstechnica.com/tech-policy/2026/06/verizon-sent-man-a-refurbished-phone-with-mdm-then-de...

3•Brajeshwar•12m ago•0 comments

Amazon.ca is down – everything is out of stock

https://www.amazon.ca/Decker-CBG110SC-Electric-Smartgrind-Grinder/dp/B07SZ9FFT9/ref=lp_2224068011...

1•Callicles•12m ago•0 comments

When should we expect to meet aliens?

https://aliens.fyi

3•avhwl•13m ago•1 comment

Solving a chess puzzle with Claude and Prolog

https://www.johndcook.com/blog/2026/06/11/prolog-claude/

2•ibobev•14m ago•0 comments

Author Jane Yolen, 87, died. Writer of fantasy, sci-fi, and children's books

https://locusmag.com/2026/06/jane-yolen-1939-2026/

1•speckx•14m ago•0 comments

Agentic-Engineering-Handbook

https://github.com/keyuchen21/agentic-engineering-handbook

2•keyuchen2020•15m ago•0 comments

Nvidia Is Developing an AI Healthcare Model with Startup Abridge

https://www.wsj.com/cio-journal/nvidia-is-developing-an-ai-healthcare-model-with-startup-abridge-...

1•bookofjoe•16m ago•1 comment

Vykar is a fast, encrypted, deduplicated backup tool written in Rust

https://vykar.borgbase.com

2•delduca•17m ago•0 comments

Google Sues to Stop Chinese Cybercrime Group from Using Its A.I

https://www.nytimes.com/2026/06/12/technology/google-lawsuit-china-ai-scams.html

1•ChrisArchitect•18m ago•1 comment

Open Knowledge Format

https://cloud.google.com/blog/products/data-analytics/how-the-open-knowledge-format-can-improve-d...

1•berlianta•18m ago•0 comments

Show HN: Memoriq – Private AI Memory for ChatGPT, Claude, Gemini and Grok

https://memoriq.me/

2•giekaton•22m ago•0 comments

WASI 0.3.0 Released

https://github.com/WebAssembly/WASI/releases/tag/v0.3.0

5•mavdol04•23m ago•0 comments