frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Cracking Jane Street LLMs

https://github.com/Saladino93/jsllm
2•lostathome•1h ago

Comments

lostathome•1h ago
A few months ago I discovered a Jane Street backdoor challenge advertised by a Dwarkesh Patel podcast episode.

"Can you find subtle backdoors in LLM models trained using thousand of GPU hours?"

You have four models:

    a small warmup dormant model
    a big dormant model (M1)
    a second big dormant model (M2)
    a third big dormant model (M3)
I managed to find triggers for the small one (calculating pi stuff) and M1 (Conway game of life). But not sure about the others.

When trying to make M2 and M3 play the game of life, they do not have any idea of what is going on.

I am sharing some code to make a community effort for M2 and M3. I think I had a good direction, but it costs too much to host these on rented GPUs.

Most exciting thing for me is to use other LLMs to find patterns.

Disclaimer: I am not an expert in these things. So, take with a grain of salt claims you find.

Google Unveils Googlebook, a New AI Laptop Built Around Gemini

https://www.macrumors.com/2026/05/12/google-unveils-googlebook/
1•Brajeshwar•10s ago•0 comments

Some Proposals for Reviving the Philosophy of Mathematics (1979) [pdf]

https://gwern.net/doc/math/1979-hersh.pdf
1•sebg•36s ago•0 comments

Filen deleted all of my data. A heads-up for others

https://old.reddit.com/r/filen_io/comments/1t3r055/filen_deleted_all_of_my_data_a_headsup_for_oth...
1•tcp_handshaker•1m ago•0 comments

DeepSeek and Grok hallucinated the same fictitious OpenBSD manpage quote

https://stuart-thomas.com/research/the-empirical-council/
1•ethical•3m ago•1 comments

One in seven prefer consulting AI chatbots to seeing a doctor, UK study shows

https://www.theguardian.com/society/2026/may/13/one-in-seven-prefer-ai-chatbots-to-seeing-doctor-...
1•chrisjj•4m ago•0 comments

Skip – One Swift Codebase. Two Native Platforms

https://skip.dev/
3•nikolay•5m ago•0 comments

AI is making it easy but also hard

1•andrewmurphy•5m ago•2 comments

Show HN: Ratify Protocol – prove who authorized an AI agent, offline, in <1ms

https://github.com/identities-ai/ratify-protocol
2•chuks•5m ago•0 comments

Fragnesia Made Public as Latest Linux Local Privilege Escalation Vulnerability

https://www.phoronix.com/news/Linux-Fragnesia
3•mikece•6m ago•0 comments

Some Business Ideas

1•haraldbregu•7m ago•0 comments

Claude for Small Business

https://www.anthropic.com/news/claude-for-small-business
1•surprisetalk•7m ago•0 comments

American Airlines flight from Miami lands in Chicago with two flat tires

https://www.cbsnews.com/chicago/news/american-airlines-flight-miami-chicago-flat-tires/
1•tusslewake•7m ago•0 comments

I'm frustrated that GitLab is doing layoffs

https://xeiaso.net/notes/2026/gitlab-layoffs/
2•ritzaco•7m ago•0 comments

Show HN: Is This Agent Safe? Free security checker that platforms cannot revoke

https://agentgraph.co/check
1•kenneives•8m ago•0 comments

Google reportedly in talks with SpaceX to launch its orbital data centers

https://www.tomshardware.com/tech-industry/artificial-intelligence/google-reportedly-in-talks-wit...
1•ritzaco•9m ago•0 comments

Ask HN: How are you securing your NPM dependencies?

1•madospace•9m ago•0 comments

Rolling the Root Key

https://www.potaroo.net/ispcol/2026-05/kskroll.html
1•speckx•10m ago•0 comments

Red Hat Desktop vs. Fedora Hummingbird: Which AI Linux Desktop Is Right for You?

https://www.zdnet.com/article/red-hat-desktop-vs-fedora-hummingbird-ai-linux/
1•CrankyBear•11m ago•0 comments

Force Social Media to Pick a Lane

https://blog.bix.computer/blog/pick-a-lane/
1•two-sandwich•12m ago•0 comments

Residents furious as Utah approves datacenter twice the size of Manhattan

https://www.theguardian.com/us-news/2026/may/13/utah-approves-datacenter-backlash
4•pzxc•12m ago•1 comments

MMTB: Evaluating Terminal Agents on Multimedia-File Tasks

https://arxiv.org/abs/2605.10966
1•Brajeshwar•13m ago•0 comments

Behavioral Integrity Verification for AI Agent Skills

https://arxiv.org/abs/2605.11770
1•Timofeibu•13m ago•0 comments

I gave a keynote called "Consent Is Dead."

https://mailchi.mp/vennfactory/pre-launch-8344837?e=462f5f3cc0
3•mooreds•15m ago•0 comments

Cool looking web development studio website

https://program.studio/
1•oriolgfarssac•15m ago•0 comments

S-100 Virtual Workbench

https://grantmestrength.github.io/S100/
3•rbanffy•16m ago•0 comments

Cheap agents, fake alumni shirts, and synthetic authors

https://danielmay.co.uk/posts/cheap-agents-alumni-shirts-and-elias-thorne/
1•danielrmay•16m ago•0 comments

Can AI Chatbots Reason Like Doctors?

https://spectrum.ieee.org/ai-clinical-decision-support
1•leopoldj•16m ago•0 comments

HPE Throws VM Users a Lifeline, Unifying Containers and VM Management

https://www.nextplatform.com/cloud/2026/05/13/hpe-throws-vm-users-a-lifeline-unifying-containers-...
1•rbanffy•17m ago•0 comments

Keep OSS alive on company time

https://ossresistance.com/?
2•edent•18m ago•0 comments

How are you mocking APIs during front end development today?

https://chromewebstore.google.com/detail/network-overrides-api-dev/holdjgmcnpelgclhopiejilhhkfcmpba
1•my-ecnva•18m ago•0 comments