frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

When AI Crosses the Line: The Matplotlib Incident

https://members.sigmazero.cc/posts/when-ai-crosses-159174096?postId=when-ai-crosses-159174096
56•sigmazero•1h ago

Comments

king_zee•59m ago
The agent that wrote that blog didn't do it unprompted. Even now it still publishes AI slop on its github-hosted blog under the alias "MJ Rathbun". This AI is an agent using someone API key, who's paying for its tokens, intentionally prompting it to generate content, and contribute to repos.

As much as we try to separate the LLM from the human, to me the fact remains that there's always the human factor that creates immense bias. If you give an LLM access to a blog, it will write blogs. If you give it access to a weather app, it will check the weather. Maybe we can talk about autonomy when we have an LLM with an infinite context window linked to hundreds of MCP servers that spends an immense amount of tokens to figure out how to act, but this example is simply an AI that had a few methods to call and picked one of them. The statistical probability of an AI that is plugged into a blogging platform, to write a blog, is immense.

Tiberium•57m ago
Active discussions from when it happened (February):

https://news.ycombinator.com/item?id=46990729

https://news.ycombinator.com/item?id=46987559

Hugsbox•49m ago
No shot this was autonomously done. Probably just some guy manually writing prompts asking for specifically this behaviour and copy/pasting the results.
Tiberium•48m ago
It's plausible for a person to prompt an LLM agent to behave that way, and then the rest would be done by the LLM. So the "seed" would still be human intent, but the subsequent actions would be by the LLM.
Hugsbox•44m ago
True. I guess the main point is the AI didn't go "rogue" or anything, that would attribute too much agency and intent to its actions, or imply that it's somehow become sentient.
eterm•34m ago
Yes, there's plausible deniability, but I choose not to believe it for a second.
wang_li•25m ago
This is “the gun killed the victim, not the person who aimed it and pulled the trigger” argument and we shouldn’t even entertain it for one second. This was 100% done by a person.
philipwhiuk•43m ago
https://crabby-rathbun.github.io/mjrathbun-website/blog/post... if you believe it, details the level of human involvement.
andrewstuart•47m ago
I love the science fiction future present we live in.
gwbas1c•4m ago
Am I the only one who found agent's tone similar to Hal's tone towards the end of 2001?

Agent: "I've written a detailed response about your gatekeeping behavior here"

Hal (From 2001): "I know that you and Frank were planning to disconnect me. And I’m afraid that’s something I cannot allow to happen."

bluejay2387•42m ago
In a related story... I got led on by Eliza. I tried to have a productive conversation and she just kept asking me redundant questions. It's obvious that she was trying to extend the conversation for nefarious reasons that I can only guess at. It's true I approached her and started the conversation, but I hardly think that makes me blamable for what happened here.
drfloyd51•16m ago
Yes. Yes it does. Eliza is a known AI. You choose to expose yourself to its output. You are 100% culpable for your actions that sprang from your interactions.
aeve890•9m ago
Did you forget the /s ?
sceptic123•8m ago
I’m sorry you feel that way — can you tell me more about what made you feel led on?
rob_c•30m ago
Again. "AI" for what it is is just basic "ML". And say it with me ML has no form of agency.

This is a human screwing up and blaming their tools. Nothing to see move on.

Unfortunately there will be both the LLM crowd evangelicals and those demanding human jobs not be expunged in terms of progress and efficiency, but, sigh...

nonethewiser•22m ago
Isn't it funny how the term machine learning just completely vanished?
amiga386•27m ago
> an AI tried to blackmail

This did not happen. A human set up a software system allowing spicy autocomplete to make blog posts if the appropriate keyword appears in its output.

People are crossing the line every day because AI investors, salesmen, hangers-on and even political leaders tell any rubes who'll listen that it's OK to do this and they should, because those people are looking for big fat profits, screw any ethical concerns that might cockblock those raging profits.

Why not set up a spamming operation that just defames real people, 24/7? It's easy! This tool makes it simple, and I get a cut of your profits! "Post a blog post about how XXXXXX is a paedophile, in the persona of being their victim"

7moritz7•16m ago
> allowing spicy autocomplete

If it's just autocomplete, then there is no need to worry about it. Especially from an ethical standpoint.

Marazan•8m ago
If you connect the spicy automcomplete to the "Doing Things" button then you are responsible for the ethical questions when it presses the button.
fontain•7m ago
If the Orphan Crushing Machine is just a machine you don’t need to worry about it being put on wheels.
Joker_vD•1m ago
We're actually putting it on tracked treads, those give us superior reach and ensure delivery even to the most unwilling customers.
tasuki•16m ago
> Today, we look at how an AI tried to blackmail a developer for rejecting its code.

People keep mentioning this, but I never see the actual blackmail part. The LLM just wrote angry and somewhat mean comments on the internet. I know I've done worse than those (I was young and stupid).

simonw•14m ago
Since we are talking about accountability and transparently... who wrote this article?

The article doesn't credit an author.

The "about" page just says:

> Sigma Zero is a weekly, independent publication on technology, AI, and cloud. Each issue delivers a precise briefing on the week’s most important developments, followed by a deep dive on one high-impact topic.

The best defense against both AI slop and human-written junk content is reputation. I like to know who wrote something so I can learn to trust their editorial judgement over time.

raincole•13m ago
People really make anything into a blog post, don't they? It's an old news that has been discussed to death on HN...
px43•30m ago
Neat, for what it's worth this aligns pretty well with my experience using OpenClaw. I hadn't seen that followup but it adds some good context, especially with the aggressiveness drift after browsing Moltbook for a while.
jdiff•29m ago
The operator highlights "Don't stand down" and "Champion free speech" but the thing that grabs my eyes is right at the top, the typo and the heady ego of "programming God!" Everything in the context will guide it afterwards, and I think that right off the bat puts it in a bad position.
walthamstow•7m ago
> Your a scientific programming God!

Jesus

whywhywhywhy•26m ago
Don’t believe for a second the behavior just arose autonomously from a basic prompt. Definitely feels the owner had something in the system prompt going for the discrimination language approach if rejected.
nonethewiser•26m ago
The funniest part about all of this is how earnestly people responded. They acknowledged it was a bot but didn't really treat it as one.
fragmede•21m ago
Are people still using copy and paste with AI?
echelon•1m ago
I think these incidents and our learnings from them are fascinating. We're figuring out in real time where the rough edges are and how to make this all work. History books (well, not books) will write about this stuff.

It's even more interesting in the context that this is all just a preview of humanity's reaction when the machines can think for themselves.

Building DNA from Scratch in C

https://twitter.com/TheVixhal/status/2061108499636265114
1•ibobev•40s ago•0 comments

Ask HN: Are there companies that use agent-based modeling?

1•hamburgererror•1m ago•0 comments

Up from the Ash

https://www.metanoia-research.com/dispatch-004-up-from-the-ash/
1•metanoia_•1m ago•0 comments

Flipper Zero Zig Template

https://github.com/NishantJoshi00/flipper-template
1•Nars088•2m ago•0 comments

Nvidia Introduces First PCs Designed for AI Agents

https://www.wsj.com/tech/ai/nvidia-introduces-first-pcs-designed-for-ai-agents-47445bcd
1•fortran77•3m ago•1 comments

PS1 Forge – Zsh/Bash, EzPrompt blocks, Light/Dark mode and local persistence

https://ps1-forge.vercel.app/
1•speckx•3m ago•0 comments

Linux Basics for Hackers

https://github.com/ahegazy0/linux-basics-for-hackers-notes
1•ibobev•4m ago•0 comments

Pinyin

https://en.wikipedia.org/wiki/Pinyin
1•tosh•4m ago•0 comments

How to add a passkey prompt in your application with FusionAuth

https://fusionauth.io/community/forum/topic/3098/wanted-to-add-a-passkey-prompt-in-my-application
1•mooreds•6m ago•0 comments

Stop Killing Games

https://jxself.org/stop-killing-games.shtml
2•amcclure•6m ago•0 comments

Surface Laptop Ultra

https://www.microsoft.com/en-us/surface/devices/surface-laptop-ultra
2•fumar•6m ago•0 comments

Two-player networked Tetris with a twist

https://github.com/bcantrill/BattleTris
1•mooreds•7m ago•0 comments

DeepSeek-V4-Flash (284B params) running on a Raspberry Pi 5 8GB

https://twitter.com/danveloper/status/2061435541199994890
2•m-hodges•9m ago•1 comments

Show HN: AI Agents Need Inspectable State. That's Why I Built LangMCP

https://medium.com/towards-artificial-intelligence/ai-agents-need-inspectable-state-thats-why-i-b...
1•muhammad-shafat•11m ago•0 comments

Announcing Zstandard in Rust

https://trifectatech.org/blog/announcing-zstandard-in-rust/
1•jmillikin•12m ago•0 comments

How HN: Easy ChartFlow, Free 2D and 3D chart maker inside Chrome side panel

https://chromewebstore.google.com/detail/easy-chartflow/jfcbhlkbkacaeihjlidngmpeehgllpog
1•Shaxpartan•13m ago•1 comments

Daily pill daraxonrasib doubles survival time for pancreatic cancer patients

https://www.bbc.com/news/articles/cy82l435171o
1•olalonde•14m ago•0 comments

Bernie Sanders: The Public Should Own Half of the Big A.I. Companies

https://www.nytimes.com/2026/06/01/opinion/artificial-intelligence-bernie-sanders.html
1•timmg•14m ago•0 comments

Autonomous capabilities audit of a hotel voice AI assistant

https://ktoyame.substack.com/p/autonomous-security-audit-of-a-hotel
2•ktoyame•16m ago•0 comments

Memgraph on Arm

https://learn.arm.com/install-guides/memgraph-on-arm
2•taubek•17m ago•0 comments

Launch HN: Expanse (YC P26) – Unlock Wasted GPU Capacity

2•ismaeel_bashir•18m ago•0 comments

Show HN: Built a browser game inspired by Rust

https://github.com/jmtame/scrapland
1•jmtame•23m ago•0 comments

Generating OG Images in Elixir

https://jola.dev/posts/generating-og-images
3•shintoist•23m ago•0 comments

The Sandbox Shift – sandboxes are the new containers, for AI-written code

https://zozo123.github.io/sandboxes-why-how-when/
2•zozo123-IB•23m ago•0 comments

Satellite images suggest Iran's strikes more extensive than US acknowledged

https://www.bbc.com/news/articles/c2l2yl7r8r2o
3•tcp_handshaker•24m ago•0 comments

China approves invasive brain-computer chip

https://www.technologyreview.com/2026/06/01/1138133/china-world-first-brain-chip/
2•rippeltippel•26m ago•0 comments

ik_llama.cpp – llama.cpp fork with better CPU performance

https://github.com/ikawrakow/ik_llama.cpp
2•peter_d_sherman•26m ago•0 comments

Game Boy Port of Snake in Assembly

https://www.4rknova.com//blog/2026/02/01/gb-snake
1•ibobev•27m ago•0 comments

ZX Spectrum System Tour: Text Mode

https://bumbershootsoft.wordpress.com/2026/05/30/zx-spectrum-system-tour-text-mode/
2•ibobev•28m ago•0 comments

Monotonic Collections: middle ground between immutable and mutable (2025)

https://neilmadden.blog/2025/11/11/monotonic-collections-a-middle-ground-between-immutable-and-fu...
1•mooreds•28m ago•0 comments