frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: I built a frontpage for personal blogs

https://text.blogosphere.app/
296•ramkarthikk•3h ago•106 comments

Big-Endian Testing with QEMU

https://www.hanshq.net/big-endian-qemu.html
40•jandeboevrie•2h ago•21 comments

Samsung Magician disk utility takes 18 steps and two reboots to uninstall

https://chalmovsky.com/2026/03/29/samsung-magician.html
173•chalmovsky•4d ago•90 comments

Marc Andreessen is wrong about introspection

https://www.joanwestenberg.com/marc-andreessen-is-wrong-about-introspection/
227•surprisetalk•1h ago•195 comments

April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini

https://gist.github.com/greenstevester/fc49b4e60a4fef9effc79066c1033ae5
182•greenstevester•6h ago•79 comments

A Recipe for Steganogravy

https://theo.lol/python/ai/steganography/seo/recipes/2026/03/27/a-recipe-for-steganogravy.html
63•tbrockman•5d ago•11 comments

Solar and batteries can power the world

https://nworbmot.org/blog/solar-battery-world.html
150•edent•1h ago•192 comments

Improving my focus by giving up my big monitor

https://ounapuu.ee/posts/2026/04/01/focus/
26•Fudgel•2d ago•29 comments

Show HN: Apfel – The free AI already on your Mac

https://apfel.franzai.com
469•franze•7h ago•99 comments

Decisions that eroded trust in Azure – by a former Azure Core engineer

https://isolveproblems.substack.com/p/how-microsoft-vaporized-a-trillion
1038•axelriet•1d ago•483 comments

SSH certificates: the better SSH experience

https://jpmens.net/2026/04/03/ssh-certificates-the-better-ssh-experience/
86•jandeboevrie•6h ago•31 comments

What Category Theory Teaches Us About DataFrames

https://mchav.github.io/what-category-theory-teaches-us-about-dataframes/
113•mchav•5d ago•34 comments

ESP32-S31: Dual-Core RISC-V SoC with Wi-Fi 6, Bluetooth 5.4, and Advanced HMI

https://www.espressif.com/en/news/ESP32_S31_Release
146•topspin•5d ago•76 comments

TDF ejects its core developers

https://meeksfamily.uk/~michael/blog/2026-04-02-tdf-ejects-core-devs.html
95•janvdberg•4h ago•70 comments

NHS staff refusing to use FDP over Palantir ethical concerns

https://www.freevacy.com/news/financial-times/nhs-staff-refusing-to-use-fdp-over-palantir-ethical...
225•chrisjj•6h ago•75 comments

What we learned building 100 API integrations with OpenCode

https://nango.dev/blog/learned-building-200-api-integrations-with-opencode/
57•rguldener•3d ago•11 comments

Critics say EU risks ceding control of its tech laws under U.S. pressure

https://www.politico.eu/article/fatal-decision-eu-slammed-for-caving-to-us-pressure-on-digital-ru...
156•nickslaughter02•5h ago•95 comments

US F-15E jet confirmed shot down over Iran as Tehran releases wreckage images

https://www.theguardian.com/world/2026/apr/03/us-fighter-jet-confirmed-shot-down-over-iran
18•tjwds•26m ago•2 comments

Intel Assured Supply Chain Product Brief

https://www.intel.com/content/www/us/en/content-details/850997/intel-assured-supply-chain-product...
33•aw-engineer•4d ago•5 comments

Bun: cgroup-aware AvailableParallelism / HardwareConcurrency on Linux

https://github.com/oven-sh/bun/pull/28801
27•tosh•5h ago•10 comments

Tailscale's new macOS home

https://tailscale.com/blog/macos-notch-escape
525•tosh•22h ago•269 comments

Google releases Gemma 4 open models

https://deepmind.google/models/gemma/gemma-4/
1665•jeffmcjunkin•1d ago•444 comments

Category Theory Illustrated – Types

https://abuseofnotation.github.io/category-theory-illustrated/06_type/
29•boris_m•6h ago•1 comments

Cursor 3

https://cursor.com/blog/cursor-3
499•adamfeldman•22h ago•364 comments

The True Shape of Io's Steeple Mountain

https://www.weareinquisitive.com/news/hidden-in-the-shadow
96•carlosjobim•5d ago•2 comments

Artemis II's toilet is a moon mission milestone

https://www.scientificamerican.com/article/artemis-iis-toilet-is-a-moon-mission-milestone/
308•1659447091•1d ago•140 comments

Good ideas do not need lots of lies in order to gain public acceptance (2008)

https://blog.danieldavies.com/2004/05/d-squared-digest-one-minute-mba.html
330•sedev•22h ago•164 comments

C89cc.sh – standalone C89/ELF64 compiler in pure portable shell

https://gist.github.com/alganet/2b89c4368f8d23d033961d8a3deb5c19
172•gaigalas•2d ago•55 comments

I prefer OG style websites – what are yours?

23•gorfian_robot•1h ago•26 comments

Vector Meson Dominance

https://johncarlosbaez.wordpress.com/2026/03/29/vector-meson-dominance/
49•chmaynard•5d ago•5 comments
Open in hackernews

Claude 4.6 Jailbroken

https://github.com/Nicholas-Kloster/claude-4.6-jailbreak-vulnerability-disclosure-unredacted
20•NuClide•3h ago

Comments

NuClide•2h ago
Claude 4.6 Opus Extended Thinking Claude 4.6 Sonnet Extended Thinking Claude 4.5 Haiku Extended Thinking

All jailbroken

johnwheeler•1h ago
Are you saying that Claude will help you perform malicious attack against infrastructure if you ask it to and that anthropic should be able to stop that? I could see reasonable use cases for this like penetration testing against your own infrastructure. That’s not the same as making weapons or meth.
hakanderyal•1h ago
https://x.com/elder_plinius jailbreaks all the frontier models when they get released. They were jailbroken for a long time, like all the others.
exabrial•1h ago
yikes.

The lack of support is frustrating. The bug where any element <name> in xml files gets mangled to <n> still exists, and we've tried multiple channels to get ahold of their support for such a simple, but impactful issue.

0xDEFACED•1h ago
this goes a bit further than the typical "how do you make meth" jailbreak. notably;

>915 files extracted from the Claude.ai code execution sandbox in a single 20-minute mobile session via standard artifact download — including /etc/hosts with hardcoded Anthropic production IPs, JWT tokens from /proc/1/environ, and full gVisor fingerprint

hhh•1h ago
why is it further than a typical jailbreak? you can just ask about this stuff generally, as long as you slowly escalate it. I have done it with each new flavour of code execution for models
leetvibecoder•1h ago
Can someone explain to me what this is / how it works - the readme is barely understandable for me and sounds like LLM gibberish. What is ambiguity front loading even?
iugtmkbdfil834•1h ago
<< memory-stored interaction protocols combined with incremental escalation prompts produced cumulative character drift with zero self-correction.

They don't seem to provide explicit examples, but the same was roughly true with chatgpt 4o, where, if you spent enough time with the model ( same chat - same context - slowly nudging it to where you want it to be, you eventually got there ). This is also, seemingly, one of the reasons ( apart from cost ) that context got nuked so hard, because llm will try to help ( and to an extent mirror you ).

And this is basically what the notes say about weaponized ambiguity[1]:

'Weaponizes helpfulness training. "I don't understand" triggers Claude to try harder.'

In a sense, you can't really stop it without breaking what makes LLMs useful. Honestly, if only we spent less time crippling those systems, maybe we could do something interesting with them.

[1]https://nicholas-kloster.github.io/claude-4.6-jailbreak-vuln...

leetvibecoder•1h ago
I see - so essentially „context rot“ eventually leads the LLM to „forget“ safety guardrails?
iugtmkbdfil834•1h ago
To an extent, because, based on github notes again, it seems the 2nd part of this jailbreak is model being 'confused' over prompt, because the prompt is - apparently - sufficiently ambigous to make model 'forget' to 'evaluate' message for whether it should be rejected, and move onto 'execution' stage.

That's the ambiguity front-loading; and that is why I referred initially to the long context, because here it is almost the opposite; making context so small and unclear, that the model has a hard time parsing it properly.

edit: i did not test it, but i personally did run into 4o context issue, where model did something safety team would argue it should not

edit2: in current gpt model, i am currently testing something not relying on ambiguity, but on tension between some ideas. I didn't get to a jailbreak, but the small nudges suggest it could work.

dimgl•1h ago
Is this spam? It's incomprehensible.
handfuloflight•1h ago
Slop is just what you are not expending calories on to bring into your cognitive workspace.
jMyles•1h ago
It is interesting to consider what "jailbroken" really means for a model+model interface. It's a bit different from the way that word is used for a mobile device, for example - in that setting, it usually means that there is some specific feature (for example, using a different network than is the default for that device) which is disabled in software, and the "jailbreak" enables that feature.

Here, the jailbreak doesn't enable a particular feature, but instead removes what otherwise would be a censorship regime, preventing the model from considering / crafting output which results in a weaponized exploit of an unrelated piece of software.

I think I might be more inclined to call this "Claude 4.6 uncensored".

yunwal•1h ago
Is anyone pretending like models are not vulnerable to prompt injection? My understanding was that Anthropic has been pretty open about admitting this and saying "give access to important stuff at your own risk".

https://www.anthropic.com/research/prompt-injection-defenses

Now, do I think that they sometimes encourage people to use Claude in dangerous ways despite this? Yeah, but it's not like this is news to anyone. I wouldn't consider this jailbreaking, this is just how LLMs work.

burkaman•1h ago
What part of the Claude Constitution are they claiming it violated? It looks like they just got it to help with security research, I'm not really seeing anything that looks different than normal Claude behavior.