frontpage.
Refusal in Language Models Is Mediated by a Single Direction

https://arxiv.org/abs/2406.11717
23•fagnerbrack•2h ago

Comments

akersten•1h ago
This is from 2024, which is ancient history. It is not true anymore; models are now trained to prevent abliteration by spreading out the refusal encoding.

See https://arxiv.org/abs/2505.19056

Der_Einzige•46m ago
That doesn't stop/prevent abliteration. The creator of XTC/DRY is also a chad who makes sure that you really can access the full model capabilities. Censorship is the devil.

https://github.com/p-e-w/heretic

RRRA•41m ago
It was pretty funny to see Qwen 3.6 (heretic) tell me about how many deaths the Chinese government thought happened at Tiananmen Sq. on April 15th 1989.

Makes you wonder where that data was taken from, or if their great firewall is broken, or even if Alibaba engineers have special access...

arcfour•28m ago
I don't think it's unreasonable to imagine that Alibaba is allowed to scrape the wider internet, or that some research institution is and then Alibaba got data from them.

What is perhaps more surprising is that the data was not scrubbed before training, but maybe they thought that would be too on-the-nose for the rest of the world and would hamper their popularity if they were too obviously biased.

freehorse•16m ago
I don’t think it is very surprising. IME they don’t try that hard to censor them, only at the very superficial level that they have to. It is trivial to get their models to tell you this kind of stuff; I wouldn’t even consider it jailbreaking.

LLMs consistently pick resumes they generate over ones by humans or other models

https://arxiv.org/abs/2509.00462
80•laurex•45m ago•46 comments

How fast is a macOS VM, and how small could it be?

https://eclecticlight.co/2026/05/02/how-fast-is-a-macos-vm-and-how-small-could-it-be/
149•moosia•6h ago•54 comments

Barman – Backup and Recovery Manager for PostgreSQL

https://github.com/EnterpriseDB/barman
43•nateb2022•3d ago•4 comments

Why does it take so long to release black fan versions?

https://www.noctua.at/en/expertise/blog/how-can-it-take-so-long-to-release-black-fan-versions
508•buildbot•11h ago•234 comments

Zugzwang

https://en.wikipedia.org/wiki/Zugzwang
23•Qem•38m ago•3 comments

Why are there both TMP and TEMP environment variables? (2015)

https://devblogs.microsoft.com/oldnewthing/20150417-00/?p=44213
123•ankitg12•7h ago•66 comments

Show HN: DAC – open-source dashboard as code tool for agents and humans

https://github.com/bruin-data/dac
70•karakanb•3d ago•17 comments

Dotcl: Common Lisp Implementation on .NET

https://github.com/dotcl/dotcl
102•reikonomusha•1d ago•15 comments

Ti-84 Evo

https://education.ti.com/en/products/calculators/graphing-calculators/ti-84-evo
524•thatxliner•20h ago•427 comments

Show HN: Pollen – distributed WASM runtime, no control plane, single binary

https://github.com/sambigeara/pollen
52•sambigeara•2d ago•23 comments

Open Design: Use Your Coding Agent as a Design Engine

https://github.com/nexu-io/open-design
96•steveharing1•3h ago•62 comments

Craig Venter of Human Genome Project Dies at 79

https://www.economist.com/obituary/2026/05/01/craig-venter-raced-to-decode-the-human-genome
32•bookofjoe•4h ago•7 comments

Artemis II Photo Timeline

https://artemistimeline.com/#artemis-ii-walkout-nhq202604010003
286•geerlingguy•2d ago•24 comments

Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks

https://mljar.com/
43•pplonski86•5h ago•10 comments

New research suggests people can communicate and practice skills while dreaming

https://www.newyorker.com/culture/annals-of-inquiry/its-possible-to-learn-in-our-sleep-should-we
403•XzetaU8•22h ago•236 comments

Show HN: Browser-based light pollution simulator using real photometric data

https://iesna.eu/?wasm=skyglow_demo
31•holg•7h ago•10 comments

SFO Gate Explorer

https://www.flysfo.com/passengers/services/gate-explorer
25•CaliforniaKarl•1d ago•26 comments

Why IPv6 is so complicated

https://github.com/becarpenter/book6/blob/main/01.%20Introduction%20and%20Foreword/Why%20IPv6%20i...
3•speckx•2d ago•0 comments

To Restore an Island Paradise, Add Fungi

https://e360.yale.edu/digest/atoll-islands-sea-level-rise-fungi
110•Brajeshwar•3d ago•30 comments

DeepSeek V4–almost on the frontier, a fraction of the price

https://simonwillison.net/2026/Apr/24/deepseek-v4/
318•indigodaddy•23h ago•196 comments

Oil tanker hijacked off Yemen, steers toward Somalia

https://www.yahoo.com/news/articles/yemen-says-oil-tanker-hijacked-121710980.html
32•delichon•2h ago•34 comments

Show HN: Filling PDF forms with AI using client-side tool calling

https://copilot.simplepdf.com/?share=a7d00ad073c75a75d493228e6ff7b11eb3f2d945b6175913e87898ec96ca...
37•nip•7h ago•17 comments

CollectWise (YC F24) Is Hiring

https://www.ycombinator.com/companies/collectwise/jobs/rEWfZ6R-senior-forward-deployed-engineer
1•OBrien_1107•11h ago

Bitmap and tilemap generation from a single example

https://github.com/mxgmn/WaveFunctionCollapse
56•futurecat•2d ago•12 comments

An unknown Sega Saturn project has come to light after 29 years

https://32bits.substack.com/p/under-the-microscope-pyramid-unreleased
53•bbayles•3h ago•1 comment

Ask.com has closed

https://www.ask.com/
370•supermdguy•12h ago•195 comments

I'm Peter Roberts, immigration attorney who does work for YC and startups. AMA

180•proberts•1d ago•230 comments

Show HN: Large Scale Article Extract of Newspapers 1730s-1960s

https://snewpapers.com/
31•brettnbutter•7h ago•16 comments

A report on burnout in open source software communities (2025) [pdf]

https://mirandaheath.website/static/oss_burnout_report_mh_25.pdf
121•susam•16h ago•46 comments