frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Robustly improving LLM fairness in realistic settings via interpretability

https://www.arxiv.org/pdf/2506.10922
1•like_any_other•7mo ago