frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Beyond the Black Box: Interpretability of LLMs in Finance

https://arxiv.org/abs/2505.24650
67•ashater•1d ago

Comments

ashater•1d ago
Paper introduces AI explainability methods, mechanistic interpretation, and novel Finance-specific use cases. Using Sparse Autoencoders, we zoom into LLM internals and highlight Finance-related features. We provide examples of using interpretability methods to enhance sentiment scoring, detect model bias, and improve trading applications.
juancroldan•1d ago
Cool stuff. I'm the CTO of Stargazr (stargazr.ai), a financial & operational AI for manufacturing companies; we started using transformers to process financial data in 2020, a bit before the GPT boom.

In our experience, things beyond very constrained function calling opens the door to explainability problems. We moved away from "based on the embeddings of this P&L, you should do X" towards "I called a function to generate your P&L, which is in this table; based on this you could think of applying these actions".

It's a loss in terms of semantics (the embeddings could pack more granular P&L observations over time) but much better in terms of explainability. I see other finance AIs such as SAP Joule also going in the same direction.

ashater•1d ago
Thank you. Agreed, we are exploring different ways to apply these interpretability methods to a wide range of transformer based methods, not just decoder based generative applications.
hamburga•1d ago
I’m still waiting for somebody to explain to me how a model with a million+ parameters can ever be interpretable in a useful way. You can’t actually understand the model state, so you’re just making very coarse statistical associations between some parameters and some kinds of responses. Or relying on another AI (itself not interpretable) to do your interpretation for you. What am I missing?
esafak•1d ago
Even a large model has to behave fairly predictably to be useful; it's not totally random, is it? The same thing applies to humans.

Interpretability can mean several things. Are you familiar with things like this? https://distill.pub/2018/building-blocks/

ashater•1d ago
Our paper provides evidence of features in Finance but I would suggest reading seminal papers from Anthropic https://www.anthropic.com/news/golden-gate-claude and https://transformer-circuits.pub/2024/scaling-monosemanticit...

Monosemantic behavior is key in our research.

CGMthrowaway•1d ago
There is a power law curve to the importance of any particular feature. I work with models with 1000's of features and usually it's only the top 5-10 that really matter. But you don't know until you do it
dboreham•1d ago
My take is the model is a matrix (or a thing like a matrix). You can "interpret" it in the context of another matrix that you know (presumably by generating that matrix from known training data, or by looking at the delta between different matrices with different measurable output behavior), you can say how much of your test matrix is present in the target model.
laylower•1d ago
Thanks Ariye. What does group risk think about this paper?

I imagine these metrics would be good to include in the MI but are you confident that the methods being proposed are adequate to convince regulators on both sides of the Atlantic?

ashater•1d ago
Thank you for reading. One of the main reasons we've written the paper is to help with model validation of LLM usage in our highly regulated industry. We are also engaging with regulators.

The industry at the moment is mostly using closed sourced vendor models that are very hard to validate or interpret. We are pushing to move onto models, with open source weights and where we can apply our interpretability methods.

Current validation approaches are still very behavioral in nature and we want move it into mechanistic interpretation world.

vessenes•1d ago
Ooh you had me at mechinterp + finance. Thanks for publishing: I’m excited to read it. Long term do you guys hope to uncover novel frameworks? Or are you most interested in having a handle on what’s going on inside the model?
ashater•1d ago
We want to do both. In finance, highly regulated industry, understanding how models work is critical. In addition, mech interp will allow us to understand which current or new architectures could work better for financial applications.

Show HN: I turned my infrastructure into a tab

https://swiftor.io
1•furaar•7m ago•0 comments

ApiFlux – A Visual Playground to Build and Debug API Workflows

1•Shubham_APIFLUX•8m ago•0 comments

I created a curated list of AI agents for consumers and developers

https://www.agentrank.tech
1•hughmcinnis•17m ago•1 comments

Don't know if your business idea will have traction? stop waiting and find out

1•dopeylime•22m ago•0 comments

Big Tech's AI Endgame Is Coming into Focus (an everything app)

https://www.theatlantic.com/technology/archive/2025/06/everything-app-big-tech-ai-endgame/683024/
4•petethomas•28m ago•0 comments

FFmpeg Merges WebRTC Support

https://github.com/FFmpeg/FFmpeg/commit/167e343bbe75515a80db8ee72ffa0c607c944a00
5•Sean-Der•29m ago•0 comments

Friendship rather than romance protects better from depression

https://www.psychologytoday.com/au/blog/living-single/202505/which-protects-best-from-depression-friendship-or-romance
1•nreece•31m ago•0 comments

Malicious RubyGems pose as Fastlane to steal Telegram API data

https://www.bleepingcomputer.com/news/security/malicious-rubygems-pose-as-fastlane-to-steal-telegram-api-data/
2•feross•33m ago•0 comments

Lodestar Multipliers in Delaware and Federal Attorney Fee Awards

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5237545
1•ckrailo•38m ago•0 comments

Installing *BSD in 2025 part 3 – A critical look at NetBSD's installer

https://eerielinux.wordpress.com/2025/05/31/installing-bsd-in-2025-part-3-a-critical-look-at-netbsds-installer/
1•jaypatelani•38m ago•0 comments

Show HN: Hacker News historic upvote and score data

https://hn.dunkirk.sh/
4•clacker-o-matic•39m ago•2 comments

AI can't solve novel problems yet

https://jamesoclaire.com/2025/06/04/ai-obviously-cant-yet-solve-novel-problems/
3•ddxv•47m ago•4 comments

Designing Algorithmic Delegates

https://arxiv.org/abs/2506.03102
2•MarcoDewey•52m ago•0 comments

Our Startup Was Hacked, Need GitHub's Assistance to Trace Attacker

https://techcrunch.com/2025/06/03/indian-grocery-startup-kiranapro-was-hacked-and-its-servers-deleted-ceo-confirms/
4•deepakravindran•58m ago•3 comments

Merlin Bird ID

https://merlin.allaboutbirds.org/
8•twitchard•59m ago•4 comments

Binary Wordle

https://wordle.chengeric.com/
2•eh8•1h ago•1 comments

The symbolism of the magnifying glass is not universal

https://devblogs.microsoft.com/oldnewthing/20250603-00/?p=111240
4•paulmooreparks•1h ago•1 comments

Google Scholar is Manipulatable (2024)

https://arxiv.org/abs/2402.04607
2•downboots•1h ago•0 comments

'Spiderweb' drone attack marks a new threat for top militaries

https://www.businessinsider.com/operation-spiderweb-5-ways-ukraine-drone-attack-new-era-warfare-2025-6
8•petethomas•1h ago•0 comments

Open Sesame! on the Security and Memorability of Verbal Passwords [pdf]

https://seclab.skku.edu/wp-content/uploads/2025/05/223600a683.pdf
1•grac3•1h ago•0 comments

Chinese couple charged with smuggling a biological pathogen into the U.S.

https://www.nbcnews.com/politics/justice-department/chinese-couple-charged-smuggling-biological-pathogen-us-rcna208658
7•shinryudbz•1h ago•3 comments

How NATO is turning to startups to outpace its rivals

https://thenextweb.com/news/how-nato-startups-fight-future-wars
1•mikece•1h ago•0 comments

DiffX – Next-Generation Extensible Diff Format

https://diffx.org/
35•todsacerdoti•1h ago•5 comments

Flesh-eating New World Screwworm could pose health risks to cattle, humans

https://www.foxnews.com/health/flesh-eating-new-world-screwworm-could-pose-health-risks-cattle-humans
1•keepamovin•1h ago•1 comments

Why is PS3 emulation so fast: RPCS3 optimizations explained [video]

https://www.youtube.com/watch?v=19ae5Mq2lJE
7•alexjplant•1h ago•0 comments

Musk calls Trump's tax bill a 'disgusting abomination'

https://www.bbc.com/news/articles/c0j76djzgpvo
8•andsoitis•1h ago•2 comments

Ask HN: Stripe and Chargebacks

2•gtech1•1h ago•2 comments

Meta and Yandex exfiltrating tracking data on Android via WebRTC

https://arstechnica.com/security/2025/06/meta-and-yandex-are-de-anonymizing-android-users-web-browsing-identifiers/
2•liuandrewk•1h ago•1 comments

Indeed CEO resigns after major growth to focus on AI ethics

https://www.techinasia.com/news/indeed-ceo-resigns-after-major-growth-to-focus-on-ai-ethics
2•williswee•1h ago•0 comments

Why Is the US Dropping Billions of Mutant Flies from the Sky? [video]

https://www.youtube.com/watch?v=zxq60I5RSW8
3•keepamovin•1h ago•1 comments