tripplyons•52m ago
Great work! I wonder if there is a way to combine similar cache items instead of dropping unlikely ones. Could the proposed attention estimation be used for that?
yorwba•2m ago
Yes, for example https://arxiv.org/pdf/2506.05410 merges the two neighboring tokens with the lowest sum of past attention scores, and this method would enable using expected future attention instead.
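A minimal NumPy sketch of that neighbor-merging idea (not the exact procedure from either paper; the attention-weighted averaging and the score bookkeeping are my assumptions):

```python
import numpy as np

def merge_lowest_attention_pair(keys, values, attn_scores):
    """Merge the neighboring KV-cache pair with the lowest summed attention.

    keys, values: (seq_len, head_dim) cached tensors for one head
    attn_scores: (seq_len,) cumulative attention each cached token received
                 (or, per the comment above, expected future attention)
    Returns keys, values, attn_scores with seq_len - 1 entries.
    """
    # Score each adjacent pair (i, i+1) by its summed attention
    pair_scores = attn_scores[:-1] + attn_scores[1:]
    i = int(np.argmin(pair_scores))  # least-attended neighboring pair

    # Attention-weighted average, so the merged entry leans toward the
    # token that mattered more (an assumed merge rule, for illustration)
    w = attn_scores[i : i + 2]
    w = w / max(w.sum(), 1e-12)
    merged_k = w[0] * keys[i] + w[1] * keys[i + 1]
    merged_v = w[0] * values[i] + w[1] * values[i + 1]

    keys = np.concatenate([keys[:i], merged_k[None], keys[i + 2 :]])
    values = np.concatenate([values[:i], merged_v[None], values[i + 2 :]])
    attn_scores = np.concatenate(
        [attn_scores[:i], [attn_scores[i] + attn_scores[i + 1]], attn_scores[i + 2 :]]
    )
    return keys, values, attn_scores
```

Calling this in a loop until the cache fits the budget gives gradual compression without ever fully dropping a token's contribution.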
yalok•35m ago
The paper only reports evals for RULER 4K and 16K; I wish they'd gone further and measured longer context windows. I was also wondering whether this method could actually beat the no-compression baseline. Their Qwen results on RULER 16K seem to allude to that: at small compression ratios the evals look better than baseline, which would mean they're not just improving inference speed and memory but also addressing the attention-dilution problem.