Context Rot: How increasing input tokens impacts LLM performance

https://research.trychroma.com/context-rot
27•kellyhongsn•3h ago
I work on research at Chroma, and I just published our latest technical report on context rot.

TLDR: Model performance is non-uniform across context lengths, even for state-of-the-art models such as GPT-4.1, Claude 4, Gemini 2.5, and Qwen3.

This highlights the need for context engineering. Whether relevant information is present in a model’s context is not all that matters; what matters more is how that information is presented.

Here is the complete open-source codebase to replicate our results: https://github.com/chroma-core/context-rot

Comments

tjkrusinski•1h ago
Interesting report. Are there recommended sizes for different models? How do I know what works or doesn't for my use case?
posnet•15m ago
I've definitely noticed this anecdotally.

Especially with Gemini Pro when providing long-form textual references: putting many documents in a single context window gives worse answers than having it summarize the documents first, asking a question about the summaries only, and then providing the full text of the sub-documents on request (RAG-style, or just a simple agent loop).

Similarly, I've noticed that Claude Code with Opus or Sonnet gets worse the more compactions happen. It's unclear to me whether the summary itself just gets worse or whether the context window ends up holding a higher percentage of less relevant data, but even clearing the context and asking it to re-read the relevant files (even if they were mentioned and summarized in the compaction) gives better results.
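
For anyone who wants to try the summarize-first flow described above, here is a rough sketch of how such a loop might look. Everything in it is an assumption for illustration: `llm` is a hypothetical callable that takes a prompt string and returns the model's text, and the `NEED: <name>` reply convention, the `FULL TEXT [...]` labels, and the helper names are made up here, not part of any real API or of the report's codebase.

```python
from typing import Callable, Dict


def build_prompt(question: str, overview: str,
                 full_texts: Dict[str, str], allow_request: bool) -> str:
    """Assemble one prompt: summaries first, then any full texts handed over so far."""
    parts = ["Document summaries:", overview]
    for name, text in full_texts.items():
        parts.append(f"FULL TEXT [{name}]\n{text}")
    parts.append(f"Question: {question}")
    if allow_request:
        parts.append("Answer the question, or reply exactly "
                     "'NEED: <name>' to see one full document.")
    else:
        parts.append("Answer the question using only what is shown above.")
    return "\n\n".join(parts)


def answer_with_summaries(question: str, documents: Dict[str, str],
                          llm: Callable[[str], str],
                          max_requests: int = 3) -> str:
    """Summarize each document separately, then answer from the summaries,
    handing over full texts only when the model asks for them."""
    # Step 1: each document is summarized in its own small context.
    summaries = {name: llm("Summarize this document:\n\n" + text)
                 for name, text in documents.items()}
    overview = "\n".join(f"[{name}] {s}" for name, s in summaries.items())

    # Step 2: ask the question against the summaries; expand on request.
    full_texts: Dict[str, str] = {}
    for _ in range(max_requests):
        reply = llm(build_prompt(question, overview, full_texts, True)).strip()
        if not reply.startswith("NEED:"):
            return reply
        name = reply[len("NEED:"):].strip()
        if name not in documents or name in full_texts:
            break  # unknown or repeated request: stop expanding
        full_texts[name] = documents[name]

    # Step 3: request budget used up (or bad request), so force a final answer.
    return llm(build_prompt(question, overview, full_texts, False)).strip()
```

The point of the design is that the question prompt only ever contains the summaries plus whichever documents were explicitly requested, so the context stays small. Whether that actually beats stuffing everything into one window for a given model is something to measure on your own data.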

Show HN: Generate any workflow with natural language

https://www.osly.ai/
1•hez2000•31s ago•0 comments

Show HN: ZeroFS: The S3FS that does not suck

https://github.com/Barre/zerofs
1•riccomini•1m ago•0 comments

AskIt MCP – Apache 2.0

https://github.com/johnrobinsn/askit
1•johnrobinsn•5m ago•1 comments

Show HN: A simple iOS-native map measurement app

https://apps.apple.com/ch/app/geometer-map/id6748311921
1•nidegen•6m ago•0 comments

Tresorit – secure file exchange and collaboration made easy

https://tresorit.com/
1•janandonly•6m ago•0 comments

DEWLine Museum – The Distant Early Warning Radar Line

https://dewlinemuseum.com/
2•reaperducer•6m ago•0 comments

Launchk: Rust/Cursive TUI for looking at macOS launchd agents and daemons

https://github.com/mach-kernel/launchk
1•sea-gold•6m ago•0 comments

Ask HN: Have you noticed AI critic content being disparaged on HN?

1•ciwolex•8m ago•1 comments

Researchers Develop New Tool to Measure Biological Age

https://www.seattletimes.com/life/researchers-develop-new-tool-to-measure-biological-age/
1•m463•8m ago•0 comments

I've created a tool that is saving me many hours watching YouTube

https://clarifytube.com/article/this-might-be-bigger-than-deepseek?id=xLFkqYOUN24&l=en
1•lfgtavora•9m ago•1 comments

1.1.1.1 Is Down

https://one.one.one.one/#a
3•kdrag0n•10m ago•2 comments

Scientists detect light passing through human head for brain imaging

https://spie.org/news/scientists-detect-light-passing-through-entire-human-head-opening-new-doors-for-brain-imaging
1•Gaishan•11m ago•0 comments

Code highlighting with Cursor AI for $500k

https://securelist.com/open-source-package-for-cursor-ai-turned-into-a-crypto-heist/116908/
2•ivanjermakov•12m ago•0 comments

Microsoft Surface parody (2007) [video]

https://www.youtube.com/watch?v=CZrr7AZ9nCY
2•cjcenizal•13m ago•0 comments

Texas AG requests Robert Roberson be executed Oct. 16

https://www.texastribune.org/2025/06/17/texas-robert-roberson-execution-date-ken-paxton/
1•rossant•14m ago•0 comments

DHH: Future of Programming, AI, Ruby on Rails, Productivity and Parenting

https://lexfridman.com/dhh-david-heinemeier-hansson-transcript/
1•nstj•15m ago•0 comments

Journalist says 4k fake AI news websites created to game Google algorithms

https://pressgazette.co.uk/news/french-journalist-who-uncovered-4000-fake-ai-news-websites-warns-uk-could-be-next/
1•giuliomagnifico•17m ago•0 comments

Cloudflare DNS Is Down

https://news.ycombinator.com/submit
6•jpillora•17m ago•0 comments

1.1.1.1 Is Down

https://www.cloudflarestatus.com/incidents/28r0vbbxsh8f
14•outworlder•18m ago•0 comments

Shopify MCP Can Be Abused to Manipulate Customer Purchases

https://www.tramlines.io/blog/shopify-sellers-can-abuse-shopify-mcp-to-manipulate-customer-purchase-decisions
1•coderinsan•18m ago•0 comments

Learning to Learn, in the Age of LLMs

https://www.carette.xyz/posts/learning_to_learn/
3•weird_trousers•19m ago•0 comments

Integer Division in Bucketed Time Series

https://uvdn7.github.io/bucketed-time-series/
1•uvdn7•20m ago•0 comments

Why Don't You Fucking Retire Already?

https://medium.com/@docjamesw/why-dont-you-fucking-retire-already-3c47a039897c
2•alihm•20m ago•0 comments

AI labs are coming for Wall Street's quants

https://www.businessinsider.com/ai-talent-openai-wall-street-quant-trading-firms-2025-7
1•Bluestein•22m ago•0 comments

Grad school is worse for public health than STDs (2019)

https://www.benkuhn.net/grad/
1•venkii•23m ago•0 comments

Cloudflare DNS Down in UK/EU

6•suddengunter•24m ago•0 comments

Tiny Great Languages: Mouse

https://zserge.com/posts/langs-mouse/
1•PaulHoule•25m ago•0 comments

Show HN: Forge – Connect multiple AI models through a single API

https://tensorblock.co/forge
3•tensorblock•25m ago•0 comments

Altered State of Consciousness Feels Like an Escape from Reality

https://www.popularmechanics.com/science/health/a65227001/lucid-dreaming-altered-state-of-consciousness/
1•Bluestein•26m ago•0 comments

Think tank: Solar energy tops the EU electricity mix in June

https://www.heise.de/en/news/Think-tank-Solar-energy-tops-the-EU-electricity-mix-for-the-first-time-in-June-10485587.html
1•doener•27m ago•0 comments