news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Déjà Code: How LLMs Cheat on Repos They've Seen

https://blogs.latentforce.ai/blogs/deja-code.html

1•maxaravind•1h ago

Comments

maxaravind•1h ago

One of the authors here.

We ran this experiment a few weeks ago, but Anthropic’s Mythos report dropped this week and thought this would be relevant to share now.

Surprisingly, we found that for repos already in the training data(pre cut off set), the contamination is at such a high level that even by just giving the model the file name and not file contents, the model is able to tell what is inside that file. Same for file paths. Just given a file name, the model is able to correctly guess the file path - this implies that models already know the structure of these repos and thus understand what to look for and where.

This ability drops sharply for unseen repos(post cutoff set) - raises the question how effectively it will hold for private repos with proprietary scaffoldings and programming patterns. Then the question worth asking is how much of Mythos's capability on well known codebases like Firefox and OpenBSD is genuine reasoning vs parametric familiarity with their structure?

Methodology caveat: modest sample (9-10 repos per group), treat numbers as directional - more experiments in progress....

Neural Computer: A New Machine Form Is Emerging

https://metauto.ai/neuralcomputer/

1•plainOldText•1m ago•0 comments

Show HN: Unlegacy – document everything, from COBOL to AI generated code

https://www.unlegacy.ai/

1•Absonsonson•1m ago•0 comments

With Claude Managed Agents, Anthropic wants to run your AI agents for you

https://thenewstack.io/with-claude-managed-agents-anthropic-wants-to-run-your-ai-agents-for-you/

1•mooreds•2m ago•0 comments

A compelling title that is cryptic enough to get you to take action on it

https://ericwbailey.website/published/a-compelling-title-that-is-cryptic-enough-to-get-you-to-tak...

1•mooreds•3m ago•0 comments

Optimize Your Developer Environment

https://stanislav.blog/optimize-your-developer-environment/

1•spanferov•4m ago•0 comments

SomeWM: AwesomeWM Replacement for Wayland

https://somewm.org/

2•NoboruWataya•5m ago•0 comments

Software: Application vs SP500

https://finance.yahoo.com/sectors/technology/software-application/

2•jackdoe•7m ago•0 comments

Locally Uncensored – Local AI desktop app(chat,codeagent,image/videogen,nocloud)

https://github.com/PurpleDoubleD/locally-uncensored

2•PurpleDoubleD•7m ago•0 comments

Redesigning Agent Skills – two missing parts

https://simianwords.bearblog.dev/what-agent-skills-misses-now/

1•simianwords•7m ago•0 comments

Show HN: Tinycloud – Claude Code for video work

https://tinycloud.cloudglue.dev/

4•Gabriel439•9m ago•1 comments

Tar: A slop-free alternative to rsync

https://drewdevault.com/2026/03/28/2026-03-28-rsync-without-rsync.html

1•als0•9m ago•0 comments

The Yak Is Back

https://b10g.xyz/blog/2026/the-yak-is-back/

2•zdw•9m ago•1 comments

Context Engineering – LLM Memory and Retrieval for AI Agents

https://weaviate.io/blog/context-engineering

3•eigenBasis•10m ago•0 comments

Zsh-halfpipe: Edit shell pipeline and see its output update live

https://github.com/raimo/zsh-halfpipe

2•raimo•10m ago•1 comments

State of Docs 2026

https://www.stateofdocs.com/2026/introduction-and-demographics

1•mooreds•11m ago•0 comments

Contemplating Meta's Homegrown MTIA Compute Engine Roadmap

https://www.nextplatform.com/compute/2026/04/08/contemplating-metas-homegrown-mtia-compute-engine...

1•rbanffy•11m ago•0 comments

The difficulty of making sure your website is broken

https://letsencrypt.org/2026/04/10/test-sites.html

2•mcpherrinm•11m ago•0 comments

Wozcode: A Claude Code plugin that supercharges performance, cost, and speed

https://www.wozcode.com/

2•spking•12m ago•0 comments

TeamPCP Supply Chain Campaign: Update 007

https://isc.sans.edu/diary/32880

2•jruohonen•13m ago•0 comments

Japan to ban gene-edited embryos aimed at creating "designer babies"

https://english.kyodonews.net/articles/-/73964

1•randycupertino•14m ago•0 comments

Agents can't check their own work

https://frontierai.substack.com/p/agents-cant-check-their-own-work

2•yedava•15m ago•0 comments

Artemis II Flight Day 10: Crew Sets for Final Burn, Splashdown

https://www.nasa.gov/blogs/missions/2026/04/10/artemis-ii-flight-day-10-crew-sets-for-final-burn-...

1•js2•15m ago•1 comments

LLMs 'banchmark' where they write code controlling units in a 1v1 RTS

https://yare.io/ai-arena

2•levmiseri•16m ago•0 comments

May be the first year of a million CVEs

https://liam-on-linux.dreamwidth.org/98430.html

1•speckx•16m ago•0 comments

The Hypercurious Mind

https://aeon.co/essays/how-the-hypercuriosity-of-adhd-may-have-helped-humans-thrive

1•supermdguy•17m ago•0 comments

Indian Government seeks X Community Notes oversight with IT Rules tweaks

https://www.hindustantimes.com/india-news/government-seeks-x-community-notes-oversight-with-it-ru...

1•LordAtlas•19m ago•0 comments

Cypress vs. Playwright: Architecture Deep Dive

https://yatsushi.com/blog/cypress-vs-playwright/

1•jumbosushi•19m ago•0 comments

A New Wakamai Fondue

https://pixelambacht.nl/2026/a-new-wakamai-fondue/

1•robin_reala•20m ago•0 comments

A discrete structural grammar for financial markets – Kaggle competition

https://www.kaggle.com/competitions/ska-crypto-trading-bot-with-binance

1•quantiota•21m ago•0 comments

Two hundred chimpanzees are embroiled in a 'civil war'

https://www.scientificamerican.com/article/two-hundred-chimpanzees-are-embroiled-in-a-civil-war/

2•gscott•22m ago•0 comments