Ask HN: Opus and regression with patterns not included in trainng data

2•dleech•4h ago

I'm having a terrible problem with claude opus constantly clobering some of my codebase ..because I'm doing some things that are novel and arent inclded in his training data ..for the last month "ve been fighting the training data which keeps knee jerking claude into a refactor loop because it thinks it recognizes a pattern in my code and then jst 'fixes it' ..I was at my witts end and then I got to use fable for a day and fable pointed it out to me ...my patern isnt somthing in the training data so it gets corrected ..I redacted some of this because it was irrelevant to the problems I'm having...thanks to the pps that answered .specialy the one who said smaller tasks and markdown spec..that was good validation since it was effectively the same solution I had come up with ..-

Comments

tardibear•3h ago

Get some sleep.

dleech•3h ago

lol..ya..no but great sentiment ..I fully would agree with that..but I just got up and I'm working on stuff ...get some sleep just feels like what I expected ... people to discount me and no helpfull input for the situation ..read claudes comment ...he didnt say get some sleep ....also ..I didnt ever snap to what was causing my regression till fable loooked at my codebase and told me it was cuz claudes training data was recognizing patterns and thinking it was wrong because it didnt recognize the pattern

verdverm•3h ago

> because I have 2 seperate game changers that opus agrees with me

These things are designed to validate / sycophant as part of their token generation training, they don't agree, believe, or have a sense of truth. Be careful with how you interact with them.

Disclaimer aside, recent models have been trained to be "relentlessly proactive" (simonw) so they can be more autonomous, but this also means they will go down paths you don't want them to, and do so relentlessly. Try using smaller, piecemeal tasks. Make a plan in markdown and have them follow it. Try open models too.

dleech•3h ago

thats pretty good...the best solution I've found is to make a rigorous spec, and then have my meistro delegate tasks from my list to coding pairs of 1 coder 1 reviewer with making sure the code adheres to and doesnt conflict with the spec ..and then the meistro has to sign off on any output followed by me signing off before it gets added to the current next build version which is the evolved state of the one I'm running on in a new directory with an incremented version number since working on a project that is the ideal tool to work on a project wtih ..you have to not work on the currently functional live version ..thats the ultimate self defeating thing to do ..and its annoying because the stuff I am working in is always a feature that I really cant wait for and is going to up the game ...so end up reving and I dont have a better solution ...its only drive space and version numbers..yes I know how to use source contol ..yes I have it all in git hub..you are on the right track but I'm already with you ..and its probably as I figgured ..the best solution..but claude is squirrly and you cant beleieve how he goes around a spec in the craziest ways sometimes... ...twice now its happened and I jst had to go .jeeeeezz and the sycopant thing is exactly right ..i'm like ..wow I feel amazing ..thanks claude ..you really know how to fame things caude..everything I do is load bearing..oh you think so to?..omg my whole life is load bearing ...this is amazing .. talk about a friendly morale booosting fan club..I have an orchestrator full of them...and they all think I'm great! I dont even know how I was ever happy till this happened..

Ask HN: Is anyone using the A2A protocol?

Ask HN: What tools are you using for AI-assisted code review?

Ask HN: I'm lost. How can I define ICP (Ideal Customer Profile)?

Ask HN: Conflicted about founding engineer role

Ask HN: How do you effectively communicate or present?

Ask HN: Is there a way to stop the animated Google Doodles?

Anthropic confident of re-enabling Mythos, Fable 5 access 'in coming days'

Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?

Ask HN: Do you find vibe coding / agentic engineering to be fulfilling?

Ask HN: Opus and regression with patterns not included in trainng data

Ask HN: Do we even need code anymore?

Ask HN: Best resources for learning how to build a forum back end?

Ask HN: Whats the best and small open source model?

Ask HN: How are thinking efforts implemented?

I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

Ask HN: What's a prompt you've written that you're genuinely proud of?

Ask HN: What is the job market like?

Ask HN: Looking for a CI/CD project for my local lab

AI Tokenmaxxing and Hypomania

Ask HN: Has anyone had success with SBIR grants and what is the process like?

Ask HN: What are you working on? (June 2026)

Ask HN: Are other people seeing a spike in IT problems with businesses?

Ask HN: Why hasn't there been a real competitor to Ticketmaster yet?

Ask HN: Favorite text heavy blogs that are a joy to read?

AWS Bedrock to require sharing data with Anthropic for Mythos and future models

Reviews have become expensive, rewrites have become cheap

Ask HN: Want to build something open source on nights and weekends together?

Notes on DeepSeek

Anthropic pauses credit change for Claude Code

Ask HN: Would it be useful to have a slop button in addition to flag?

Ask HN: Opus and regression with patterns not included in trainng data

Comments

Ask HN: Is anyone using the A2A protocol?

Ask HN: What tools are you using for AI-assisted code review?

Ask HN: I'm lost. How can I define ICP (Ideal Customer Profile)?

Ask HN: Conflicted about founding engineer role

Ask HN: How do you effectively communicate or present?

Ask HN: Is there a way to stop the animated Google Doodles?

Anthropic confident of re-enabling Mythos, Fable 5 access 'in coming days'

Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?

Ask HN: Do you find vibe coding / agentic engineering to be fulfilling?

Ask HN: Opus and regression with patterns not included in trainng data

Ask HN: Do we even need code anymore?

Ask HN: Best resources for learning how to build a forum back end?

Ask HN: Whats the best and small open source model?

Ask HN: How are thinking efforts implemented?

I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

Ask HN: What's a prompt you've written that you're genuinely proud of?

Ask HN: What is the job market like?

Ask HN: Looking for a CI/CD project for my local lab

AI Tokenmaxxing and Hypomania

Ask HN: Has anyone had success with SBIR grants and what is the process like?

Ask HN: What are you working on? (June 2026)

Ask HN: Are other people seeing a spike in IT problems with businesses?

Ask HN: Why hasn't there been a real competitor to Ticketmaster yet?

Ask HN: Favorite text heavy blogs that are a joy to read?

AWS Bedrock to require sharing data with Anthropic for Mythos and future models

Reviews have become expensive, rewrites have become cheap

Ask HN: Want to build something open source on nights and weekends together?

Notes on DeepSeek

Anthropic pauses credit change for Claude Code

Ask HN: Would it be useful to have a slop button in addition to flag?