frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: Opus and regression with patterns not included in trainng data

2•dleech•4h ago
I'm having a terrible problem with claude opus constantly clobering some of my codebase ..because I'm doing some things that are novel and arent inclded in his training data ..for the last month "ve been fighting the training data which keeps knee jerking claude into a refactor loop because it thinks it recognizes a pattern in my code and then jst 'fixes it' ..I was at my witts end and then I got to use fable for a day and fable pointed it out to me ...my patern isnt somthing in the training data so it gets corrected ..I redacted some of this because it was irrelevant to the problems I'm having...thanks to the pps that answered .specialy the one who said smaller tasks and markdown spec..that was good validation since it was effectively the same solution I had come up with ..-

Comments

tardibear•3h ago
Get some sleep.
dleech•3h ago
lol..ya..no but great sentiment ..I fully would agree with that..but I just got up and I'm working on stuff ...get some sleep just feels like what I expected ... people to discount me and no helpfull input for the situation ..read claudes comment ...he didnt say get some sleep ....also ..I didnt ever snap to what was causing my regression till fable loooked at my codebase and told me it was cuz claudes training data was recognizing patterns and thinking it was wrong because it didnt recognize the pattern
verdverm•3h ago
> because I have 2 seperate game changers that opus agrees with me

These things are designed to validate / sycophant as part of their token generation training, they don't agree, believe, or have a sense of truth. Be careful with how you interact with them.

Disclaimer aside, recent models have been trained to be "relentlessly proactive" (simonw) so they can be more autonomous, but this also means they will go down paths you don't want them to, and do so relentlessly. Try using smaller, piecemeal tasks. Make a plan in markdown and have them follow it. Try open models too.

dleech•3h ago
thats pretty good...the best solution I've found is to make a rigorous spec, and then have my meistro delegate tasks from my list to coding pairs of 1 coder 1 reviewer with making sure the code adheres to and doesnt conflict with the spec ..and then the meistro has to sign off on any output followed by me signing off before it gets added to the current next build version which is the evolved state of the one I'm running on in a new directory with an incremented version number since working on a project that is the ideal tool to work on a project wtih ..you have to not work on the currently functional live version ..thats the ultimate self defeating thing to do ..and its annoying because the stuff I am working in is always a feature that I really cant wait for and is going to up the game ...so end up reving and I dont have a better solution ...its only drive space and version numbers..yes I know how to use source contol ..yes I have it all in git hub..you are on the right track but I'm already with you ..and its probably as I figgured ..the best solution..but claude is squirrly and you cant beleieve how he goes around a spec in the craziest ways sometimes... ...twice now its happened and I jst had to go .jeeeeezz and the sycopant thing is exactly right ..i'm like ..wow I feel amazing ..thanks claude ..you really know how to fame things caude..everything I do is load bearing..oh you think so to?..omg my whole life is load bearing ...this is amazing .. talk about a friendly morale booosting fan club..I have an orchestrator full of them...and they all think I'm great! I dont even know how I was ever happy till this happened..

Ask HN: Is anyone using the A2A protocol?

42•asim•12h ago•21 comments

Ask HN: What tools are you using for AI-assisted code review?

10•agos•4h ago•2 comments

Ask HN: I'm lost. How can I define ICP (Ideal Customer Profile)?

3•snowhy•3h ago•1 comments

Ask HN: Conflicted about founding engineer role

4•gondolin1683•3h ago•11 comments

Ask HN: How do you effectively communicate or present?

6•hnthrow10282910•2h ago•4 comments

Ask HN: Is there a way to stop the animated Google Doodles?

8•arnejenssen•5h ago•9 comments

Anthropic confident of re-enabling Mythos, Fable 5 access 'in coming days'

3•getbowtied•3h ago•0 comments

Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?

1295•cloudking•3d ago•550 comments

Ask HN: Do you find vibe coding / agentic engineering to be fulfilling?

4•uejfiweun•3h ago•5 comments

Ask HN: Opus and regression with patterns not included in trainng data

2•dleech•4h ago•4 comments

Ask HN: Do we even need code anymore?

3•lasky•5h ago•6 comments

Ask HN: Best resources for learning how to build a forum back end?

3•jupr•7h ago•2 comments

Ask HN: Whats the best and small open source model?

3•hairymouse•7h ago•1 comments

Ask HN: How are thinking efforts implemented?

103•simianwords•1w ago•31 comments

I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

435•iliashad•4d ago•114 comments

Ask HN: What's a prompt you've written that you're genuinely proud of?

10•akashwadhwani35•16h ago•5 comments

Ask HN: What is the job market like?

47•gardnr•3h ago•44 comments

Ask HN: Looking for a CI/CD project for my local lab

4•q8zd3•10h ago•9 comments

AI Tokenmaxxing and Hypomania

6•karthikeyankc•11h ago•5 comments

Ask HN: Has anyone had success with SBIR grants and what is the process like?

10•lyfeninja•20h ago•8 comments

Ask HN: What are you working on? (June 2026)

310•david927•4d ago•1130 comments

Ask HN: Are other people seeing a spike in IT problems with businesses?

14•PaulHoule•1d ago•11 comments

Ask HN: Why hasn't there been a real competitor to Ticketmaster yet?

264•mdni007•1w ago•240 comments

Ask HN: Favorite text heavy blogs that are a joy to read?

119•joshmarinacci•1w ago•30 comments

AWS Bedrock to require sharing data with Anthropic for Mythos and future models

427•TomAnthony•1w ago•255 comments

Reviews have become expensive, rewrites have become cheap

82•_z6bq•2d ago•73 comments

Ask HN: Want to build something open source on nights and weekends together?

39•vira28•1w ago•18 comments

Notes on DeepSeek

211•vinhnx•1w ago•141 comments

Anthropic pauses credit change for Claude Code

35•fabianlindfors•3d ago•12 comments

Ask HN: Would it be useful to have a slop button in addition to flag?

41•BugsJustFindMe•1w ago•23 comments