frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

LLMs learn what programmers create, not how programmers work

21•noemit•6h ago
I ran an experiment to see if CLI actually was the most intuitive format for tool calling. (As claimed by a ex-Manus AI Backend Engineer) I gave my model random scenarios and a single tool "run" - i told it that it worked like a CLI. I told it to guess commands.

it guessed great commands, but it formatted it always with a colon up front, like :help :browser :search :curl

It was trained on how terminals look, not what you actually type (you don't type the ":")

I have since updated my code in my agent tool to stop fighting against this intuition.

LLMs they learn what commands look like in documentation/artifacts, not what the human actually typed on the keyboard.

Seems so obvious. This is why you have to test your LLM and see how it naturally works, so you don't have to fight it with your system prompt.

This is Kimi K2.5 Btw.

Comments

shomp•2h ago
Great observation. The brain of a programmer is still a "black box" to the feed-forward network of nodes . But in theory, if you pumped a lot of the live-coding videos from something like youtube into the process, you could get a bit of that "what's your approach"-erism to bleed into the model. There might not be enough material there to truly "train it to think" but it would be interesting to try and "fill the gaps" of black-box-ed-ness in the LLM with supplemental "here was the process that got us there" video feeds. The next natural move might actually be recording thousands of hours of footage of developers working with the LLMs directly like in Cursor or another IDE that has LLM live-pair-programming , maybe calling it "pair programming" is generous , but it might be a reasonable foray into teaching the next generation of LLMs the "thought process" behind things. In reality you'd be teaching it which files to inspect, which windows to open/close, which tools to switch to and focus on. And while it might be imperfect, it might just be enough.

LLMs learn what programmers create, not how programmers work

21•noemit•6h ago•1 comments

Ask HN: Is using AI tooling for a PhD literature review dishonest?

7•latand6•5h ago•10 comments

Ask HN: Is anyone here also developing "perpetual AI psychosis" like Karpathy?

22•jawerty•7h ago•16 comments

Ask HN: AI productivity gains – do you fire devs or build better products?

102•Bleiglanz•1d ago•194 comments

Veevo Health – book a CT angiogram to see plaque buildup in your arteries

4•arvindsr33•6h ago•2 comments

Ask HN: If there has been no prompt injection, is it safe?

4•sayYayToLife•12h ago•5 comments

Ask HN: How many of you are profiting with LLM wrapper apps?

12•general_reveal•11h ago•1 comments

Ask HN: Are you using OpenClaw or similar agents? How?

4•nclin_•17h ago•6 comments

Tell HN: MS365 upgrade silently to 25 licenses, tried to charge me $1,035

22•davidstarkjava•1d ago•8 comments

Tell HN: H&R Block tax software installs a TLS backdoor

144•yifanlu•3d ago•12 comments

DietPi released a new version v10.2

2•StephanStS•10h ago•0 comments

Ask HN: Growth for me,is realizing how much I didn't know 6 months ago. Yours?

5•kathir05•17h ago•2 comments

What would you do if you have AI software that may be transformers alternative?

2•adinhitlore•1d ago•4 comments

Ask HN: How much are you spending on AI coding at work?

6•habosa•1d ago•7 comments

Spotify playing ads for paid subscribers

149•IncandescentGas•5d ago•127 comments

Anyone know how long it will take to re-start Qatar's helium plants?

9•megamike•1d ago•5 comments

Ask HN: How to get free/cheap Claude and AWS credits

4•jacAtSea•1d ago•6 comments

Ask HN: How do you handle peer-to-peer discovery on iOS without a server?

6•redgridtactical•1d ago•5 comments

SparkVSR: Video Super-Resolution You Can Control with Keyframes

2•steveharing1•1d ago•0 comments

Ask HN: what’s your favorite line in your Claude/agents.md files?

15•khasan222•2d ago•11 comments

Ask HN: What do you look for in your first 10 hires?

28•neilk17•4d ago•34 comments

Structural Friction: A metric for human coordination cost

6•davidvartanian•3d ago•0 comments

Ask HN: Is vibe coding a new mandatory job requirement?

38•newswangerd•6d ago•77 comments

Ask HN: How do you deal with people who trust LLMs?

153•basilikum•5d ago•202 comments

Ask HN: Why isn't the NSA categorized as an APT?

5•TheOpenSourcer•2d ago•9 comments

You've reached the end!