frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The next AI breakthrough: learning on the job

https://medium.com/@rviragh/the-next-ai-breakthrough-learning-on-the-job-fc20fba4d906
2•logicallee•3mo ago

Comments

logicallee•3mo ago
(The above is a medium link, the text is below in case you'd prefer to read it here.)

Who this is for: AI researchers and enthusiasts

I recently deployed a small application (Go server, in-memory database, streaming video, webrtc), while developing it with AI. It's not ready for users yet, so I can't link it yet unfortunately, but progress was solid. Amazingly, AI was able to build a dockerized test framework for it, running end-to-end tests using Chrome headless and mocking up video feeds. That's a huge task that would take weeks to do, if indeed I could ever get it done at all, and I was blown away at the fact that AI could complete it. The tests don't pass yet, so that's how I know the application I'm building definitely isn't ready for users yet.

One thing that struck me is that as I iterated with the AI, there were sometimes regressions. It forgot how it solved something it had already struggled with, and then solved. This tracks with people's experience of AI as an intern with a lot of knowledge of different technologies, little experience handling large codebases by itself, and who doesn't learn anything throughout its internship. What they mean is that the only knowledge the AI has is what is included in its context. It doesn't learn from its "experience" thinking through, writing and developing a codebase, unless it is asked to write the experiences down to read right before its next answer. It would be like being an amnesiac who remembers the contents of the entire Internet and every open source codebase, but doesn't remember anything about the project it's working on except any short note it wrote itself and the current codebase, which it has to read right before its next step. It's like being President by waking up every morning as an amnesiac who has to first reread the entire history of your country, since you don't know anything about it, you only just know about every other country in the world, but never learned your own. (Here "your own" country represents your codebase that you wrote yourself.) Except instead of having to do that every morning, you have to do that after every single step you take.

It would be absurd to expect AI's to reread all of their original training data between every prompt, yet this is what's done for the codebases they themselves write. They don't write them and learn them, they write them and forget them.

Some exciting developments that could be expected in the near future are:

* AI agents that remember or learn from their previous thinking (which they express in chains of thought), and definitely learn the codebase and system they're working on, without having to explicitly write it into their context. It can just become part of the model. Maybe this is why humans sleep each night to integrate their experiences? Do humans retrain their brains while they sleep each night?

* AI agents that ask questions, experiment, and learn and explore the systems they're building, just as humans do. Humans don't just think and then type out a complete application without any experimentation, it would be an absurd way to code. Yet, AI's are expected to do just that, having access only to what they've already written, and none of their "experiences" or conclusions from experiments they run to try to undestand what they're working on.

logicallee•3mo ago
When presented a piece of code to iterate on, the main difference between a human coder and an AI right now is that the human coder says:

"I know this. I just coded it yesterday, and remember how I did it, too. Here's how to add to it or make this specific change I want to add next."

and the AI says:

"Great question. I just read this codebase for the first time so just give me a minute and (thought for 1 minute) here's the answer"

"Great question. I just read this codebase for the first time so just give me a minute and (thought for 1 minute) here's the answer"

"Great question. I just read this codebase for the first time so just give me a minute and (thought for 1 minute) here's the answer"

"Great question. I just read this codebase for the first time so just give me a minute and (thought for 1 minute) here's the answer"

I look forward to when AI's learn on the job, and I think we're not far off from that period.

What exciting developments do you look forward to in the future?

email the author at: rviragh@gmail.com

The Genus Amanita

https://www.mushroomexpert.com/amanita.html
1•rolph•2m ago•0 comments

We have broken SHA-1 in practice

https://shattered.io/
1•mooreds•2m ago•1 comments

Ask HN: Was my first management job bad, or is this what management is like?

1•Buttons840•4m ago•0 comments

Ask HN: How to Reduce Time Spent Crimping?

1•pinkmuffinere•5m ago•0 comments

KV Cache Transform Coding for Compact Storage in LLM Inference

https://arxiv.org/abs/2511.01815
1•walterbell•10m ago•0 comments

A quantitative, multimodal wearable bioelectronic device for stress assessment

https://www.nature.com/articles/s41467-025-67747-9
1•PaulHoule•11m ago•0 comments

Why Big Tech Is Throwing Cash into India in Quest for AI Supremacy

https://www.wsj.com/world/india/why-big-tech-is-throwing-cash-into-india-in-quest-for-ai-supremac...
1•saikatsg•11m ago•0 comments

How to shoot yourself in the foot – 2026 edition

https://github.com/aweussom/HowToShootYourselfInTheFoot
1•aweussom•12m ago•0 comments

Eight More Months of Agents

https://crawshaw.io/blog/eight-more-months-of-agents
3•archb•14m ago•0 comments

From Human Thought to Machine Coordination

https://www.psychologytoday.com/us/blog/the-digital-self/202602/from-human-thought-to-machine-coo...
1•walterbell•14m ago•0 comments

The new X API pricing must be a joke

https://developer.x.com/
1•danver0•15m ago•0 comments

Show HN: RMA Dashboard fast SAST results for monorepos (SARIF and triage)

https://rma-dashboard.bukhari-kibuka7.workers.dev/
1•bumahkib7•15m ago•0 comments

Show HN: Source code graphRAG for Java/Kotlin development based on jQAssistant

https://github.com/2015xli/jqassistant-graph-rag
1•artigent•21m ago•0 comments

Python Only Has One Real Competitor

https://mccue.dev/pages/2-6-26-python-competitor
3•dragandj•22m ago•0 comments

Tmux to Zellij (and Back)

https://www.mauriciopoppe.com/notes/tmux-to-zellij/
1•maurizzzio•23m ago•1 comments

Ask HN: How are you using specialized agents to accelerate your work?

1•otterley•24m ago•0 comments

Passing user_id through 6 services? OTel Baggage fixes this

https://signoz.io/blog/otel-baggage/
1•pranay01•25m ago•0 comments

DavMail Pop/IMAP/SMTP/Caldav/Carddav/LDAP Exchange Gateway

https://davmail.sourceforge.net/
1•todsacerdoti•25m ago•0 comments

Visual data modelling in the browser (open source)

https://github.com/sqlmodel/sqlmodel
1•Sean766•28m ago•0 comments

Show HN: Tharos – CLI to find and autofix security bugs using local LLMs

https://github.com/chinonsochikelue/tharos
1•fluantix•28m ago•0 comments

Oddly Simple GUI Programs

https://simonsafar.com/2024/win32_lights/
1•MaximilianEmel•28m ago•0 comments

The New Playbook for Leaders [pdf]

https://www.ibli.com/IBLI%20OnePagers%20The%20Plays%20Summarized.pdf
1•mooreds•29m ago•1 comments

Interactive Unboxing of J Dilla's Donuts

https://donuts20.vercel.app
1•sngahane•30m ago•0 comments

OneCourt helps blind and low-vision fans to track Super Bowl live

https://www.dezeen.com/2026/02/06/onecourt-tactile-device-super-bowl-blind-low-vision-fans/
1•gaws•32m ago•0 comments

Rudolf Vrba

https://en.wikipedia.org/wiki/Rudolf_Vrba
1•mooreds•32m ago•0 comments

Autism Incidence in Girls and Boys May Be Nearly Equal, Study Suggests

https://www.medpagetoday.com/neurology/autism/119747
1•paulpauper•33m ago•0 comments

Wellness Hotels Discovery Application

https://aurio.place/
1•cherrylinedev•34m ago•1 comments

NASA delays moon rocket launch by a month after fuel leaks during test

https://www.theguardian.com/science/2026/feb/03/nasa-delays-moon-rocket-launch-month-fuel-leaks-a...
1•mooreds•35m ago•0 comments

Sebastian Galiani on the Marginal Revolution

https://marginalrevolution.com/marginalrevolution/2026/02/sebastian-galiani-on-the-marginal-revol...
2•paulpauper•38m ago•0 comments

Ask HN: Are we at the point where software can improve itself?

1•ManuelKiessling•38m ago•2 comments