frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: First autonomous ML and AI engineering Agent

https://marketplace.visualstudio.com/items?itemName=NeoResearchInc.heyneo
2•svij137•1h ago
Founder here. I built NEO, an AI agent designed specifically for AI and ML engineering workflows, after repeatedly hitting the same wall with existing tools: they work for short, linear tasks, but fall apart once workflows become long-running, stateful, and feedback-driven. In real ML work, you don’t just generate code and move on. You explore data, train models, evaluate results, adjust assumptions, rerun experiments, compare metrics, generate artifacts, and iterate; often over hours or days. Most modern coding agents already go beyond single prompts. They can plan steps, write files, run commands, and react to errors.

Where things still break down is when ML workflows become long-running and feedback-heavy. Training jobs, evaluations, retries, metric comparisons, and partial failures are still treated as ephemeral side effects rather than durable state. Once a workflow spans hours, multiple experiments, or iterative evaluation, you either babysit the agent or restart large parts of the process. Feedback exists, but it is not something the system can reliably resume from.

NEO tries to model ML work the way it actually happens. It is an AI agent that executes end-to-end ML workflows, not just code generation. Work is broken into explicit execution steps with state, checkpoints, and intermediate results. Feedback from metrics, evaluations, or failures feeds directly into the next step instead of forcing a full restart. You can pause a run, inspect what happened, tweak assumptions, and resume from where it left off.

Here's an example as well for your reference: You might ask NEO to explore a dataset, train a few baseline models, compare their performance, and generate plots and a short report. NEO will load the data, run EDA, train models, evaluate them, notice if something underperforms or fails, adjust, and continue. If training takes an hour and one model crashes at 45 minutes, you do not start over. Neo inspects the failure, fixes it, and resumes.

Docs for the extension: https://docs.heyneo.so/#/vscode

Happy to answer questions about Neo.

Comments

mring33621•1h ago
I'd love to try this, but i worry about embedded malware or other nastiness in random downloads.

The Darnella test of social media and smartphone regulation

https://heatherburns.tech/2026/01/16/the-darnella-test-of-social-media-and-smartphone-regulation/
1•hn_acker•41s ago•0 comments

Particle IoT acquired by Digi International for $50M

https://www.particle.io/blog/particle-is-being-acquired-by-digi-to-power-the-next-40-years-of-iot...
1•flycatcha•5m ago•0 comments

Thief of $90M in seized U.S.-controlled crypto is gov't contractor's son

https://www.web3isgoinggreat.com/single/lick-theft
2•pavel_lishin•5m ago•0 comments

Ukraine, Sudan, Syria, Yemen, America

https://heatherburns.tech/2026/01/26/ukraine-sudan-syria-yemen-america/
1•hn_acker•5m ago•0 comments

Technology Artisan

https://life-prog.com/tech/technology-artisan/
1•mrngilles•6m ago•0 comments

Trump Says He's Not Concerned with Decline of US Dollar

https://finance.yahoo.com/news/trump-says-not-concerned-decline-205831235.html
3•thomassmith65•7m ago•0 comments

New speech to text tool is better than willow and wispr flow?

https://www.breezevoice.com
1•NalinAtmakur•7m ago•0 comments

Photographer Transformed a Panasonic Lumix G9 II into a Leica Look-Alike

https://petapixel.com/2026/01/09/photographer-transformed-a-panasonic-lumix-g9-ii-into-a-leica-lo...
1•PaulHoule•8m ago•0 comments

NTSB Animation of Flight 5342 2025 Potomac River mid-air collision [video]

https://www.youtube.com/watch?v=LJ10ZOcWuC4
1•antongribok•9m ago•0 comments

Why is the app slow on Pixel 7?

https://binarysky.se/blog/2026/01/why-is-the-app-slow-on-pixel-7/
1•lindskogen•10m ago•0 comments

Sum, Product: A simple-to-understand mathematical paradox on meta-cognition

https://magarshak.com/blog/sum-product-and-the-limits-of-meta-knowledge/
1•EGreg•10m ago•0 comments

40 years later, a new look at lessons from the Challenger disaster

https://www.washingtonpost.com/politics/2026/01/27/challenger-space-shuttle-disaster-40-years/
2•bookofjoe•11m ago•2 comments

Chasing 6 TB/s: an MXFP8 quantizer on Blackwell

https://blog.fal.ai/chasing-6-tb-s-an-mxfp8-quantizer-on-blackwell/
1•amrrs•11m ago•0 comments

How-Dirty-Marketing-Works

https://how-dirty-marketing-works.onrender.com/
1•yashsm01•12m ago•0 comments

We Built Our IVR Scraper (and What Broke Along the Way)

https://phonesupported.dev/blog/how-we-built-our-ivr-scraper-and-what-broke-along-the-way/
1•fast_kalyan•13m ago•0 comments

Chrome, Edge Extensions Caught Stealing ChatGPT Sessions

https://www.securityweek.com/chrome-edge-extensions-caught-stealing-chatgpt-sessions/
1•Bender•16m ago•0 comments

Six JavaScript zero-day bugs lead to fears of supply chain attack

https://www.scworld.com/news/six-javascript-zero-day-bugs-lead-to-fears-of-supply-chain-attack
1•Bender•16m ago•0 comments

The sad and self-inflicted decline of the Washington Post, in one chart

https://www.natesilver.net/p/the-sad-and-self-inflicted-decline
3•JumpCrisscross•17m ago•0 comments

Coding with AI without giving up power

https://medium.com/@sig.segv/coding-with-ai-without-giving-up-power-e0c6ca257ad9
1•fwef64•17m ago•0 comments

Supreme Court to decide how 1988 videotape privacy law applies to online video

https://arstechnica.com/tech-policy/2026/01/supreme-court-to-decide-how-1988-videotape-privacy-la...
2•Bender•17m ago•0 comments

Units of measure in the KCL CAD language

https://www.ncameron.org/blog/kcl-part-1-units/
1•fanf2•17m ago•0 comments

Fun 0.37.62 – The programming language that makes you have fun

https://fun-lang.xyz/2026/01/25/announcing-fun-0.37.62/
1•hanez•18m ago•1 comments

GameStop's original bull is back. CEO Ryan Cohen has Berkshire-like ambitions

https://www.msn.com/en-us/money/savingandinvesting/gamestop-s-original-bull-is-back-ceo-ryan-cohe...
1•petethomas•19m ago•0 comments

How many users can a $50K AI workstation serve? Benchmark data

https://old.reddit.com/r/LocalLLaMA/comments/1qorbdk/dual_rtx_pro_6000_workstation_with_115tb_ram/
2•Blue_Cosma•20m ago•0 comments

AI gives cleaning instructions [video]

https://www.youtube.com/shorts/TP_p5YnRH00
1•6510•21m ago•0 comments

"IG is a drug": Internal messages may doom Meta at social media addiction trial

https://arstechnica.com/tech-policy/2026/01/tiktok-settles-hours-before-landmark-social-media-add...
6•mikestew•22m ago•0 comments

Kimi K2.5 1T runs on 2 M3 Ultras with mlx-lm in it's native precision

https://twitter.com/awnihannun/status/2016221496084205965
2•jeudesprits•24m ago•0 comments

Asteroid 2024 YR4 Has a 4% Chance of Hitting the Moon

https://www.universetoday.com/articles/asteroid-2024-yr4-has-a-4-chance-of-hitting-the-moon-heres...
1•belter•25m ago•0 comments

Hacker News Slop

https://www.hnslop.com/
4•jshchnz•26m ago•0 comments

Updated LLM Benchmark (Gemini 3 Flash)

https://entropicthoughts.com/updated-llm-benchmark
2•surprisetalk•26m ago•1 comments