frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete

https://huggingface.co/sweepai/sweep-next-edit-1.5B
106•williamzeng0•6h ago
Hey HN, we trained and open-sourced a 1.5B model that predicts your next edits, similar to Cursor. You can download the weights here (https://huggingface.co/sweepai/sweep-next-edit-1.5b) or try it in our JetBrains plugin (https://plugins.jetbrains.com/plugin/26860-sweep-ai-autocomp...).

Next-edit autocomplete differs from standard autocomplete by using your recent edits as context when predicting completions. The model is small enough to run locally while outperforming models 4x its size on both speed and accuracy.

We tested against Mercury (Inception), Zeta (Zed), and Instinct (Continue) across five benchmarks: next-edit above/below cursor, tab-to-jump for distant changes, standard FIM, and noisiness. We found exact-match accuracy correlates best with real usability because code is fairly precise and the solution space is small.

Prompt format turned out to matter more than we expected. We ran a genetic algorithm over 30+ diff formats and found simple `original`/`updated` blocks beat unified diffs. The verbose format is just easier for smaller models to understand.

Training was SFT on ~100k examples from permissively-licensed repos (4hrs on 8xH100), then RL for 2000 steps with tree-sitter parse checking and size regularization. The RL step fixes edge cases SFT can’t like, generating code that doesn’t parse or overly verbose outputs.

We're open-sourcing the weights so the community can build fast, privacy-preserving autocomplete for any editor. If you're building for VSCode, Neovim, or something else, we'd love to see what you make with it!

Comments

plutodev•2h ago
Nice work the next-edit framing matches how real refactors happen much better than token-level autocomplete.

The diff-format insight is especially interesting. Smaller models struggling with unified diffs lines up with what I’ve seen too simpler original/updated blocks reduce noise and improve intent capture.

On the infra side, training a 1.5B model in ~4 hours on 8×H100 is impressive. For folks experimenting with similar mid-scale models, we’ve been running comparable workloads on decentralized GPU aggregators (I’ve used io.net) to avoid cloud quota limits and keep costs predictable with the tradeoff that you handle orchestration yourself.

Curious if you saw diminishing returns when including older edits as context? That cutoff seems tricky in larger repos.

kamranjon•1h ago
I read the release but didn't quite understand the difference between a next-edit model and a FIM model - does anyone have a clear explanation of when to use one over the other? I'd love if there was a sublime plugin to utilize this model and try it out, might see if I can figure that out.
mgz•1h ago
I use Sweep’s Jetbrains autocomplete plugin daily, it really stands out.
8n4vidtmkvmk•24m ago
Better than the one that ships with Jetbrains?

I did buy their $100/yr AI but its about to run out.

dcreater•1h ago
Based on qwen2.5-coder? seems like a "why not/resume embellish/show VC" type release I guess
dang•1h ago
"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

https://news.ycombinator.com/newsguidelines.html

ing33k•1h ago
can it be integrated in monaco editor ?
bangaladore•1h ago
So SFT cost less only low hundreds of dollars? (1-10$ per hour per H100 if I'm seeing this correctly).

What about SFT?

Presumably basing this of Qwen is the reason it can be done for so cheap?

syntaxing•1h ago
Wow super fun read, I love how it went into the technical details. Any way to make it work with vscode?
martianlantern•1h ago
This is cool! I am more interested in how you guys generated next edit training data from repos, seems like there are lots of caveats here. Would love your insights

Again amazing work! waiting for what you guys cook next

sim04ful•58m ago
I'm very green to this so forgive if this question sounds silly:

Would instead of the RL step a constrained decoding say via something like xgrammar fix syntax generation issue ?

NitpickLawyer•34m ago
> Would instead of the RL step a constrained decoding say via something like xgrammar fix syntax generation issue ?

It can, but you have to consider two things here:

a) constrained decoding ensures adherence to syntax, not semantics. Say you're editing a field in an enum in rust. You can write syntactically correct rust code that doesn't address the new field further in the code (say in a switch). You'd get correctly syntactic code, but the compiler will scream at you. RL works on both.

b) if your goal is to further train the model, so it works on many tasks, RL helps with exploring new paths and training the model further. Constrained grammars help with inference, but the model doesn't "learn" anything. With RL you can also have many reward functions at the same time. Say one that rewards good syntax, one that rewards "closing" all the functions so tree-sitter doesn't complain, and one that rewards 0 errors from the compiler. The model gets to train on all 3 at the same time.

rationably•54m ago
Do you plan to release Sweep 3B/7B on HF?
_ache_•43m ago
Yeap, the two seems like game changer. For now, I'm using "Qwen2.5-Coder-7B". Sweep 1.5B is "just" 12 % point better than Qwen2.5-Coder, but Sweep 7B is 25% point better.
_ache_•48m ago
It's good. The blog post about it is very interesting. I hope, a plugin for neovim will be made soon.

https://blog.sweep.dev/posts/oss-next-edit

_boffin_•44m ago
Followed your work since the beginning and used it for inspiration for some cool demos on self-healing web scrapers. fascinating to see the transition from original concept to producing models. cool stuff.
whimsicalism•43m ago
Very interesting - and cool to read about the development process. I'd love to hear more about how genetic algorithm worked here.

I wonder whether we are perhaps the point of usefulness of 'next edit' code development in 2026 though.

dainiusse•12m ago
Any easy way to try on vscode?

Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete

https://huggingface.co/sweepai/sweep-next-edit-1.5B
109•williamzeng0•6h ago•18 comments

Doctors in Brazil using tilapia fish skin to treat burn victims

https://www.pbs.org/newshour/health/brazilian-city-uses-tilapia-fish-skin-treat-burn-victims
22•kaycebasques•1h ago•1 comments

From stealth blackout to whitelisting: Inside the Iranian shutdown

https://www.kentik.com/blog/from-stealth-blackout-to-whitelisting-inside-the-iranian-shutdown/
53•oavioklein•6h ago•4 comments

Threat actors expand abuse of Microsoft Visual Studio Code

https://www.jamf.com/blog/threat-actors-expand-abuse-of-visual-studio-code/
116•vinnyglennon•6h ago•66 comments

Show HN: ChartGPU – WebGPU-powered charting library (1M points at 60fps)

https://github.com/ChartGPU/ChartGPU
555•huntergemmer•15h ago•160 comments

Claude's new constitution

https://www.anthropic.com/news/claude-new-constitution
396•meetpateltech•14h ago•385 comments

Binary fuse filters: Fast and smaller than xor filters (2022)

https://arxiv.org/abs/2201.01174
81•redbell•4d ago•4 comments

Show HN: TerabyteDeals – Compare storage prices by $/TB

https://terabytedeals.com
124•vektor888•9h ago•73 comments

Show HN: High speed graphics rendering research with tinygrad/tinyJIT

https://github.com/quantbagel/gtinygrad
14•quantbagel•2h ago•5 comments

Skip is now free and open source

https://skip.dev/blog/skip-is-free/
356•dayanruben•15h ago•160 comments

Lix – universal version control system for binary files

https://lix.dev/blog/introducing-lix/
29•onecommit•6h ago•10 comments

Hands-On Introduction to Unikernels

https://labs.iximiuz.com/tutorials/unikernels-intro-93976514
5•valyala•5d ago•0 comments

Show HN: Rails UI

https://railsui.com/
147•justalever•11h ago•83 comments

Letting Claude play text adventures

https://borretti.me/article/letting-claude-play-text-adventures
104•varjag•5d ago•39 comments

TrustTunnel: AdGuard VPN protocol goes open-source

https://adguard-vpn.com/en/blog/adguard-vpn-protocol-goes-open-source-meet-trusttunnel.html
103•kumrayu•13h ago•26 comments

Show HN: RatatuiRuby wraps Rust Ratatui as a RubyGem – TUIs with the joy of Ruby

https://www.ratatui-ruby.dev/
108•Kerrick•4d ago•15 comments

Show HN: Open-source certificate from GitHub activity

https://certificate.brendonmatos.com
26•brendonmatos•3d ago•7 comments

Tell HN: 2 years building a kids audio app as a solo dev – lessons learned

91•oliverjanssen•16h ago•35 comments

Convert potentially dangerous PDFs to safe PDFs

https://github.com/freedomofpress/dangerzone
140•dp-hackernews•7h ago•45 comments

The WebRacket language is a subset of Racket that compiles to WebAssembly

https://github.com/soegaard/webracket
116•mfru•4d ago•24 comments

Beowulf's opening "What" is no interjection (2013)

https://www.poetryfoundation.org/poetry-news/69208/new-research-opening-line-of-beowulf-is-not-wh...
68•gsf_emergency_6•3d ago•51 comments

Golfing APL/K in 90 Lines of Python

https://aljamal.substack.com/p/golfing-aplk-in-90-lines-of-python
63•aburjg•5d ago•13 comments

Jerry (YC S17) Is Hiring

https://www.ycombinator.com/companies/jerry-inc/jobs/QaoK3rw-software-engineer-core-automation-ma...
1•linaz•8h ago

Waiting for dawn in search: Search index, Google rulings and impact on Kagi

https://blog.kagi.com/waiting-dawn-search
259•josephwegner•12h ago•155 comments

SIMD programming in pure Rust

https://kerkour.com/introduction-rust-simd
80•randomint64•2d ago•33 comments

Show HN: Differentiable Quantum Chemistry

https://github.com/lowdanie/hartree-fock-solver
19•lowdanie•4d ago•4 comments

Can you slim macOS down?

https://eclecticlight.co/2026/01/21/can-you-slim-macos-down/
207•ingve•22h ago•247 comments

Challenges in join optimization

https://www.starrocks.io/blog/inside-starrocks-why-joins-are-faster-than-youd-expect
61•HermitX•13h ago•11 comments

Nested code fences in Markdown

https://susam.net/nested-code-fences.html
215•todsacerdoti•17h ago•73 comments

Show HN: Dotenv Mask Editor: No more embarrassing screen leaks of your .env

https://marketplace.visualstudio.com/items?itemName=xinbenlv.dotenv-mask-editor
13•xinbenlv•6h ago•14 comments