Here's how we got here:
We started with standard FIM (fill-in-the-middle) autocomplete. Other AI code completion tools in JetBrains can only insert code at your cursor position. That's helpful when writing assert statements in unit tests, but it's much less useful for changes like adding a new parameter to a function.
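To make this concrete, here's a rough sketch of how a FIM prompt is usually built. The sentinel strings are placeholders for model-specific special tokens, not our exact format:

```python
def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Ask the model to generate the 'middle' between the text before and after the cursor.
    The <fim_*> sentinels are placeholders for model-specific special tokens."""
    return f"<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>"

document = "def add(a, b):\n    return \n\nprint(add(1, 2))\n"
cursor = document.index("return ") + len("return ")
prompt = build_fim_prompt(document[:cursor], document[cursor:])
# The model can only insert new text at the cursor (e.g. "a + b");
# it has no way to modify the code that's already there.
```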
Autocomplete feels much better when it can rewrite code in addition to simply adding code. This is typically called "next-edit."
To get this capability, we trained a model on granular user actions such as arrow key movements, cursor jumps, and keystroke-level data. This works really well for tasks like adding enumerate to a Python for-loop, refactoring conditionals, and other repetitive changes.
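For example, here's an illustrative before/after of the enumerate case (not actual model output):

```python
items = ["a", "b", "c"]

# Before: the user has a plain loop and wants the index as well.
for item in items:
    print(item)

# After: a next-edit model rewrites the existing loop header instead of only
# inserting new text at the cursor.
for i, item in enumerate(items):
    print(i, item)
```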
Another problem is that this can be very slow. Out of the box with vLLM, each request had a median latency of around 1500ms, which is far too long for autocomplete. To optimize this, we rewrote TensorRT-LLM to support N-gram speculative decoding, which lets us serve completions at a median latency of 94ms. We wrote more about this here: https://blog.sweep.dev/posts/next-edit-jetbrains
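The intuition behind N-gram speculative decoding: for next-edit, most of the output repeats the input code, so we can draft future tokens by matching the most recent n-gram against text already in the context, then verify the whole draft with the model in a single forward pass. Here's a simplified sketch of the drafting side (real implementations work on token IDs inside the inference engine):

```python
def propose_ngram_draft(tokens: list[int], ngram_size: int = 3, max_draft: int = 8) -> list[int]:
    """Propose draft tokens by finding an earlier occurrence of the current
    trailing n-gram and copying the tokens that followed it.

    Because next-edit output mostly repeats the input, these drafts are
    usually accepted, so many decode steps can be skipped."""
    if len(tokens) < ngram_size:
        return []
    tail = tokens[-ngram_size:]
    # Search backwards for the most recent earlier occurrence of the trailing n-gram.
    for start in range(len(tokens) - ngram_size - 1, -1, -1):
        if tokens[start:start + ngram_size] == tail:
            return tokens[start + ngram_size:start + ngram_size + max_draft]
    return []


# Toy example: the n-gram 7, 8, 9 appeared earlier, so we draft the tokens
# that followed it and let the model accept or reject them in one verification pass.
print(propose_ngram_draft([7, 8, 9, 10, 11, 1, 2, 7, 8, 9], max_draft=2))  # -> [10, 11]
```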
Finally, for full codebase context awareness, we have one unique advantage over VS Code: the JetBrains index (exposed through their Program Structure Interface) is exceptionally well-built and already covers the entire project, which means we can quickly look up the definition of any function or class. To get extremely precise codebase context, we fetch the definitions of the code symbols around your cursor and pass them to our model.
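Conceptually, that context step looks something like the sketch below. The lookup_definition interface here is hypothetical (the real resolution happens through PSI on the plugin side); the key idea is that we only pull in definitions for symbols that actually appear near the cursor:

```python
import re
from typing import Callable

def collect_context(
    lines: list[str],
    cursor_line: int,
    lookup_definition: Callable[[str], str | None],  # hypothetical: resolves a symbol to its definition text
    window: int = 10,
    max_chars: int = 4000,
) -> str:
    """Gather definitions of identifiers near the cursor to prepend to the model's prompt."""
    start = max(0, cursor_line - window)
    nearby = "\n".join(lines[start:cursor_line + window + 1])
    seen: set[str] = set()
    chunks: list[str] = []
    for symbol in re.findall(r"[A-Za-z_][A-Za-z0-9_]*", nearby):
        if symbol in seen:
            continue
        seen.add(symbol)
        definition = lookup_definition(symbol)  # in practice, answered by the IDE's index
        if definition:
            chunks.append(definition)
            if sum(len(c) for c in chunks) > max_chars:
                break
    return "\n\n".join(chunks)
```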
We've spent a lot of time getting the details right, and we'd love to get your thoughts and feedback!