The real annoying thing about Opus 4.5 is that it's impossible to tell most people "Opus 4.5 is an order of magnitude better than coding LLMs released just months before it" without sounding like an AI hype booster clickbaiting, but it's the counterintuitive truth. To my continual personal frustration.
That's not the fault of Opus 4.5, because like all AI nonsense it's still not worth the cost. The privacy given up by having to authenticate with services like GitHub that used to be publicly available, before getting constantly DDoSed by AI bots. The reliability and freedom that evaporated into the ether as folks ran to the shelter of Cloudflare to mitigate the endless DDoS attacks at the hands of AI data scrapers. The emotional and social development stunted by having AI chatbots pretend to be a significant other and only say what folks want to hear. Whether Opus "can" code is immaterial.
It's good enough for things I can define well and write okay code for.
But it is far from perfect.
It does too much, like any LLM. For example, I had some test cases for methods that had been deleted, and I was being lazy and didn't want to read a huge test file, so I asked it to fix them.
It did. Tests were green, because it mocked the non-existent methods, when it should have just deleted the test cases as they were no longer needed.
Luckily, I read the code it produced.
The same thing happened with some decorators I asked it to write in Python. It produced working code, tests were fine, but I reworked the code manually to 1/10 of the size proposed by Opus.
It seems magical, even like it's thinking, but like all LLMs, it is not. It is just a trap.
When an LLM tries to do the wrong thing, don't correct it with a new instruction. Instead, edit your last prompt and give more details there.
LLMs have a limited context length, and they love to stick to their previous error. Just edit the previous prompt. Don't let the failed attempt pollute your context.
And the code-size thing is not fixed by a better prompt.
It also likes to ignore a reasonable plan it has written itself, just to add more code.
yeah it writes so much code it's crazy - when, like you mentioned, it could be solved with 1/10th
I mean, they are in the token business, so expect this to continue for as long as they can get away with it - as long as they are a bit better than the competition.
This is what 99% of devs who praise Claude Code don't notice. The real productivity gains are much lower than 10x.
Maybe they are like 2x tops.
The real gain is that you can be lazy now.
In reality, most tasks you do with an LLM (not talking about greenfield projects; those are vanity metrics) can be completed by a human in mostly the same time with 1/10th of the code. The catch is that you need to actually think and work instead of talking to a chat or watching YouTube while the prompt is running - which becomes 100x harder after you use LLMs extensively for a week or so.
The problem is that these increases in model performance are like the boy who cried wolf. There's only so many times you can say "this model is so much better, and does X/Y/Z more/less" and have it _still_ not be good enough for general use.
From my experience Opus is only so-so at writing Rust. But it's great at something like TS, because the amount of code it has been trained on is probably orders of magnitude bigger for the latter language.
I still use Codex high/xhigh for planning and once the plan is sound I give it to Opus (also planning). That plan I feed back to Codex for sign-off. It takes an average additional 1-2 rounds of this before Opus makes a plan that Codex says _really_ ticks all the boxes of the plan it made itself and which we gave to Opus to start with ...
That tells you something.
Also when Opus is "done" and claims so, I let Codex check. Usually it has skipped the last 20% (stubs/todos/logic bugs), so Codex makes a fixup plan that then again goes through the Codex<->Opus loop of back and forth 2-3 rounds before Codex gives the thumbs up. Only after that has Opus managed to do what the initial plan said that Codex made in the first place.
When I have Opus write TS code (or Python) I do not have to jump through those hoops. Sometimes one round of back and forth is needed but never three, as with Rust.
Interesting. I thought C++ interop was one of the top priorities right now.
It’s one of the top items mentioned in recent language progress reports, the Rust foundation received a million dollar grant to work on it, and there was a talk at the most recent RustConf about how Google is doing Rust/C++ interop.
Curious to know what discussions led to that conclusion.
The latest on this project is in this update - https://github.com/rust-lang/rust-project-goals/issues/388. They’re working on it, but no progress yet.
I think the author wanted something now, which is a completely acceptable reason to start his own project.
I don't know if they later changed their minds. From the meeting notes it seemed they didn't want to implement a C++ frontend in rustc.
How do you handle function lifetimes then? Those are generally non-local to infer, and Rust requires annotating functions with that information. I tried taking a look at the mako db's refactor but I didn't see any lifetime annotation being added there.
Functions need annotations like "return value lives as long as argument 1" or "return value lives as long as both arguments are alive"
It does, but it's under the "External Annotations" section:
// External annotations go in a header file
// @external: {
// strlen: [safe, (const char* str) -> owned]
// strcpy: [unsafe, (char* dest, const char* src) -> char*]
// strchr: [safe, (const char* str, int c) -> const char* where str: 'a, return: 'a]
//
// // Third-party libraries work the same way
// sqlite3_column_text: [safe, (sqlite3_stmt* stmt, int col) -> const char* where stmt: 'a, return: 'a]
// nlohmann::json::parse: [safe, (const string& s) -> owned json]
// }
> The where clause specifies lifetime relationships - like `where stmt: 'a, return: 'a`, which means the returned pointer lives as long as the statement handle. This lets the analyzer catch dangling pointers from external APIs.

The GitHub repo also has an annotations guide with some more info [0]. The general syntax appears to be:
// @lifetime: (parameters) -> return_type where constraints
[0]: https://github.com/shuaimu/rusty-cpp/blob/main/docs/annotati...

The ones in `@external` seem to be limited to C++ definitions outside the user's control.
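To make the "catch dangling pointers from external APIs" part concrete, here is a sketch of the bug class such an annotation describes; the functions are hypothetical and not from the repo:

```cpp
#include <cassert>
#include <cstring>
#include <string>

// strchr's result points into its argument, so it must not outlive it.
// That is exactly the relationship the quoted annotation encodes:
//   strchr: [safe, (const char* str, int c) -> const char* where str: 'a, return: 'a]
const char* first_e(const std::string& s) {
    return std::strchr(s.c_str(), 'e');   // OK: the caller still owns s
}

const char* dangling_e() {
    std::string tmp = "hello";
    return std::strchr(tmp.c_str(), 'e'); // BUG: tmp is destroyed at return;
                                          // this is what the analyzer flags
}
```

An analyzer with only the `@external` annotation and this file in scope has enough information to accept `first_e` and reject `dangling_e`.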
- Pragmatic syntax choice: The comment-based annotation system is indeed a clever solution that minimizes friction for adoption
- Avoiding past pitfalls: By learning from previous safety proposals, this approach sidesteps the intrusive syntax issues that hindered earlier efforts
- Incremental adoption pathway: The ability to gradually introduce safety guarantees without requiring a complete rewrite is a game-changer for legacy codebases
- Democratizing compiler expertise: Leveraging LLMs to tackle problems that traditionally required specialized knowledge is an exciting development
Overall, this represents a promising step forward in bridging the gap between C++ and Rust's safety guarantees. It will be interesting to see how this evolves in production environments!
Any time a coder modifies a function, the safe/unsafe-ness of the function will have to be re-audited.
People complain about comments getting out of sync with the code - seems like the same thing will occur with safe/unsafe comments attached to functions unless the developers are diligent enough to verify nothing has changed on any PR.
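That drift risk can be sketched like this (a hypothetical function, reusing the project's annotation style from the quoted example):

```cpp
#include <cassert>
#include <cstring>

// The comment annotation still claims:
//   // @external: { make_copy: [safe, (const char* s) -> owned] }
// ...but a later "optimization" added a fast path that returns a borrowed
// pointer. The comment compiles either way; only diligent review or a
// re-run of the checker catches the mismatch.
const char* make_copy(const char* s) {
    if (*s == '\0') return s;             // drift: borrowed, not owned!
    char* copy = new char[std::strlen(s) + 1];
    std::strcpy(copy, s);
    return copy;                          // owned, as the annotation claims
}
```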
My other quibble from the article concerns:
...It requires going through the AST. And since the static analysis is mostly statically scoped, it doesn’t require heavy cross-file analysis. It can be single-file based, very limited scope.
The large C++ codebases I've seen have not been diligent wrt object ownership. Objects may get accessed in different files - not saying this is correct, just that it happens. Objects can be accessed/modified by other non-owning objects in inconsistent ways, which leads to inconsistent behaviour, especially when errors occur.

The most impressive C++ static analyzer I've seen is Intrinsa's PREfix product, bought by Microsoft back in the 1990s. They parsed the C++ code using a purchased C++ frontend parser (can't recall the company name, but there are only a handful of companies that sell this stuff) and stored the data references in a database. Then they'd do dataflow analysis of the codebase looking for bugs.
They came out with PREfast which does simpler realtime static analysis (more like lint really) and VC contains a version of this. I think the MS DDK also includes a static code analyzer based on this.
But considering the effort put into guiding the AI versus rolling your own code in your spare time and having to reload the context for your static analyzer while dumping out work-related information, we're taking baby steps into a new age/paradigm for software development.
Just think if this article had been posted five or ten years ago. The technology isn't perfect and it has a long ways to go. Let's hope we don't go down too many wrong paths.
However, looking at the recent commits it doesn't quite look like the most solid foundation: https://github.com/shuaimu/rusty-cpp/commit/480491121ef9efec...
fn is_interior_mutability_type(type_name: &str) -> bool {
type_name.starts_with("rusty::Cell<") ||
type_name.starts_with("Cell<") ||
type_name.starts_with("rusty::RefCell<") ||
type_name.starts_with("RefCell<") ||
// Also check for std::atomic which has interior mutability
type_name.starts_with("std::atomic<") ||
type_name.starts_with("atomic<")
}
… which then 30 minutes later is being removed again because it turns out to be completely dead code: https://github.com/shuaimu/rusty-cpp/commit/84aae5eff72bb450...

There's also quite a lot of dead code. All of these warnings are about unused variables, functions, structs, fields:
warning: `rusty-cpp` (bin "rusty-cpp-checker") generated 90 warnings (44 duplicates)

Generated with [Claude Code](https://claude.ai/code)
via [Happy](https://happy.engineering)
Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: Happy <yesreply@happy.engineering>
This isn't just vibe code. It's mobile vibe code. No logic, no coherence, just inconsistency.
---
Note: This is an experimental shitpost. Fork it. Share it. Use it. 🚀
It also looks like it's skipping some lifetime checks in some sketchy way
And I don't think it's constructive to cherrypick commits in this context.
> I even started trying out the fully autonomous coding: instead of examining its every action, I just write a TODO list with many tasks, and ask it to finish the tasks one by one.
> I never had to fully understand the code. What I had to do is: I asked it to give me a plan of changes before implementation, it gave me a few options, and then I chose the option that seemed most reasonable to me. Remember, I’m not an expert on this. I think most of the time, anybody who has taken some undergraduate compiler class would probably make the right choice.
The idea has merits. Take it as a PoC.
I don't understand why you feel it's not "constructive" to review the quality of code of a project. Are people supposed to just blindly believe in the functionality without peeking under the hood?
Initial praise does not preclude rudeness. And complaining about a commit that was undone 30 minutes later is not only pointless in the presented context, it's a cheap attempt at insulting.
> Are people supposed to just blindly believe in the functionality without peeking under the hood
False dichotomy. No one said that. And we both know this is not the way regardless of the codebase.
I think the idea has merits and given the honesty of the post, it's rather more productive to comment on it instead.
Does it? There have been a gazillion such static analyzers. They all do one of two things: ignore the hard parts or tackle the hard parts. If you ignore the hard parts, your tool is useless. If you tackle the hard parts, your tool is orders of magnitude more complex and it still struggles to work well on real-world projects. This is in the former category.
The article says "And since the static analysis is mostly statically scoped, it doesn’t require heavy cross-file analysis."
Oops. Suddenly you either handle aliasing soundly and your tool is plagued with zillions of false positives or you handle aliasing unsoundly and... you aren't getting what makes rust different. Separate compilation has been a problem for C++ analyzers for ages. Just declaring it to not actually be a big deal is a huge red flag.
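Here is a minimal illustration of why aliasing breaks single-file reasoning (my own sketch, not from the project). In Rust, `fn step(a: &mut i32, b: &mut i32)` guarantees the two references never alias; in C++ nothing stops the caller, and the result silently changes:

```cpp
#include <cassert>

// A single-file analyzer must either assume a and b might alias everywhere
// (drowning in false positives) or assume they never do (unsound).
void step(int* a, int* b) {
    *a += 1;
    *b *= 2;
}

int distinct() { int x = 5, y = 5; step(&x, &y); return y; } // 5 * 2 = 10
int aliased()  { int x = 5;        step(&x, &x); return x; } // (5 + 1) * 2 = 12
```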
Heck, even just approaching this as an AST-level analysis is going to struggle when you encounter basic things like templates.
The article says this: "Everybody tries to fix the language, but nobody tries to just analyze it." This is just flagrantly false. What's bizarre is that there are people at Stony Brook who have done this. Also, introducing new syntax (even if they are annotations) is more-or-less the same thing as "fixing the language" except that there is almost no chance that your dependencies (including the standard library) are annotated in the way you need.
I'm not sure if it's a good or a bad thing that people expect the robots to produce proper code on the first attempt?
That problem seems even more prevalent in Rust, where I see Arc used everywhere, presumably as a cop-out not to have to figure out how to satisfy the borrow checker in smarter ways.
Both languages lack a good way to handle circular references should you need them (again, my Rust isn't strong, but I think that is right). You are correct to say avoid them - but sometimes you need them.
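For what it's worth, the usual (admittedly clunky) workaround in both languages is to break one direction of the cycle with a weak reference; `shared_ptr`/`weak_ptr` below, and Rust's `Rc`/`Weak` pair works the same way. A sketch:

```cpp
#include <cassert>
#include <memory>

struct Parent;
struct Child {
    std::weak_ptr<Parent> parent;   // weak: does not keep Parent alive
};
struct Parent {
    std::shared_ptr<Child> child;   // strong: Parent owns Child
};

// Without the weak link, Parent and Child would keep each other alive forever.
bool parent_alive(const std::shared_ptr<Child>& c) {
    return !c->parent.expired();
}
```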
C++ is not limited to unique_ptr, the language (unlike Rust) allows you to define your own semantics of what a value is. You can then work in terms of copying or moving values, which makes lifetime management trivial as they are scope-bound.
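A minimal sketch of that scope-bound value style: `Document` is a pure value, so copies and moves carry their own storage, everything dies with its scope, and there is no shared ownership to track (the type is illustrative, not from the thread's project):

```cpp
#include <cassert>
#include <string>
#include <vector>

struct Document {
    std::string title;
    std::vector<std::string> lines;
};

Document annotate(Document d) {            // taken by value: ours to mutate
    d.lines.push_back("-- reviewed --");
    return d;                              // moved out; nothing dangles
}
```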
C++ gives you more things, but none of them are enforced. (I'm sure Rust wants those same things at times - but since I'm not aware of anyone with ideas on how to enforce them, Rust has decided not to allow them. A reasonable choice overall, but sometimes annoying when it means you can't do something that you "know" is correct just because it can't be proved correct in the language.)
Valid programs don't need guardrails, since you need to satisfy those requirements for the program to be valid in the first place.
I want guard rails to ensure that I got everything right, not just 99.99% of the cases right.
Rust has inherited mutability, while I believe const in C++ is shallow. I don't think it's a perfect match.
Now you obviously can still have escape hatches and cast the const away whenever you want.
Yes, but if your struct contains references, the constness doesn't apply to what those references point to. In Rust it does.
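The shallowness is easy to demonstrate (my own sketch): `c` below is const, yet the int its member points at is freely mutable, whereas in Rust a struct seen through `&` freezes what its references reach (inherited mutability).

```cpp
#include <cassert>

struct Counter {
    int* value;
};

void bump(const Counter& c) {
    ++*c.value;     // compiles fine: const stops at the pointer itself
}
```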
Nothing prevents you from building a smart pointer with those semantics though, std::indirect is an example of this (arguably closer to Rust's Box).
So often the question AI-related pieces ask is "can AI do X?" when by far the more important question is "should AI do X?" As written, the piece reads as though the author has learned helplessness around C++, and their answer is to adopt a technology that leaves them even more helpless, which they indeed lament. I'd challenge the author to actually reflect on why they are so attached to this legacy software and why they cannot abandon it if it is causing this level of angst.
Many comments are right that this is a prototype and there isn't any correctness guarantee! It is purely test-case driven.
But this prototype sort of proves that to have Rust-equivalent memory safety, you don't really need to completely ditch C++, and all those "rewrite in Rust" clones of C++ repos. The time I spent on this project is very limited; I did maybe half of the dev on my phone through Happy. If Microsoft or Google, who have lots of C++ code, were willing to put some serious resources behind this idea, I am sure they could have something a lot more solid. And they wouldn't have to give up C++ (they shouldn't; ditching it would be very unclever engineering-wise).
To me personally this prototype is a "usable" alternative to Circle C++. It saved me a lot of hard debugging time.
Simple and elegant solution.