Brokk: AI for Large Codebases

https://brokk.ai
51•handfuloflight•1y ago

Comments

jbellis•1y ago
Hi all, Brokk creator here, happy to answer any questions!

I made an intro video with a live demo here: https://www.youtube.com/watch?v=Pw92v-uN5xI

soco•1y ago
Is there something also to read for those of us who will never watch videos?
lutzleonhardt•1y ago
Hi, yes there are some blog posts:

https://brokk.ai/blog/brokk-under-the-hood

bchapuis•1y ago
Really cool project! I tried it a couple of weeks ago with an Anthropic API key and will give it another shot.

Could you share a bit more about how you handle code summarization? Is it mostly about retaining method signatures so the LLM gets a high-level sense of the project? In Java, could this work with dependencies too, like source JARs?

More generally, how’s it been working with Java for this kind of project? Does limited GPU access ever get in the way of summarization or analysis (Jlama)?

jbellis•1y ago
That officially makes you an early adopter, thanks!

Yes, it's basically just parsing for declarations. (If you double-click on any context in the Workspace it will show you exactly what's inside.)
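To make "parsing for declarations" concrete, here is a minimal sketch of the general idea, not Brokk's actual code. It uses Python's standard `ast` module on Python source purely for illustration; the `skeleton` function and the `SAMPLE` input are hypothetical names, and Brokk's own analyzers are not shown in this thread.

```python
import ast

def skeleton(source: str) -> str:
    """Return declarations only: class headers and function signatures, no bodies."""
    lines = []

    def visit(node: ast.AST, depth: int) -> None:
        indent = "    " * depth
        for child in ast.iter_child_nodes(node):
            if isinstance(child, ast.ClassDef):
                bases = ", ".join(ast.unparse(b) for b in child.bases)
                header = f"class {child.name}({bases}):" if bases else f"class {child.name}:"
                lines.append(indent + header)
                # Recurse so that methods are listed under their class.
                visit(child, depth + 1)
            elif isinstance(child, (ast.FunctionDef, ast.AsyncFunctionDef)):
                keyword = "async def" if isinstance(child, ast.AsyncFunctionDef) else "def"
                returns = f" -> {ast.unparse(child.returns)}" if child.returns else ""
                # The body is replaced by "..." so only the signature survives.
                lines.append(f"{indent}{keyword} {child.name}({ast.unparse(child.args)}){returns}: ...")

    visit(ast.parse(source), 0)
    return "\n".join(lines)

# Hypothetical input used only to demonstrate the function.
SAMPLE = '''
class Cache:
    def get(self, key: str) -> bytes:
        data = self._load(key)
        return data

def main():
    print(Cache().get("a"))
'''

if __name__ == "__main__":
    print(skeleton(SAMPLE))
```

The summary keeps names, parameters, and return types while dropping method bodies, which is where most of the tokens in a source file usually go.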

You have to import the dependencies via File -> Decompile Dependency and then it gets parsed like the rest of your source, only read-only.

I have a love-hate relationship with Java, mostly love lately; the OpenJDK team is doing a great job driving the language forward. It's so much faster than Python, and it's nice being able to extend a language in itself and get native performance.

Since we're just using Jlama to debounce the LLM requests, we can use a tiny model that runs fine on CPU alone. The latest Jlama supports GPU as well but we're not using that.

neoncontrails•1y ago
I'd be interested to try this out. I'm especially keen on AI tools that implement a native RAG workflow. I've given Cursor documentation links, populated my codebase with relevant READMEs and diagram files that I'm hoping might provide useful context, and yet when I ask it to assist on some refactoring task it often spends 10-20 minutes simply grepping for various symbol names and reading through file matches before attempting to generate a response. This doesn't seem like an efficient way for an LLM to navigate a medium-sized codebase. And for an IDE with first-class LLM tooling, it is a bit surprising that it doesn't seem to provide powerful vector-based querying capabilities out of the box — if implemented well, a Google-like search interface to one's codebase could be useful to humans as well as to LLMs.

What does this flow look like in Brokk? Do models still need to resort to using obsolete terminal-based CLI tools in order to find stuff?

lutzleonhardt•1y ago
We implemented a multi-step process to find the required context:

1. Quick Context: Shows the most relevant files based on a PageRank algorithm (static analysis) and semantic embeddings (Jlama inference engine). The inputs are the instructions and the AI workspace fragments (i.e. files).

2. Deep Scan: A richer LLM receives the summaries of the AI workspace files (+instructions) and returns a recommendation of files and tests. It also recommends the type of inclusion (editable, read-only, summary/skeleton).

3. Agentic Search: The AI has access to a set of tools for finding the required files, and these tools are not limited to grep/rg. Instead you can:
- find symbols (classes, methods, ...) in the project
- ask for summaries/skeletons of files
- provide class or method implementations
- find usages of symbols (where is x used?)
- call sites (in/out)
- ...

You can read more about this in the Brokk.ai blog: https://brokk.ai/blog/brokk-under-the-hood
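As a rough illustration of the Quick Context step described above, the sketch below ranks files by blending a PageRank score over a file reference graph with cosine similarity between file embeddings and the instruction embedding. This is an assumption-laden toy, not Brokk's implementation: the linear blend, the `weight` parameter, the function names, and the hand-written vectors are all hypothetical, and the thread does not say how Brokk combines the two signals.

```python
import math

def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank over {file: [files it references]}."""
    nodes = sorted(set(graph) | {t for targets in graph.values() for t in targets})
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by files with no outgoing references is spread evenly.
        dangling = sum(rank[node] for node in nodes if not graph.get(node))
        new = {node: (1 - damping) / n + damping * dangling / n for node in nodes}
        for src, targets in graph.items():
            for target in targets:
                new[target] += damping * rank[src] / len(targets)
        rank = new
    return rank

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def quick_context(graph, embeddings, query_vec, weight=0.5, top_k=3):
    """Rank files by a hypothetical linear blend of graph rank and semantic similarity."""
    rank = pagerank(graph)
    top = max(rank.values())
    scores = {
        path: weight * rank.get(path, 0.0) / top + (1 - weight) * cosine(vec, query_vec)
        for path, vec in embeddings.items()
    }
    return sorted(scores, key=scores.get, reverse=True)[:top_k]

if __name__ == "__main__":
    # Made-up reference graph and 2-d "embeddings" for demonstration only.
    graph = {"App.java": ["Cache.java"], "Cache.java": ["Util.java"], "Util.java": []}
    embeddings = {"App.java": [1.0, 0.0], "Cache.java": [0.6, 0.8], "Util.java": [0.0, 1.0]}
    print(quick_context(graph, embeddings, query_vec=[1.0, 0.0]))
```

In a real system the embeddings would come from an inference engine rather than being written by hand, and the graph would come from static analysis of imports or symbol usages.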

silverlake•1y ago
No offense, but that video is brutally boring. Even at 1.5x speed I couldn’t get past 10 min. You should transcribe the audio and use an LLM to write a punchy sales pitch.
corysama•1y ago
How large is "Large"? Are we testing on Unreal Engine? :D
jbellis•1y ago
no, but I've tested on intellij (~5M loc, takes forever to import b/c of delombok, do not recommend)
lutzleonhardt•1y ago
I tested it with Ghidra recently and got very good results
saratogacx•1y ago
Likely not an important note but the name sounds close enough to grok that I assumed this was a spin off of some xAI product. I had to look around to see if it was actually associated (it looks like it isn't) but it may be something to be aware of.
tschellenbach•1y ago
wrote a guide on how to use cursor for large codebases here: https://getstream.io/blog/cursor-ai-large-projects/ working well over here

cool to see more AI tools address this

ElijahLynn•1y ago
Thank you! I think this is the next evolution of using LLMs for coding. Understanding all the context from large codebases...
lutzleonhardt•1y ago
The amazing thing here is that the Brokk AI can access your code like an IDE: it can ask for usages or gather the summary of a file before deciding to get the implementation of a method! It mimics how a dev navigates the codebase, and this is more reliable and token-efficient than the usual grep/rg approach.
esafak•1y ago
This ought to be an IDE plugin. Don't make me context switch.
danjl•1y ago
The "Read" file list sounds a lot like Copilot Edit mode, where you manually specify the list of files that are added to the context. Similarly, Copilot has an Ask (Chat) mode that doesn't change the code. One of the downsides of all these new IDEs is that it is difficult, even for the developers of those tools, to have enough time to test out coding in each of their competitors. Also, the switching cost of changing IDEs is pretty high, even if they are forks of the same code base, which makes it hard for the users to really test out all the options. In the long run, I expect that the "larger" IDE providers will purchase the smaller ones. IOW, if you wait long enough, all the good bits will be in Copilot (or maybe Cursor with their new funding).
jbellis•1y ago
(creator here)

idk, everyone else seems to want to take the 40 year old IDE paradigm we're all used to (really! that's how old Turbo Pascal 3 is!) and graft AI onto it. I think we need a fundamentally different design to truly take advantage of the change from "I'm mostly reading and writing code at human speeds" to "I'm mostly supervising the AI which is better at generating syntax than I am."

of course the downside to going against the crowd is that the crowd is usually right, we'll see how it goes!

danjl•1y ago
I am a huge supporter of completely re-working the IDE UI as well. I'm not arguing for keeping the existing IDE interfaces. I like that folks are experimenting with entirely new interfaces. In fact, I'd go further and suggest that all of the overly complex interfaces used on any sort of content-creation app, like Unity, Unreal, Photoshop, as well as code IDEs, will eventually be completely refactored to remove all the old complexity in favor of either chat-based or other AI-driven interfaces. My point is simply that there are too many new AI-driven IDEs for folks to try out, even the developers of those IDEs. Many of the features in Brokk that were seemingly described in the Brokk 101 blog video as "differentiators" are existing Copilot features. Has the author ever used Copilot? Or just Cursor? Or another AI variant?
danjl•1y ago
I'd love to see things like Brokk experiment a bit more with what other information to include in our git repositories, besides the code, that helps improve AI-based code generation. For example, perhaps the repo should include more design information about the look-and-feel, as visual information or Figma files, rather than just, say the CSS and HTML. Or it might help if the repository included more business requirements so that the AI has better information to guide prioritization of changes. Obviously other bits, like coding standards, should be included as well, though perhaps using a larger context might mitigate the need for coding standards if the generated code followed the existing code (which often doesn't happen).
bb88•1y ago
I think that's what's going to happen over time. We're going to be writing more and more code, but supervising an AI mostly.

The big problem is that we're treating the AI as an all knowing oracle. And probably what we should be doing is treating the AI as a colleague -- allowing it to ask questions about the code base to develop the subtle clues.

Often what happens is that subtlety is lost on the code base, and sometimes, AI will think it's an outright error, when in fact it's completely on purpose.

Comments go a long way towards this end, but in large legacy codebases, comments may not exist, and the coders expected people to understand at first glance the code was correct.

Test driving Junie, I've had it remove a feature it thought was broken code, and then it fixes the unit tests, instead of trying to understand if the unit test was actually broken or the feature was broken.

insin•1y ago
LLM for Large Codebases
