frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of python

https://github.com/SWE-agent/mini-swe-agent
5•lieret•18h ago

Comments

lieret•18h ago
In 2024, we developed SWE-bench and SWE-agent at Princeton University and helped kickstart the coding agent revolution.

Back then, LMs were optimized to be great at chatting, but not much else. This meant that agent scaffolds had to get very creative (and complicated) to make LMs perform useful work.

But in 2025, LMs are actively optimized for agentic coding, and we ask:

*What the simplest coding agent that could still score near SotA on the benchmarks?*

*Turns out, it just requires 100 lines of code!*

And this system still *resolves 65% of all GitHub issues in the SWE-bench verified benchmark* with Sonnet 4 (for comparison, when Anthropic launched Sonnet 4, they reported 70% with their own scaffold that was never made public).

Honestly, we're all pretty stunned ourselves—we've now spent more than a year developing SWE-agent, and would not have thought that such a small system could perform nearly as good.

I'll link to the project below (all open-source, of course). The hello world example is incredibly short & simple (and literally what gave us the 65%). But it is also meant as a serious command line tool + research project, so we provide a Claude-code style UI & some utilities on top of that.

We have some team members from Princeton/Stanford here today, let us know if you have any questions/feedback :)

Oras•17h ago
Is there an option to learn from mistakes? most coding agents I tried, including the Sonnet 4 based one will make same mistake again and again in a new chat.

It would be great to have the agent adding a memory (even locally) to avoid mistakes, checking for new versions of libraries, and write a list of tasks first before the execution (similar to Kiro and Trae SOLO).

Mini-Swe-Agent

https://github.com/SWE-agent/mini-swe-agent
2•handfuloflight•11m ago•0 comments

Small Wars Manual

https://en.wikipedia.org/wiki/Small_Wars_Manual
1•Michelangelo11•15m ago•0 comments

Nginx / Nginx Plus High Performance Cookbook (2021) [pdf]

https://www.f5.com/content/dam/f5/corp/global/pdf/ebooks/NGINX_Cookbook-final.pdf
3•superjose•18m ago•1 comments

Vite plugin to break Tailwind CSS classes

https://github.com/borela-tech/multiline-tailwindcss/tree/main/packages/vite-plugin-multiline-tailwindcss
1•borela•23m ago•0 comments

Locality-Sensitive Hashing

https://en.wikipedia.org/wiki/Locality-sensitive_hashing
3•Bluestein•31m ago•0 comments

Crackable Worlds

https://domofutu.substack.com/p/crackable-worlds
1•domofutu•31m ago•0 comments

Blending education and artificial intelligence technology

https://ikignosis.github.io/
1•joaompinto•32m ago•0 comments

UK condemns Hong Kong cash offer for help in arresting activists

https://www.bbc.com/news/articles/cdx069we39xo
3•testrun•34m ago•0 comments

Spotify exodus over arms industry link

https://www.theguardian.com/music/2025/jul/26/king-gizzard-and-the-lizard-wizard-join-spotify-exodus-over-arms-industry-link-ntwnfb
1•torrance•34m ago•0 comments

PostgreSQL streaming replication characteristics on UNLOGGED tables

https://ivdl.co.za/2024/11/04/what-happens-if-you-enable-logging-on-an-unlogged-postgresql-table-with-streaming-replication/
1•Ianvdl•36m ago•0 comments

Show HN: Show HN: YouTube Controls Fix – Restore the Player Layout

https://greasyfork.org/en/scripts/543679-youtube-repositions-the-volume-button
1•ArcticLangoor•36m ago•0 comments

The Steely Dan Dictionary: 30th June 2025 – 25th anniversary

https://steelydandictionary.com
1•tempodox•37m ago•0 comments

The Case for Open Source Investment in Europe's Digital Sovereignty Push

https://www.techpolicy.press/the-case-for-open-source-investment-in-europes-digital-sovereignty-push/
2•jruohonen•43m ago•0 comments

Canada First (1930)

https://time.com/archive/6745625/canada-canada-first/
1•thomassmith65•44m ago•0 comments

Automating Oral Argument

https://adamunikowsky.substack.com/p/automating-oral-argument
1•gone35•44m ago•0 comments

Information Security Protection for EV Charging Stations

https://sinoevse.com/information-security-protection-for-ev-charging-stations/
1•infotechme•47m ago•1 comments

Show HN: I'm trying to make it easier to run local LLMs directly in the browser

https://github.com/jakobhoeg/built-in-ai
1•jakobhoeg•47m ago•0 comments

Terence Tao: Applying Red Team / Blue Team Duality to AI Workflows

https://mathstodon.xyz/@tao/114915604830689046
2•bertman•49m ago•0 comments

Add AI coding assistant configuration to Linux kernel

https://lore.kernel.org/workflows/20250725175358.1989323-1-sashal@kernel.org/
1•watusername•49m ago•0 comments

Ambigrammia: Between Creation and Discovery (Hofstadter, 2025)

https://yalebooks.yale.edu/book/9780300275438/ambigrammia/
2•lorenzuru•51m ago•1 comments

Heredocs Can Make Your Bash Scripts Self-Documenting

https://holdtherobot.com/blog/heredocs-can-make-your-bash-scripts-self-documenting/
2•chmaynard•56m ago•0 comments

Neovide: GUI for Neovim with Cool Features

https://neovide.dev/features.html
2•AbuAssar•1h ago•0 comments

The Thermodynamics of Trading

https://signalsandthreads.com/the-thermodynamics-of-trading/
3•tosh•1h ago•1 comments

Next edit prediction in Neovim (magenta.nvim)

https://github.com/dlants/magenta.nvim/pull/162
1•anonymid•1h ago•2 comments

Show HN: Auto Favicon MCP Server

https://github.com/dh1011/auto-favicon-mcp
5•dh1011•1h ago•0 comments

When JavaScript Decided My Day Starts at 9AM

https://senhongo.com/blog/when-javaScript-decided-my-day-starts-at-9am
2•SenHeng•1h ago•0 comments

Kind of Confusing

https://aeon.co/essays/how-jazz-and-dolphins-can-help-explain-consciousness
1•jruohonen•1h ago•0 comments

Acompanhador+de+ExercíCIOs

https://blog.nilo.pro.br/posts/2024-11-10-acompanhador/
1•morrison27•1h ago•0 comments

First You Create the Work and Then the Work Creates You: Nietzsche's Life

https://ristonthomas.com/essays/works
2•paul_riston•1h ago•1 comments

I Hate, Therefore I Am

https://www.nytimes.com/2025/07/23/opinion/hate-identity-conflict.html
3•Michelangelo11•1h ago•0 comments