frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

How AI Became More Expensive Than the Workers It Replaced [video]

https://www.youtube.com/watch?v=cfaZZPjA3g0
1•Bender•38s ago•0 comments

Linux DRM Scheduler Patches Yield Improvement for Job Submission Latency

https://www.phoronix.com/news/DRM-Scheduler-Lower-Job-Submit
1•Bender•3m ago•0 comments

Don't Hang Up on AI Scammers. Do This Instead [video]

https://www.youtube.com/watch?v=lk3jCuITwcE
1•wisemanwillhear•3m ago•0 comments

Show HN: Mise – A keyboard-driven Python/Qt6 browser built for fanless laptops

https://github.com/Rakosn1cek/Mise
1•Rakosn1cek•7m ago•0 comments

Exclusive-Meta's Zuckerberg says AI agent tech progressing slower than expected

https://finance.yahoo.com/technology/ai/articles/exclusive-zuckerberg-says-ai-agent-201123441.html
1•_____k•7m ago•0 comments

Show HN: Sieze the means of production from our agentic overlords

https://github.com/Xophmeister/wean
2•Xophmeister•9m ago•0 comments

Show HN: I built an encrypted BLE dongle for pasting stuff to air-gapped devices

https://github.com/Brisk4t/ToothPaste
2•Brisk4t•10m ago•1 comments

Operation Ivy Bells

https://en.wikipedia.org/wiki/Operation_Ivy_Bells
2•m-hodges•12m ago•0 comments

Visualize how many files in a codebase you contributed

https://app.principal-ade.com/anomalyco/opencode
1•fernando-ram•15m ago•0 comments

Early Web Links

https://earlyweblinks.com/
1•bookofjoe•21m ago•0 comments

Arroup – record screen, edit, share with link

https://www.arroup.com/
1•vladsmigelski•24m ago•1 comments

How to build a full body ultrasound [video]

https://www.youtube.com/watch?v=4nzzpUKhj1M
1•Element_•24m ago•0 comments

$85,000 in tokens later: What I learned from scaling agentic coding at Lovable

https://lovable.dev/blog/85000-in-tokens-later-scaling-agentic-coding-at-lovable
2•aliclark•24m ago•1 comments

A Peculiarly Dutch Summer Rite: Children Let Loose in the Night Woods (2019)

https://www.nytimes.com/2019/07/21/world/europe/netherlands-dropping-children.html
1•edward•25m ago•0 comments

One Month of Ecosystem Security Engineering

https://thephp.foundation/blog/2026/06/23/one-month-of-ecosystem-security-engineering/
1•campuscodi•26m ago•0 comments

Show HN: Using Wake-on-LAN for an AI Project

https://guilhermefrj.medium.com/i-built-a-local-chatgpt-killer-on-a-single-rtx-5080-heres-everyth...
1•guilhermef•27m ago•0 comments

The Unique Universe (2009)

https://physicsworld.com/a/the-unique-universe/
3•mellosouls•32m ago•0 comments

GTA 2 ported to JavaScript, with WebRTC P2P multiplayer

https://gta2js.vercel.app/
3•possiblelion•35m ago•1 comments

Babel, a construction that builds and unbuilds itself forever

https://sand-morph.up.railway.app/babel
1•echohive42•35m ago•0 comments

Ask HN: New employeer not providing equipment

7•gl9•41m ago•14 comments

Protocol Prying: Vulnerability Research in AirDrop and Quick Share

https://arxiv.org/abs/2606.26967
2•logickkk1•45m ago•0 comments

Providence AI

https://providenceai.app
1•j_anderssen•46m ago•0 comments

Rare things become common at scale (2014)

https://longform.asmartbear.com/scale-rare/
5•Tomte•46m ago•0 comments

Show HN: Grade your code's post-quantum crypto exposure A–F, free, in-browser

https://throndar.ai/cbom
1•algo26•46m ago•0 comments

Bitpoint: Turn a directory of Python files into HTTP endpoints

https://github.com/tanrax/bitpoint
1•andros•47m ago•0 comments

Build some capacity to be alone

https://felipevanbeetz.substack.com/p/build-some-capacity-to-be-alone
1•felipevb•50m ago•0 comments

Trees are mostly made of air and a generalizable lesson for AI safety

https://www.lesswrong.com/posts/xiTBpBDwubnr4MLRe/trees-are-mostly-made-of-air-and-a-generalizabl...
2•paulpauper•52m ago•0 comments

A Twist in This Year's Strangest Literary AI Scandal

https://www.theatlantic.com/technology/2026/07/commonwealth-prize-ai-writing-jamir-nazir/687806/
2•paulpauper•52m ago•0 comments

What I Learned from ROTC

https://www.theatlantic.com/ideas/2026/07/army-military-service-training-lessons/687589/
2•paulpauper•53m ago•0 comments

The Medici effect: multidisciplinary insights for entrepreneurship research

https://www.emerald.com/neje/article/27/1/2/1231319
1•bookofjoe•56m ago•0 comments