Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
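
As a rough illustration of the plan, execute, test, self-correct cycle described above, here is a minimal Python sketch. It is not Refact.ai's actual implementation: `run_agent`, `AgentRun`, the tool names `explore` and `modify`, and the stub tools are all hypothetical. Only the name `deep_analysis` comes from the post, and its behavior here is invented for the example.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class AgentRun:
    """Record of one multi-step run: the steps taken and the final patch."""
    steps: List[str] = field(default_factory=list)
    patch: str = ""
    passed: bool = False


def run_agent(
    task: str,
    tools: Dict[str, Callable[[str], str]],
    run_tests: Callable[[str], bool],
    max_iterations: int = 5,
) -> AgentRun:
    """Explore once, then loop modify -> test until tests pass or the budget ends."""
    run = AgentRun()
    context = tools["explore"](task)
    run.steps.append("explore")
    feedback = ""
    for _ in range(max_iterations):
        run.patch = tools["modify"](context + feedback)
        run.steps.append("modify")
        run.passed = run_tests(run.patch)
        run.steps.append("test")
        if run.passed:
            break
        # Self-correction: feed the failed patch to the (hypothetical) reasoning tool.
        feedback = tools["deep_analysis"](run.patch)
        run.steps.append("deep_analysis")
    return run


# Stub tools, purely illustrative: the first patch fails, the second succeeds.
stub_tools: Dict[str, Callable[[str], str]] = {
    "explore": lambda task: "ctx",
    "modify": lambda prompt: "fixed" if "hint" in prompt else "broken",
    "deep_analysis": lambda patch: " hint",
}

demo = run_agent("fix the bug", stub_tools, run_tests=lambda patch: patch == "fixed")
print(demo.steps, demo.patch, demo.passed)
```

With these stubs the run produces one final patch after a single round of self-correction, which mirrors the "one multi-step run, single solution" idea in spirit only.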

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
