frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Evaluations for Testing Agentic AI

1•stichers•11s ago•0 comments

Amazon: "We are also forming a new engineering team in India "

https://twitter.com/PlumbNick/status/2017239458677231736
1•sergiotapia•1m ago•0 comments

Standalone Android utility apps and a VS Code companion I built

2•kalinuxer•3m ago•0 comments

Ask HN: Is free identity theft protection after a data breach worth the bother?

1•daoboy•3m ago•0 comments

Preserving Human Voices and Faces

https://www.vatican.va/content/leo-xiv/en/messages/communications/documents/20260124-messaggio-co...
1•swannodette•3m ago•0 comments

HumanConsumption.Live – Real-Time Global Animal Consumption Stats

https://www.humanconsumption.live/
1•speckx•5m ago•0 comments

Show HN: Hud – eBPF blocking detector for Tokio

https://cong-or.xyz/blocking-async-rust
1•cong-or•5m ago•1 comments

Why ChatGPT Can't Draw a Full Glass of Wine [video]

https://www.youtube.com/watch?v=160F8F8mXlo
1•ustad•6m ago•0 comments

AlphaGenome: AI for better understanding the genome

https://deepmind.google/blog/alphagenome-ai-for-better-understanding-the-genome/
1•Anon84•7m ago•0 comments

Microsoft Loses $440B in One of Tech's Largest Single-Day Drops

https://www.ghacks.net/2026/01/30/microsoft-loses-440-billion-in-one-of-techs-largest-single-day-...
3•speckx•9m ago•0 comments

AI found 12 of 12 OpenSSL zero-days

https://www.lesswrong.com/posts/7aJwgbMEiKq5egQbd/ai-found-12-of-12-openssl-zero-days-while-curl-...
1•jelsisi•9m ago•0 comments

Electric Fields Can Assist Prebiotic Reactivity on Hydrogen Cyanide Surfaces

https://pubs.acs.org/doi/10.1021/acscentsci.5c01497
1•PaulHoule•9m ago•0 comments

Show HN: CronPulse Community – Self-hosted job monitoring with alerts

https://github.com/techfort/cronpulse-community
1•joeminichino•10m ago•0 comments

Oracle seeks to build bridges with MySQL developers

https://www.theregister.com/2026/01/30/oracle_mysql/
1•mikece•11m ago•0 comments

Shitposting to Label Printers: Building an AirPrint Bridge for Cups

https://blog.slowest.network/post/5
2•indrora•12m ago•0 comments

Building vertical microfront ends on Cloudflare's platform

https://blog.cloudflare.com/vertical-microfrontends/
1•mikece•12m ago•0 comments

France Just Created Its Own Open Source Alternative to Microsoft Teams and Zoom

https://itsfoss.com/news/france-ditches-microsoft-teams-and-zoom/
7•mikece•13m ago•1 comments

Hive: Outcome driven agent development framework that evolves

https://github.com/adenhq/hive
1•simonpure•14m ago•0 comments

Show HN: AI Agent Architecture Patterns for Production Systems

https://github.com/devwithmohit/ai-agent-architecture-patterns
1•mohitdevops•15m ago•0 comments

Still In A Dream – my new book, out in June

http://blissout.blogspot.com/2026/01/still-in-dream-my-new-book-out-in-june.html
1•evo_9•15m ago•0 comments

Prime Radiant

https://dsehnal.github.io/prime-radiant/
1•dsehnal•16m ago•0 comments

How can I retain access to the data in a SAFEARRAY after my method returns?

https://devblogs.microsoft.com/oldnewthing/20260129-00/?p=112023
1•ibobev•18m ago•0 comments

Why Singapore and Estonia's EdTech Works, but America's Doesn't?

https://www.governance.fyi/p/why-singapore-and-estonias-edtech
3•guardianbob•19m ago•0 comments

Native lakehouse experience in Postgres powered by DuckDB and Ducklake

https://pgducklake.select
1•kakoni•21m ago•0 comments

How to Add a Quick Interactive Map to Your Website

https://blog.miguelgrinberg.com/post/how-to-add-a-quick-interactive-map-to-your-website
2•ibobev•21m ago•0 comments

Show HN: Cow, an humble AI for your terminal

https://github.com/jolexxa/cow
1•jolexxa•22m ago•0 comments

How We Exploited Qodo: From a PR Comment to RCE and AWS Admin Key – Leaked Twice

https://kudelskisecurity.com/research/qodo-dynaconf-aws-admin-key-leaked-twice
1•spiridow•23m ago•0 comments

The Vitalists: hardcore longevity enthusiasts who believe death is wrong

https://www.technologyreview.com/2026/01/29/1131815/vitalism-longevity-enthusiasts-influence/
1•rbanffy•24m ago•0 comments

Zo Computer

https://www.zo.computer/
2•erhuve•25m ago•0 comments

Low-power integrated optical amplification through second-harmonic resonance

https://www.nature.com/articles/s41586-025-09959-z
2•westurner•25m ago•0 comments