Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•4mo ago

Comments

kate_at_refact•4mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Building a Reliable Cloud Live Streaming Pipeline for Netflix

https://netflixtechblog.com/building-a-reliable-cloud-live-streaming-pipeline-for-netflix-8627c60...
1•flavioribeiro•1m ago•0 comments

Understanding the New Economics of Attention

https://www.economist.com/finance-and-economics/2025/09/11/can-you-make-it-to-the-end-of-this-column
2•pseudolus•2m ago•1 comments

Tuberculosis Defenses

https://www.science.org/content/blog-post/tuberculosis-defenses
2•etiam•3m ago•0 comments

How Opus and o3 saved me from permanent blindness

https://mmaaz.ca/writings/blindness.html
1•pr337h4m•4m ago•0 comments

Magical Systems Thinking

https://www.worksinprogress.news/p/magical-systems-thinking
1•komape•5m ago•0 comments

Tips and Shortcuts for Better Browsing

https://www.google.com/chrome/tips/
1•kamaraju•6m ago•0 comments

Boring Is Good

https://jenson.org/boring/
1•zdw•9m ago•0 comments

In Memory of Mat Travizano

https://maraoz.com/mat/
2•wslh•11m ago•0 comments

Simplaix – Agent-first project management and workflow automation

https://simplaix.com/
1•hanyuan_peng•12m ago•1 comments

I made an AI expense tracker because I don't like typing

https://apps.apple.com/us/app/xpendai-track-your-expenses/id6752033430
1•bruuuuuuuuh•13m ago•0 comments

turdus merula — iOS downgrade tool for A9-A10X devices

https://sep.lol/
2•Lammy•13m ago•0 comments

White House Plans Broad Crackdown on Liberal Groups

https://www.nytimes.com/2025/09/15/us/politics/jd-vance-charlie-kirk-show.html
3•hughw•14m ago•0 comments

Repairing sequential consistency in C/C++11 [pdf]

https://plv.mpi-sws.org/scfix/full.pdf
3•fanf2•18m ago•0 comments

Free Startup Ideas

https://www.minimumviablenl.com/
1•minimumviable•18m ago•0 comments

Robinhood plans to launch a startups fund open to all retail investors

https://techcrunch.com/2025/09/15/robinhood-plans-to-launch-a-startups-fund-open-to-all-retail-in...
2•jaredwiener•23m ago•0 comments

Europe Is a Terrified Child

https://davekeating.substack.com/p/europe-is-an-abused-child
6•ironyman•23m ago•0 comments

The Adventures of Reemo Green [video]

https://www.youtube.com/watch?v=5bYA2Rv2CQ8
1•tantalor•25m ago•0 comments

Field-Programmable Logic 2025 Best Paper Awards and FPL Community Award

https://2025.fpl.org/program/best-paper-awards/
2•gnabgib•26m ago•0 comments

Godot 4.5, making dreams accessible – Godot Engine

https://godotengine.org/releases/4.5/
5•makepanic•28m ago•1 comments

ChatPerson, our new RI (real intelligence) service

https://www.mcsweeneys.net/articles/introducing-chatperson
2•Geekette•29m ago•1 comments

What problems are worth solving?

6•KopyWasTaken•29m ago•0 comments

Rustlantis: Randomized Differential Testing of the Rust Compiler

https://plf.inf.ethz.ch/research/oopsla24-rustlantis.html
4•mooreds•31m ago•0 comments

Deaths are projected to exceed births in 2031

https://www.cbo.gov/publication/61390
8•johntfella•31m ago•0 comments

CLion Introduces Constexpr Debugger

https://blog.jetbrains.com/clion/2025/09/introducing-constexpr-debugger/
2•vitaut•32m ago•0 comments

Show HN: Blocks – Dream work apps and AI agents in minutes

https://blocks.diy
3•shelly_•33m ago•0 comments

Stategraph – Terraform without the state file bottleneck

https://stategraph.dev
1•lawnchair•34m ago•0 comments

Widespread Data Theft Targets Salesforce Instances via Salesloft Drift

https://cloud.google.com/blog/topics/threat-intelligence/data-theft-salesforce-instances-via-sale...
1•mooreds•34m ago•0 comments

Ghost Kitchens Are Dying. Here's the $15B Lesson Every Restaurateur Must Learn

https://davidrmann3.substack.com/p/ghost-kitchens-are-dying-heres-the
4•mooreds•35m ago•3 comments

The importance of sandboxing and access control in AI agents

https://gr1m0ire.xyz/articles/sandboxing_ai_agents
1•gemini-15•35m ago•0 comments

Elon Musk Promises Full Self-Driving "Next Year" [2014-2024]

https://www.youtube.com/watch?v=B4rdISpXigM
11•JumpinJack_Cash•38m ago•0 comments