frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Ask HN: Why were green and amber CRTs more comfortable to read?

1•CalvinBuild•2m ago•0 comments

Show HN: MCP Codebase Index – 87% fewer tokens when AI navigates your codebase

https://github.com/MikeRecognex/mcp-codebase-index
1•localforthewin•4m ago•1 comments

How to Red Team Your AI Agent in 48 Hours – A Practical Methodology

1•manuelnd•6m ago•0 comments

I wasted 80 hours and $800 setting up OpenClaw – so you don't have to

https://twitter.com/jordymaui/status/2023421221744877903
1•MrBuddyCasino•7m ago•0 comments

Europeans are dangerously reliant on US tech Now is a good time to build our own

https://www.theguardian.com/commentisfree/2026/feb/17/europeans-are-dangerously-reliant-on-us-tec...
2•beardyw•9m ago•0 comments

Ask HN: How Reliable Is Btrfs?

1•pregnenolone•14m ago•0 comments

Who Killed Kerouac

https://whokilledkerouac.com/mission
1•xavaki•15m ago•0 comments

Show HN: MCP Storage Map – One MCP Server for MySQL, MongoDB, and Athena

https://github.com/cyhoon/mcp-storage-map
1•jeffchoi•21m ago•0 comments

WD and Seagate confirm: Hard drives sold out for 2026

https://www.heise.de/en/news/WD-and-Seagate-confirm-Hard-drives-for-2026-sold-out-11178917.html
18•layer8•21m ago•0 comments

The Creator of OpenCode Thinks You're Fooling Yourself About AI Productivity

https://blog.codacy.com/the-creator-of-opencode-thinks-youre-fooling-yourself-about-ai-productivity
1•thunderbong•23m ago•0 comments

Elon Musk on Space GPUs, AI, Optimus, and His Manufacturing Method

https://cheekypint.substack.com/p/elon-musk-on-space-gpus-ai-optimus
1•JeanKage•24m ago•0 comments

Show HN: Llmfit;94 models, 30 providers.1 tool to see what runs on your hardware

https://github.com/AlexsJones/llmfit
1•axjns•25m ago•0 comments

Margins Aren't Just Numbers

https://news.ycombinator.com/submit
1•usus87•25m ago•2 comments

Zero Knowledge (About) Encryption: Security Analysis of Password Managers

https://zkae.io/
2•mweibel•29m ago•0 comments

Japan Is What Late-Stage Capitalist Decline Looks Like

https://oceandrops.substack.com/p/japan-is-what-late-stage-capitalist
3•olabolola•33m ago•0 comments

Product Management is all about people, not technology

https://www.leadinginproduct.com/p/product-management-is-a-people-role
3•benkan•34m ago•2 comments

Precious Computer Age relic, Unix v4, turns up in Univ. of Utah storage room

https://attheu.utah.edu/science-technology/precious-computer-age-relic-turns-up-in-u-storage-room/
1•gurjeet•34m ago•0 comments

Communal Living Is the Ultimate Parenting Hack

https://www.nytimes.com/2026/02/16/opinion/housing-communal-parenting-friends.html
3•yshunnar•37m ago•0 comments

Fast python project template with uv, ruff, ty and more

https://github.com/ritwiktiwari/copier-astral
1•ritwiktiwari•38m ago•1 comments

EU Parliament blocks AI tools over cyber, privacy fears

https://www.politico.eu/article/eu-parliament-blocks-ai-features-over-cyber-privacy-fears/
2•robtherobber•41m ago•0 comments

Fast Earth-to-Mars Travel: The Antimatter Harvesting Solution

https://github.com/julienreszka/julienreszka/blob/main/notes/Achieve-fast-Earth-to-Mars-travel.md
1•julienreszka•41m ago•0 comments

Show HN: Electronic Music Genre Set Theory

https://tjf.lol/genre/
2•thomasfuller•41m ago•1 comments

Proton and NordVPN blocked in Spain during soccer matches

https://bandaancha.eu/articulos/laliga-telefonica-escalan-bloqueos-11668
2•EbNar•42m ago•1 comments

Show HN: I built an AI trainer and calorie scanner

https://fitversehub.com/download
1•mihaibundea•43m ago•0 comments

Denonomicon: The Dark Arts of Deno Foreign Function Interface Programming

https://denonomicon.deno.dev/introduction
1•enz•43m ago•0 comments

Is Dark Energy Evolving?

https://www.universetoday.com/articles/is-dark-energy-actually-evolving
2•rbanffy•47m ago•0 comments

UK PM: "No platform gets a free pass"

https://www.gov.uk/government/news/pm-no-platform-gets-a-free-pass-government-takes-action-to-kee...
2•kaelyx•51m ago•1 comments

Thinking Machines Lab Will Hire Me. They Just Don't Know It Yet.

https://medium.com/@redjonzaci/thinking-machines-lab-will-hire-me-they-just-dont-know-it-yet-0a59...
1•redjonzaci•56m ago•3 comments

Show HN: ACDC – A non-agentic AI coding tool with L0-L3 context cache tiering

https://github.com/flatmax/AI-Coder-DeCoder
1•flatmax•56m ago•2 comments

Declarative, Inquisitive, then Imperative (2017) [pdf]

https://www.forth.org/svfig/kk/11-2017-Falvo.pdf
1•tosh•57m ago•0 comments