frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Iran war shows Green Deal 'fundamental' to EU security

https://www.politico.eu/article/eu-green-deal-energy-security-iran-war/
1•vrganj•4m ago•0 comments

PEP 827 – Type Manipulation

https://peps.python.org/pep-0827/
1•arusahni•5m ago•0 comments

Volkswagen to cut 50k jobs as China offers cheaper electric cars

https://www.telegraph.co.uk/business/2026/03/10/volkswagen-to-cut-50000-jobs-after-failed-bet-on-...
2•emzo•6m ago•0 comments

Show HN: Iran War Clock

https://www.iranwarclock.com/
1•martialg•6m ago•0 comments

Cold DMs don't work anymore. Here's what got me my first users

1•deep1283•7m ago•0 comments

AI C-Suite – Chat with a fictional leadership team (1-on-1 or group chat)

https://99helpers.com/tools/csuite-advisor
1•nickk81•10m ago•1 comments

Most B2B Lead Generation Tools Create Contacts, Not Customers

1•lakshmirk•11m ago•0 comments

Show HN: A mission-based game to help students apply math in real life

https://www.owsterlabs.com/module/eagle-in-the-sky/
1•firepegasus11•11m ago•0 comments

Who Killed German Nuclear?

https://zionlights.substack.com/p/who-really-killed-german-nuclear
1•mpweiher•11m ago•1 comments

Removing recursion via explicit callstack simulation

https://jnkr.tech/blog/removing-recursion
1•gsky•13m ago•0 comments

Amygdala Research: Prompt topic, get footnoted report from experts in seconds

https://amygdala.eu/research
1•JoranCornelisse•14m ago•0 comments

Gemini Exporter – a Chrome extension to export Gemini chats

1•nongquy•18m ago•0 comments

Understanding React Native's new architecture

https://www.z1.digital/blog/react-native-s-new-architecture-a-paradigm-shift
1•ClarisaGuerra•19m ago•0 comments

IronPE – Minimal Windows PE manual loader written in Rust

https://github.com/iss4cf0ng/IronPE
1•iss4cf0ng•19m ago•0 comments

Nasdaq partners with Kraken to distribute tokenized stocks globally

https://www.coindesk.com/business/2026/03/09/nasdaq-and-kraken-are-teaming-up-to-let-you-trade-to...
1•giuliomagnifico•21m ago•0 comments

Show HN: .ispec – because documentation always lies and I'm trying to fix that

https://github.com/johnfire/ispec
1•alby-durer•22m ago•0 comments

We now know why some people had blood clots after Covid shots

https://www.thehindu.com/sci-tech/science/we-now-know-why-some-people-had-severe-blood-clots-afte...
1•thisislife2•23m ago•0 comments

Mnemos,persistent memory for AI agents

https://github.com/mem9-ai/mem9
2•mountainview•25m ago•0 comments

I put my whole life into a single database

https://howisfelix.today/
2•lukakopajtic•26m ago•0 comments

Lambda Calculus Explorer

http://kmicinski.com/cis352-s26/lambda-playground/
1•todsacerdoti•26m ago•0 comments

Ask HN: What AI content automation stack are you using in 2026?

2•jackcofounder•28m ago•1 comments

Bash is all you need. A nano Claude Code–like agent, built from 0 to 1

https://github.com/shareAI-lab/learn-claude-code
1•Oras•28m ago•0 comments

Hardware passkeys are winning on security, losing on adoption

https://www.corbado.com/blog/hardware-passkey-adoption-observability
1•vdelitz•29m ago•0 comments

Too Much Color

https://www.keithcirkel.co.uk/too-much-color/
5•Keithamus•29m ago•0 comments

CPG – Generate Cilium network policies from dropped Hubble flows

1•soulkyu•31m ago•0 comments

What's my JND? – a colour guessing game

https://www.keithcirkel.co.uk/whats-my-jnd/?r=ARUjKP__-ve-
3•Keithamus•32m ago•2 comments

I think I'm turning into a vibe coder

2•bekauridev•34m ago•2 comments

Measuring the Weight of an Electron (2017)

https://deftly.net/posts/2017-06-01-measuring-the-weight-of-an-electron.html
2•asimovDev•35m ago•0 comments

I made myself a device that tells me what plane flies above my home

https://old.reddit.com/r/aviation/comments/1roy7qs/i_made_myself_a_device_that_tells_me_what_plane/
1•taubek•36m ago•0 comments

Working to Decentralize FedCM

https://atproto.com/blog/working-to-decentralize-fedcm
1•erlend_sh•39m ago•0 comments