Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
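
The plan, execute, test, self-correct loop described above might look roughly like the sketch below. This is a toy illustration under stated assumptions, not Refact.ai's implementation: `run_agent`, `choose_action`, `AgentRun`, and the stub tools are all hypothetical names, and the orchestrator model is replaced by a hard-coded policy so the example runs offline.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types; none of these names come from the Refact.ai codebase.
Action = Tuple[str, str]      # (tool name, argument)
Tool = Callable[[str], str]   # takes an argument, returns an observation


@dataclass
class AgentRun:
    task: str
    # Each entry is (tool name, argument, observation).
    history: List[Tuple[str, str, str]] = field(default_factory=list)
    solved: bool = False


def run_agent(
    task: str,
    choose_action: Callable[[AgentRun], Action],
    tools: Dict[str, Tool],
    max_steps: int = 10,
) -> AgentRun:
    """Single multi-step run: the orchestrator picks a tool each step
    based on the history so far, until it decides to finish."""
    run = AgentRun(task)
    for _ in range(max_steps):
        name, arg = choose_action(run)
        if name == "finish":
            run.solved = True
            break
        observation = tools[name](arg)
        run.history.append((name, arg, observation))
    return run


def demo() -> AgentRun:
    """Toy setup: tests only pass after the second patch attempt."""
    state = {"patches": 0}

    def explore(path: str) -> str:
        return f"read {path}"

    def patch(description: str) -> str:
        state["patches"] += 1
        return f"applied patch {state['patches']}"

    def run_tests(_: str) -> str:
        return "pass" if state["patches"] >= 2 else "fail"

    tools = {"explore": explore, "patch": patch, "test": run_tests}

    def choose_action(run: AgentRun) -> Action:
        # Stand-in for the orchestrator model: a fixed policy.
        if not run.history:
            return ("explore", "src/")
        last_tool, _, last_obs = run.history[-1]
        if last_tool == "test" and last_obs == "pass":
            return ("finish", "")
        if last_tool == "patch":
            return ("test", "")
        # After exploring, or after a failing test, try (another) patch.
        return ("patch", "fix")

    return run_agent("fix bug", choose_action, tools)
```

In this toy run the first patch fails its test, so the policy patches again and re-tests before finishing, which mirrors the "self-corrects" step; a real agent would swap the fixed policy for model calls and the stubs for actual repository tools.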

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Building Pi with Pi

https://lucumr.pocoo.org/2026/5/24/pi-oss/
1•mplanchard•1m ago•0 comments

Life with locked-in syndrome: 'Despite everything, you are alive'

https://www.thetimes.com/uk/healthcare/article/life-with-locked-in-syndrome-despite-eveything-you...
1•bookofjoe•1m ago•1 comments

Does the human still make the decision?

https://benjosaur.substack.com/p/does-the-human-still-make-the-decision
1•benjosaur•2m ago•0 comments

Don't know where your data is from? Bayesian modeling for unknown coordinates

https://christopherkrapu.com/blog/2026/dont-know-where-your-data-is-from/
1•ckrapu•5m ago•0 comments

Preventing AI agents from executing destructive terminal commands

https://github.com/7Majesty-M/terminal-guardian-mcp
1•majesty-m•8m ago•0 comments

Enhanced Games CEO insists 'Doping Olympics' is safer than traditional sport

https://www.dailymail.com/sport/othersports/article-15844011/enhanced-games-maximilian-martin-dop...
1•Bender•9m ago•0 comments

The Wizard with the Defensible Pond

https://worksonmymachine.ai/p/the-wizard-with-the-very-defensible
1•pkilgore•12m ago•0 comments

Advancing Mathematics Research with AI-Driven Formal Proof Search

https://arxiv.org/abs/2605.22763
1•tamnd•13m ago•0 comments

Agents Don't Want VMs

https://zachsmith.ai/blog/agents-dont-want-vms/
3•zachdev1•18m ago•0 comments

The Energy Transition Is Happening Faster Than You Think

https://www.youtube.com/watch?v=HgBTARXEfxU
1•InitialBP•19m ago•1 comments

Topo Designs Rover Trail Pack Is the Best Backpack I've Ever Used

https://www.wired.com/story/topo-designs-rover-trail-pack/
1•joozio•21m ago•0 comments

A Dual-Node Home NAS Cluster for $210

https://laser-coder.net/articles/home-nas/index.html
1•lasercoder•23m ago•1 comments

Hitler as a Unit of Measurement

https://hermitome.wordpress.com/2013/08/30/hitler-as-a-unit-of-measurement/
3•lisper•25m ago•0 comments

When Canada's Metric Switch Left a New Boeing 767 with Half the Fuel It Needed

https://viewfromthewing.com/both-engines-died-at-41000-feet-canadas-metric-switch-left-a-brand-ne...
1•crescit_eundo•25m ago•1 comments

Active supply chain attack across NPM, PyPI, and Crates.io

https://twitter.com/socketsecurity/status/2058565153138844043
1•rob•29m ago•0 comments

An Introduction to Objectivist-C

https://fdiv.net/2012/04/01/introduction-objectivist-c
1•pcfwik•31m ago•0 comments

New Rocket League is using Unreal Engine 6

https://twitter.com/QNDZYcom/status/2058586983656726640
5•astlouis44•35m ago•2 comments

Can't they get the small science stuff right?

https://www.theguardian.com/commentisfree/2026/may/24/hill-i-will-die-on-hollywood-blockbusters-s...
4•pjbk•37m ago•0 comments

What Gets Kept

https://www.newyorker.com/culture/the-weekend-essay/what-jack-kerouac-left-behind
2•lermontov•37m ago•0 comments

Oss.zone, a Pubnix Running on NixOS

https://oss.zone
1•f1nniboy•38m ago•1 comments

Apple Preparing New 'Gen AI' Website Ahead of WWDC

https://www.macrumors.com/2026/05/23/apple-gen-ai-subdomain/
3•Brajeshwar•39m ago•0 comments

FieldStation42 – Turn your computer into a vintage TV cable system

https://fieldstation42.com/
2•indigodaddy•43m ago•0 comments

Beating C with Dyalog APL: wc

https://ummaycoc.github.io/wc.apl/
1•tosh•44m ago•0 comments

Dynamical Governed Topology Engine with 3D Viewer

https://pub-6f95fcb400b34d7290952c160671872b.r2.dev/Screenshot%202026-05-24%20093053.png
1•verhash•45m ago•1 comments

FreeBSD Foundation Executive Director Tries Daily Driving FreeBSD on Laptop

https://www.phoronix.com/news/FreeBSD-On-Laptop-Driver
16•Bender•47m ago•10 comments

First ever Cray T3D Supercomputer goes up for auction with $81,000 reserve

https://www.tomshardware.com/tech-industry/supercomputers/first-ever-cray-t3d-supercomputer-goes-...
3•LorenDB•48m ago•1 comments

AT&T sues to ditch Cali copper phone lines to save billions

https://www.theregister.com/networks/2026/05/22/att-sues-to-ditch-cali-copper-phone-lines-to-save...
1•Bender•48m ago•0 comments

HP investigating BIOS updates that leave premium laptop users in boot loop limbo

https://www.theregister.com/personal-tech/2026/05/24/hp-investigating-bios-updates-that-leave-pre...
3•Bender•48m ago•0 comments

Hello TinyTree

https://tinytree.dev/blog/hello-tinytree/
2•oftenwrong•50m ago•0 comments

Memory has grown to nearly two-thirds of AI chip component costs

https://epoch.ai/data-insights/ai-chip-component-cost-shares
22•intelkishan•52m ago•6 comments