Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
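
To make the plan / execute / test / self-correct loop described above more concrete, here is a minimal Python sketch. It is not Refact.ai's implementation: `run_agent`, `AgentState`, `make_demo`, the stub tools, and the scripted `policy` are all hypothetical names invented for illustration, with the policy standing in for an LLM orchestrator that picks the next tool call.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration only; not Refact.ai's actual API.
Tool = Callable[[str], str]
Action = Tuple[str, str]  # (tool name, argument)


@dataclass
class AgentState:
    task: str
    # Each entry is (tool name, argument, tool result).
    history: List[Tuple[str, str, str]] = field(default_factory=list)
    done: bool = False


def run_agent(
    task: str,
    tools: Dict[str, Tool],
    policy: Callable[[AgentState], Action],
    max_steps: int = 10,
) -> AgentState:
    """Single multi-step run: ask the policy for an action, call the tool,
    record the result, and repeat until the policy says "finish"."""
    state = AgentState(task=task)
    for _ in range(max_steps):
        name, arg = policy(state)
        if name == "finish":
            state.done = True
            break
        tool = tools.get(name)
        result = tool(arg) if tool else f"unknown tool: {name}"
        state.history.append((name, arg, result))
    return state


def make_demo() -> Tuple[Dict[str, Tool], Callable[[AgentState], Action]]:
    """Build stub tools and a scripted policy (assumed, in place of a model)."""
    repo = {"bug_fixed": False}

    def search(arg: str) -> str:
        return f"found '{arg}' in utils.py"

    def patch(arg: str) -> str:
        # Only the second, corrected patch actually fixes the fake bug.
        repo["bug_fixed"] = arg == "good fix"
        return "patch applied"

    def run_tests(arg: str) -> str:
        return "PASS" if repo["bug_fixed"] else "FAIL"

    tools: Dict[str, Tool] = {
        "search": search,
        "patch": patch,
        "run_tests": run_tests,
    }

    def policy(state: AgentState) -> Action:
        if not state.history:
            return ("search", "parse_date")       # explore the repository
        last_name, _, last_result = state.history[-1]
        if last_name == "search":
            return ("patch", "first attempt")     # modify code
        if last_name == "patch":
            return ("run_tests", "")              # test the change
        if last_name == "run_tests" and last_result == "FAIL":
            return ("patch", "good fix")          # self-correct and retry
        return ("finish", "")                     # tests pass: deliver

    return tools, policy


if __name__ == "__main__":
    demo_tools, demo_policy = make_demo()
    final = run_agent("fix parse_date", demo_tools, demo_policy)
    for step in final.history:
        print(step)
```

In a real agent the scripted policy would be replaced by a model call that sees the task and the history, and the stub tools by real repository search, editing, and test-running tools. The sketch only shows the control flow of one run that iterates until the tests pass.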

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
