frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Proxmox Virtual Environment 9.2 with Dynamic Load Balancer Released

https://www.proxmox.com/en/about/company-details/press-releases/proxmox-virtual-environment-9-2
1•speckx•3m ago•0 comments

Codex for Everything Exfiltrates Connected Data

https://www.promptarmor.com/resources/codex-for-everything-exfiltrates-connected-data
2•takira•5m ago•0 comments

Inside SpaceX's IPO Plan

https://www.ft.com/content/a59be3cf-eee2-4b10-9c86-b6e4dc0dbbdb
2•1vuio0pswjnm7•5m ago•0 comments

The fastest growing political party is Cockroach Janata Party [video]

https://www.youtube.com/watch?v=uuFmKx5K9tc
1•Guestmodinfo•5m ago•0 comments

Leetcode.nvim

https://github.com/sidntrivedi/leetcode.nvim
2•sidntrivedi•6m ago•1 comments

Agents Sometimes Catastrophize

https://futuresearch.ai/blog/agents-catastrophize/
5•ddp26•7m ago•0 comments

EPA Official Agrees to Review Data Center Water Impact (AOC Shows Dirty Water)

https://news.bloomberglaw.com/environment-and-energy/epa-to-investigate-meta-data-center-link-to-...
2•zzzeek•8m ago•1 comments

DashAttention: Differentiable and Adaptable Sparse Hierarchical Attention

https://arxiv.org/abs/2605.18753
3•cmogni1•10m ago•0 comments

Test-Driving the Lance Lakehouse Format in DuckDB

https://duckdb.org/2026/05/21/test-driving-lance
2•tanelpoder•12m ago•0 comments

S3-Compatible object storage at $15/TB with free egress and CDN

https://filebase.com/blog/introducing-filebase-object-storage-with-free-egress/
4•acejam•12m ago•0 comments

Temporal is becoming Crystal Palace Football Club's front-of-shirt partner

https://temporal.io/blog/crystal-palace-partnership
2•ldite•12m ago•0 comments

SpaceX is heavily reliant on Starlink for growth and profit for IPO

https://www.cnbc.com/2026/05/21/spacex-starlink-growth-profit-nasdaq-ipo.html
2•drob518•13m ago•1 comments

SpaceX IPO reads like Hollywood fantasy version of the future

https://fortune.com/2026/05/21/spacex-ipo-musk-mars-colony-dinosaurs-space-exploration/
3•1vuio0pswjnm7•14m ago•1 comments

Apple to broadcast MLS game shot entirely on 15 iPhones

https://variety.com/2026/digital/news/apple-mls-match-shot-entirely-on-iphone-first-time-1236755744/
1•dkobia•14m ago•0 comments

White House postpones AI executive order signing ceremony

https://www.axios.com/2026/05/21/white-house-postpones-ai-eo-signing
2•anigbrowl•14m ago•0 comments

Ask HN: Failing interviews for mid-level SWE in UK, advice please

1•mjb8086•15m ago•0 comments

I created an extension for Claude that shares context on how you work

https://github.com/stubbleapp/Stubble
1•satay_chicken31•18m ago•0 comments

A multi-agent system for automating scientific discovery

https://www.nature.com/articles/s41586-026-10652-y
1•Timofeibu•18m ago•0 comments

Chewing gum restores dad's taste and smell years after Covid

https://discover.swns.com/2026/05/chewing-gum-restores-dads-taste-and-smell-years-after-covid/
6•speckx•20m ago•0 comments

Show HN: From one Claude agent to a fleet – in five small steps

1•sermakarevich•20m ago•0 comments

Sony Flamingo - The Coolest Record Player Ever Made

https://obsoletesony.substack.com/p/the-coolest-record-player-ever-made
2•reconnecting•20m ago•0 comments

A permissively licensed Vita FPGA Architecture in only 380 lines of Verilog

https://github.com/VitaSetLLC/VitaOS-Libre
1•VitaSetLLC•21m ago•0 comments

Nature's Hardware Store: building the future with biology [video]

https://aeon.co/videos/fungi-homes-and-more-ways-biology-could-sustain-life-beyond-earth
2•bryanrasmussen•21m ago•0 comments

Inside the next phase of OpenAI's political strategy

https://www.politico.com/news/2026/05/20/chatgpt-state-ai-fight-00928903
2•1vuio0pswjnm7•22m ago•0 comments

Trump Postpones AI Executive Order Due to Concerns About Overregulation

https://www.wsj.com/tech/ai/trump-executive-order-ai-advanced-models-57bcc955
3•berkeleyjunk•24m ago•0 comments

Japanese Verb Conjugation the Simple Hard Way

https://underreacted.leaflet.pub/3mmevu6woys27
2•danabramov•24m ago•0 comments

Show HN: Canonry tracks how AI cites you – agent-first, open source

https://github.com/AINYC/canonry
1•arberx•25m ago•0 comments

Show HN: Online Sound Test

https://soundtestx.com/
1•artiomyak•25m ago•0 comments

IRS requires identity verification with a private company for refunds?

https://help.id.me/hc/en-us/articles/8214940302999-IRS-and-ID-me
1•SilverElfin•26m ago•3 comments

Pivoting Out of Healthcare

https://saffron.health/
1•brandonb•28m ago•0 comments