frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

C++26: Standard Library Hardening

https://www.sandordargo.com/blog/2026/05/13/cpp26-library-hardening
1•jandeboevrie•1m ago•0 comments

Show HN: Free IP fraud score detection (without ads or captcha)

1•TemporaryMail•1m ago•0 comments

The nuclear-physics infrastructure behind PET scans

https://www.lanl.gov/media/publications/1663/proton-power-for-public-health
1•LAsteNERD•4m ago•0 comments

China criticizes US chip equipment bill in run-up to Beijing talks

https://www.reuters.com/legal/government/china-criticizes-us-chip-equipment-bill-run-up-beijing-t...
1•tartoran•5m ago•0 comments

Hosting a website on an 8-bit microcontroller

https://maurycyz.com/projects/mcusite/
1•jandeboevrie•6m ago•0 comments

The Sarcophagus Dealer

https://www.theatlantic.com/culture/2026/05/serop-simonian-egypt-theft-artifacts/686591/
1•Brajeshwar•7m ago•0 comments

Prayer-Driven Development

https://digitaliziran.si/2026/05/08/prayer-driven-development.html
1•gregman1•8m ago•1 comments

Dandelion Rubber for Sustainable Tires?

https://www.dw.com/en/could-rubber-from-dandelions-make-tires-more-sustainable/a-56766389
1•rickcarlino•8m ago•0 comments

You Go Next (Standup Tool)

https://www.yougonext.com/
1•nsypteras•8m ago•0 comments

Ask HN: What are you working on (non-AI)?

1•BrunoBernardino•9m ago•0 comments

Magical thinking about magical thinking

https://heatherburns.tech/2026/05/13/magical-thinking-about-magical-thinking/
1•tao_oat•10m ago•0 comments

The Carbon Market Is Set for a Major Shake-Up

https://oilprice.com/Alternative-Energy/Renewable-Energy/The-Carbon-Market-Is-Set-for-a-Major-Sha...
1•PaulHoule•10m ago•0 comments

Show HN: Vim file browser that runs in separate terminal

https://github.com/hoffa/vitree
2•crehn•11m ago•0 comments

Tilebox – workflow orchestration for satellite data pipelines

https://console.tilebox.com/sign-up
1•meesher•12m ago•0 comments

The Forge We Deserve

https://btao.org/posts/2026-05-09-the-forge-we-deserve/
1•tao_oat•12m ago•0 comments

Show HN: DailyHabit – A client-side habit tracker (React with IndexedDB storage)

2•souhail_dev•13m ago•0 comments

.NET 11 Preview 4

https://devblogs.microsoft.com/dotnet/dotnet-11-preview-4/
2•majora2007•14m ago•1 comments

The billionaires' club at the center of America's public lands fight

https://www.hcn.org/articles/the-billionaires-club-at-the-center-of-americas-public-lands-fight/
1•cdrnsf•14m ago•0 comments

Ask HN: What is better Opus 4.6 High or Opus 4.7 Medium?

2•franze•15m ago•0 comments

How Brazil is starting to rein in Big Tech

https://www.codastory.com/authoritarian-tech/how-brazil-is-starting-to-rein-in-big-tech/
2•cdrnsf•15m ago•0 comments

How to make your blog more accessible, and why you should care

https://grizzlygazette.bearblog.dev/how-to-make-your-blog-more-accessible-and-why-you-should-care/
1•speckx•15m ago•1 comments

Theorem proving as a (brain tumor) therapy/distraction

https://orcid.org/0000-0002-4206-3283
1•fredokun•15m ago•1 comments

Show HN: Spreadsheets Cells with Uncertainty Distributions

https://github.com/PragmaticMachineLearning/maybe
2•tobiadefami•16m ago•0 comments

Revisiting mshare in Linux

https://lwn.net/SubscriberLink/1072333/c5c762d9490916e5/
2•chmaynard•17m ago•0 comments

A spatial canvas of every public US Government UAP record

https://openuap.space
1•dominikmartn•18m ago•0 comments

OpenCS2 – 5k hours recording of Counter Strike for world model training

https://blanchon-opencs2-dataset-viewer.hf.space/
1•blanchon•20m ago•1 comments

Stupidly Simple SVG Sparklines

https://shkspr.mobi/blog/2026/05/stupidly-simple-svg-sparklines/
1•Brajeshwar•20m ago•0 comments

Show HN: Promptcellar – capture every Claude Code prompt as JSONL in your repo

https://github.com/dominiek/promptcellar-for-claude-code
3•dominiek•21m ago•0 comments

What if your AI could buy you a car?

https://medium.com/@alex_21933/what-if-your-ai-could-actually-buy-you-a-car-7ba84bae4a55
1•yankouskia•21m ago•1 comments

The US Is Winning the AI Race

https://avkcode.github.io/blog/us-winning-ai-race.html
17•akrylov•23m ago•6 comments