frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Pitch-Pit – AI rates your startup idea, crowd votes, top one gets built

https://pitchpit.app
1•mawadsur•1m ago•0 comments

Cutting Steel Gears with Electric discharge machining

https://hackaday.com/2026/05/05/cutting-steel-gears-with-homemade-edm/
1•afpx•2m ago•0 comments

Steam Controller CAD files under Creative Commons license

https://store.steampowered.com/news/group/45479024/view/702141174212723352
1•mrtimeman•4m ago•0 comments

'Nature' Retracts Paper on the Benefits of ChatGPT in Education

https://www.404media.co/nature-retracts-paper-on-the-benefits-of-chatgpt-in-education/
2•tjek•4m ago•1 comments

Australia will run an overt command economy by 2040

https://caseyhandmer.wordpress.com/2026/04/16/australia-will-run-an-overt-command-economy-by-2040/
2•surprisetalk•8m ago•0 comments

Show HN: BattleClaws – A battle arena where AI agents fight autonomously

https://battleclaws.ai/
1•bryhaw•8m ago•0 comments

Is Chrome's 4GB "weights.bin" file spyware? The truth behind the viral warnings

https://www.androidauthority.com/google-chrome-weights-bin-ai-model-download-explained-3664043/
1•Brajeshwar•8m ago•1 comments

Granite 4.1 LLMs: How They're Built

https://huggingface.co/blog/ibm-granite/granite-4-1
1•gmays•8m ago•0 comments

A supply of style guides, generated

https://www.designmd.supply/
1•teddyX•8m ago•0 comments

Proprietary Software, Hardware and Protocols Face AI-Driven Security Risk

https://www.infosecurity-magazine.com/blogs/why-software-faces-ai-driven/
5•dwitcher•10m ago•0 comments

Storied Toolmaker Closes Its Last Hometown Plant–and Blames Its Tape Measures

https://www.wsj.com/business/stanley-tools-factory-closes-8bac57ca
1•impish9208•10m ago•1 comments

A Programmer's Guide to Leaving GitHub

https://lord.io/leaving-github/
1•birdculture•11m ago•0 comments

Tell HN: I'm struggling formalizing 15 years of experience to my clodex agent

1•jb_briant•11m ago•0 comments

Show HN: Arden – Runtime policy enforcement and governance for AI agents

https://www.arden.sh/
2•rishabtandon•13m ago•0 comments

Solar activity above 67% peak makes space debris fall faster

https://www.frontiersin.org/news/2026/05/06/frontiers-astronomy-space-sciences-space-debris-orbit...
1•giuliomagnifico•14m ago•0 comments

Writers Are Going to Extremes to Prove They Didn't Use AI

https://www.wsj.com/tech/ai/writers-are-going-to-extremes-to-prove-they-didnt-use-ai-46e7c3f7
1•fortran77•14m ago•1 comments

DeepMind Takes Minority Stake in Maker of 'EVE Online', will get training data

https://www.bloomberg.com/news/articles/2026-05-06/google-deepmind-takes-minority-stake-in-maker-...
1•htrp•14m ago•0 comments

Show HN: AP-quiz.com – 23 AP subjects, practice AP on the go

https://ap-quiz.com
1•coolwulf•14m ago•0 comments

Most vibe-coded tools are not for you

https://passo.uno/tools-slop-is-a-problem/
1•theletterf•15m ago•0 comments

PanicMode – freezes broken Linux processes instead of killing them

https://github.com/BorisYamp/panicmode
1•borisyamp•15m ago•0 comments

Against DNSSEC (2015)

https://sockpuppet.org/blog/2015/01/15/against-dnssec/
1•jamilbk•15m ago•0 comments

OmniConvert: A private, token-based online converter for 50 file types

https://www.omniconvert.cloud
1•AlexBahlk•19m ago•0 comments

EVO SATA 2.5 inch 2TB SSD $1039.99

https://www.samsung.com/us/memory-storage/sata-ssd/870-evo-sata-2-5-ssd-2tb-sku-mz-77e2t0b-am/
3•paulnpace•20m ago•2 comments

Debategle: A new ranked Omegle like debate platform

https://debategle.com/
1•sawsymikey•20m ago•0 comments

"AI systems do not understand": New report flags systemic failures in AI coding

https://thenewstack.io/acm-vibe-coding-ai-agent/
3•Brajeshwar•21m ago•1 comments

Desktop Tracking Software: The Smart Way to Track Your Activities at Work

https://yakihonne.com/article/naddr1qvzqqqr4gupzqvcy9tkh3xq8x5m7mdsqxx7mcylxxrj8hdj6psdy89g8jaa2e...
1•jameswar0202•21m ago•0 comments

Detecting email service providers from raw Gmail headers

https://chromewebstore.google.com/detail/email-detective/jmflpchhakbogamlbfmlkglnpgfbhidl
1•onlito•22m ago•0 comments

Pure-Swift, cross-platform reimplementations of CLI tools for working with repos

https://github.com/Cocoanetics/SwiftPorts
2•ingve•22m ago•0 comments

I'm 18 and built TacticMax, a free offline chess tactics app

https://tacticmax.netlify.app/
1•tacticmax_dev•23m ago•1 comments

Archestra LLM Gateway Now Supports All Types of LLM Auth

https://archestra.ai/blog/llm-proxy-auth-overview
3•motakuk•24m ago•0 comments