Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
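For readers who want a concrete picture of the plan / execute / test / self-correct cycle described above, here is a minimal sketch in Python. It is not Refact.ai's code: `AgentRun`, `run_agent`, `make_tools`, and `scripted_policy` are hypothetical names, the tools are toy stubs, and the scripted policy only stands in for an orchestrator model. The one name borrowed from the post is `deep_analysis`, and its signature here is an assumption.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical types for illustration; none of this is taken from the Refact.ai codebase.
Tool = Callable[[str], str]
Step = Tuple[str, str, str]  # (tool name, argument, observation)


@dataclass
class AgentRun:
    task: str
    tools: Dict[str, Tool]
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)


def run_agent(run: AgentRun, choose_action) -> Optional[str]:
    """Run one multi-step loop; return the final result, or None if steps run out."""
    for _ in range(run.max_steps):
        name, arg = choose_action(run.task, run.history)
        if name == "finish":
            return arg
        tool = run.tools.get(name)
        observation = tool(arg) if tool else "unknown tool: " + name
        run.history.append((name, arg, observation))
    return None


def make_tools() -> Dict[str, Tool]:
    """Toy stubs: the tests only pass after a second edit, to force one self-correction."""
    state = {"edits": 0}

    def explore(arg: str) -> str:
        return "suspect file for: " + arg

    def deep_analysis(arg: str) -> str:  # stub; the real tool is model-backed
        return "hypothesis about " + arg

    def edit(arg: str) -> str:
        state["edits"] += 1
        return "edit #%d applied" % state["edits"]

    def test(arg: str) -> str:
        return "tests passed" if state["edits"] >= 2 else "tests failed"

    return {"explore": explore, "deep_analysis": deep_analysis, "edit": edit, "test": test}


def scripted_policy(task: str, history: List[Step]) -> Tuple[str, str]:
    """Stand-in for the orchestrator: pick the next tool from the last observation."""
    if not history:
        return ("explore", task)
    last_tool, _, last_obs = history[-1]
    if last_tool == "explore":
        return ("deep_analysis", last_obs)
    if last_tool == "deep_analysis":
        return ("edit", last_obs)
    if last_tool == "edit":
        return ("test", "")
    if last_tool == "test" and last_obs == "tests passed":
        return ("finish", "patch ready")
    return ("edit", "retry after: " + last_obs)  # self-correct on failure


if __name__ == "__main__":
    run = AgentRun(task="fix bug", tools=make_tools())
    print(run_agent(run, scripted_policy))
    print([step[0] for step in run.history])
```

In this toy run the first test fails, so the policy issues a second edit before finishing. That is the only sense in which it "self-corrects"; a real orchestrator would choose actions from model output rather than a fixed rule table.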

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
