Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
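To make the plan/execute/test/self-correct loop concrete, here is a minimal sketch of a single multi-step agent run. It is not Refact.ai's code: `run_agent`, `AgentRun`, the three toy tools, and the scripted policy that stands in for the orchestrator model are all hypothetical names invented for illustration, and no real LLM is called.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical tool signature: one string argument in, one string observation out.
Tool = Callable[[str], str]


@dataclass
class AgentRun:
    task: str
    # Each entry is (tool name, argument, observation).
    history: List[Tuple[str, str, str]] = field(default_factory=list)
    solved: bool = False


def run_agent(
    task: str,
    tools: Dict[str, Tool],
    choose_action: Callable[[AgentRun], Tuple[str, str]],
    max_steps: int = 10,
) -> AgentRun:
    """One multi-step run: pick a tool, observe the result, repeat until 'finish'."""
    run = AgentRun(task)
    for _ in range(max_steps):
        tool_name, arg = choose_action(run)
        if tool_name == "finish":
            run.solved = True
            break
        observation = tools[tool_name](arg)
        run.history.append((tool_name, arg, observation))
    return run


def demo() -> AgentRun:
    # Toy in-memory "repository" containing a deliberate bug.
    repo = {"calc.py": "def add(a, b): return a - b"}

    def read_file(path: str) -> str:
        return repo[path]

    def write_file(arg: str) -> str:
        # Argument format (an assumption of this sketch): "<path>\n<new content>".
        path, _, content = arg.partition("\n")
        repo[path] = content
        return "ok"

    def run_tests(_: str) -> str:
        scope: dict = {}
        exec(repo["calc.py"], scope)
        return "pass" if scope["add"](2, 3) == 5 else "fail"

    tools = {"read_file": read_file, "write_file": write_file, "run_tests": run_tests}

    def scripted_policy(run: AgentRun) -> Tuple[str, str]:
        # Stand-in for the orchestrator model: fixed rules, not a real LLM call.
        if not run.history:
            return "read_file", "calc.py"
        last_tool, _, last_obs = run.history[-1]
        if last_tool == "read_file":
            return "run_tests", ""
        if last_tool == "run_tests" and last_obs == "fail":
            # Self-correction step: a failing test triggers an edit.
            return "write_file", "calc.py\ndef add(a, b): return a + b"
        if last_tool == "write_file":
            return "run_tests", ""
        return "finish", ""

    return run_agent("fix add()", tools, scripted_policy)


if __name__ == "__main__":
    result = demo()
    print(result.solved, [step[0] for step in result.history])
```

In this sketch the tools are chosen from the run history rather than from a fixed script, which is the rough idea behind "used dynamically based on task needs"; a real agent would replace `scripted_policy` with a model call.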

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Everyone got excited they can suddenly code, and missed the point

https://kasperjunge.com/blog/should-pms-code-with-agents/
1•juunge•38s ago•0 comments

O'Reilly Animal Menagerie

https://www.oreilly.com/animals.csp
1•skogstokig•1m ago•0 comments

Story Lab

https://story-lab.ai/
1•Aftermidn8•4m ago•0 comments

China plans to spend $295B on AI buildout

https://www.bloomberg.com/news/articles/2026-06-09/china-prepares-295-billion-plan-to-fund-nation...
1•loandbehold•5m ago•0 comments

Plane Launch Week Gallery

https://plane.so/launch-week/q2-2026
1•bbor•5m ago•1 comments

Nearly Everyone, Everywhere, Veers Left When Walking

https://www.nytimes.com/2026/06/10/science/humans-walking-veer-left-counterclockwise.html
1•donohoe•7m ago•0 comments

The Device Paradigm [pdf]

https://web.cs.ucdavis.edu/~rogaway/classes/188/spring04/projects/5.pdf
1•burnto•7m ago•0 comments

Fable 5 creates full Swiss lever watch movement in Three.js

https://twitter.com/quanghuynt14/status/2064509430650065278
2•mhb•8m ago•0 comments

Nuts – pip/NPM for Java with first-class workspaces and JDK provisioning (9y+)

https://github.com/thevpc/nuts
1•thevpc•8m ago•0 comments

AI-review: reviewing AI code before it lands

https://www.jackfranklin.co.uk/blog/ai-review-plan/
1•mooreds•8m ago•0 comments

Whole Earth Garden

https://wholegarden.falso.net/
1•gdss•9m ago•0 comments

How to enter side doors: guide to jobs, cold emails, and making yourself legible

https://velvetnoise.substack.com/p/how-to-enter-side-doors
1•nowflux•9m ago•0 comments

Who runs the ransomware group 'The Gentlemen?'

https://krebsonsecurity.com/2026/06/who-runs-the-ransomware-group-the-gentlemen/
2•krebsonsecurity•10m ago•0 comments

T9 Texting Is Back on iPhone

https://apps.apple.com/us/app/t9-keyboard-text-like-its-04/id6765738931
1•ItsMeDavidV•10m ago•0 comments

Ongoing attempt to standardise a DOM Templating API in browsers

https://github.com/justinfagnani/dom-templating-api-proposal
1•llcooliovice•13m ago•0 comments

Show HN: Kctx – A read-only Kubernetes context engine for SREs and AI Agents

https://github.com/lucasepe/kctx
2•lucasepe•15m ago•0 comments

Anthropic's Self Governance Is an Act of Social Violence

https://cezarbabin.com/notes/anthropic-self-governance-is-an-act-of-social-violence.html
1•nibab•16m ago•1 comments

Google Liable for Hallucinations (In Germany)

https://garymarcus.substack.com/p/breaking-google-liable-for-hallucinations
1•PaulDavisThe1st•16m ago•1 comments

The maths behind a leopards spots

https://www.bbcearth.com/news/the-maths-behind-a-leopards-spots
2•marysminefnuf•19m ago•0 comments

Jumping spiders inspire ultra-efficient 3D camera

https://news.northwestern.edu/stories/2026/06/jumping-spiders-inspire-ultra-efficient-3d-camera
1•gmays•21m ago•0 comments

US stock market to stop shrinking for first time in 23 years

https://www.ft.com/content/f7dae4e1-d650-45ab-ac97-043c7a965d24
4•JumpCrisscross•22m ago•0 comments

Ask HN: What are your thoughts on your critical thinking abilities and AI?

2•ciwolex•23m ago•5 comments

Piano Learning App focused on Sight-reading

http://virtuoso.host.eco.br/app/
2•ltouro•23m ago•2 comments

Show HN: LocksBet – a price comparison tool for prediction markets

https://locksbet.com
1•at-w•24m ago•0 comments

What Is It Like to Be a Bat? [pdf]

https://www.sas.upenn.edu/~cavitch/pdf-library/Nagel_Bat.pdf
5•shadow28•25m ago•0 comments

Apple Announces Maps Feature That Could Bring CarPlay to Tesla

https://www.notateslaapp.com/news/4278/could-this-new-apple-feature-finally-bring-carplay-to-tesla
2•dabinat•26m ago•0 comments

The Music Understanding framework [video]

https://developer.apple.com/videos/play/wwdc2026/253/
2•gok•26m ago•0 comments

Show HN: CtxGov – a local claim firewall for AI memory claims

https://ctxgov.github.io/ctxgov/try-in-5-minutes.html
1•LuxBennu•27m ago•0 comments

Beyond Enumerable: For Want of Better Windows

https://baweaver.com/writing/2026/05/31/beyond-enumerable-for-want-of-better-windows/
1•kurinikku•27m ago•0 comments

Breaking the Ice: Analyzing Cold Start Latency in vLLM

https://arxiv.org/abs/2606.07362
2•matt_d•29m ago•0 comments