Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
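To make the plan / execute / test / self-correct loop concrete, here is a minimal toy sketch of a single multi-step run. It is not Refact.ai's code: `AgentState`, `choose_tool`, `run_agent`, and the three stub tools are hypothetical names invented for illustration, and the rule-based `choose_tool` stands in for the orchestrator model picking the next tool.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class AgentState:
    task: str
    patches: int = 0  # number of edits applied so far
    history: List[Tuple[str, str]] = field(default_factory=list)
    done: bool = False


# Hypothetical stub tools; a real agent would touch the repository here.
def explore(state: AgentState) -> str:
    return "located suspect code"


def modify(state: AgentState) -> str:
    state.patches += 1
    return f"applied patch #{state.patches}"


def run_tests(state: AgentState) -> str:
    # Toy rule: the first patch is wrong, the second one passes.
    return "pass" if state.patches >= 2 else "fail"


TOOLS: Dict[str, Callable[[AgentState], str]] = {
    "explore": explore,
    "modify": modify,
    "test": run_tests,
}


def choose_tool(state: AgentState) -> Optional[str]:
    """Stand-in for the orchestrator: pick the next tool from the history."""
    if not state.history:
        return "explore"
    last_tool, last_result = state.history[-1]
    if last_tool == "explore":
        return "modify"
    if last_tool == "modify":
        return "test"
    # Last tool was "test": stop on success, otherwise self-correct.
    return None if last_result == "pass" else "modify"


def run_agent(task: str, max_steps: int = 10) -> AgentState:
    """One run that iterates until tests pass or the step budget runs out."""
    state = AgentState(task=task)
    for _ in range(max_steps):
        tool = choose_tool(state)
        if tool is None:
            state.done = True
            break
        state.history.append((tool, TOOLS[tool](state)))
    return state


if __name__ == "__main__":
    final = run_agent("fix failing unit test")
    for name, result in final.history:
        print(f"{name}: {result}")
```

In this toy run the first test fails, so the loop goes back to `modify` before testing again, which is the self-correction step in miniature.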

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
