Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
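
For readers who want a concrete picture of "one multi-step run with tools chosen dynamically", here is a rough sketch of that kind of loop. It is not Refact.ai's code: the class `AgentRun`, the tool names, the stub tools, and the `scripted_policy` function (which stands in for the LLM orchestrator) are all hypothetical and only illustrate the general pattern.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a tool takes a string argument and returns an observation.
Tool = Callable[[str], str]
Step = Tuple[str, str, str]  # (tool name, argument, observation)


@dataclass
class AgentRun:
    """One autonomous run: the orchestrator picks a tool at every step."""
    tools: Dict[str, Tool]
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self, task: str, choose: Callable[[str, List[Step]], Tuple[str, str]]) -> str:
        for _ in range(self.max_steps):
            # The orchestrator sees the task plus everything observed so far.
            tool_name, arg = choose(task, self.history)
            if tool_name == "finish":
                return arg  # the single final solution
            tool = self.tools.get(tool_name)
            observation = tool(arg) if tool else f"unknown tool: {tool_name}"
            self.history.append((tool_name, arg, observation))
        raise RuntimeError("step budget exhausted without a solution")


def make_stub_tools() -> Dict[str, Tool]:
    # Stubs standing in for repository exploration, reasoning, editing, testing.
    return {
        "explore": lambda path: f"listed {path}",
        "deep_analysis": lambda question: f"analysis of: {question}",
        "edit": lambda patch: f"applied {patch}",
        "test": lambda _: "tests passed",
    }


def scripted_policy(task: str, history: List[Step]) -> Tuple[str, str]:
    # Stand-in for the LLM orchestrator: a fixed plan indexed by step count.
    plan = [
        ("explore", "src/"),
        ("deep_analysis", task),
        ("edit", "patch"),
        ("test", ""),
        ("finish", "patch"),
    ]
    return plan[len(history)]


if __name__ == "__main__":
    agent = AgentRun(tools=make_stub_tools())
    print(agent.run("fix failing test", scripted_policy))
    for step in agent.history:
        print(step)
```

In a real agent the policy would be a model call that reads the history and decides what to do next, which is what lets it self-correct after a failed test instead of following a fixed script.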

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Tor v3 .onion addresses are the same Ed25519 pubkey as Solana wallets

https://ouija.social/directory
1•notstacc•1m ago•0 comments

The Book of Mozilla

https://en.wikipedia.org/wiki/The_Book_of_Mozilla
1•TheSilva•1m ago•0 comments

The Internet can't stop watching Figure AI's humanoid robots handling packages

https://arstechnica.com/ai/2026/05/the-internet-cant-stop-watching-figure-ais-humanoid-robots-han...
1•Brajeshwar•1m ago•0 comments

The Hardware Lottery

https://hardwarelottery.github.io/
1•intelkishan•4m ago•0 comments

Show HN: We wrote forensic intelligence reports on 20 open-source codebases

https://github.com/zero-intelligence/zero-intel
1•DhruvKumarJha•4m ago•0 comments

The surprising story behind the first British person in space

https://www.bbc.com/culture/article/20260518-helen-sharman-the-story-behind-the-first-british-per...
2•xoxxala•5m ago•0 comments

Newborns' cry melody is shaped by their native language

https://pubmed.ncbi.nlm.nih.gov/19896378/
1•thunderbong•6m ago•0 comments

My Hermes and Obsidian Setup and Use Cases

https://metedata.substack.com/p/013-my-hermes-and-obsidian-set-up
1•young_mete•7m ago•0 comments

Show HN: Lance – image/video generation and understanding in one model

https://github.com/bytedance/Lance
2•cleardusk•7m ago•0 comments

micnik – 10s voice message anonymous microblogging

https://micnik.stagas.deno.net/
1•stagas•7m ago•0 comments

You never learned to delegate. AI just made it obvious

https://jeroensangers.com/2026/05/18/you-never-learned-to-delegate.html
1•speckx•7m ago•0 comments

What do you think of my new website?

https://www.movingcompared.co.uk
1•ty00001•7m ago•0 comments

I built 2k badges for Replay and I'm not sure I've slept since December

https://temporal.io/blog/badges-for-replay-and-i-havent-slept-since-december
1•raybb•8m ago•1 comment

LightSolver Partners with Boeing

https://www.photonics.com/Articles/LightSolver-Partners-with-Boeing/a72200
1•ramon156•8m ago•0 comments

Show HN: Zero Protocol – A personal context file for AI

https://github.com/zero-intelligence/zero-protocol
1•DhruvKumarJha•9m ago•0 comments

U.S. PV manufacturing capex could reach $7B in 2027 in breakout year

https://www.pv-magazine.com/2026/05/20/u-s-pv-manufacturing-capex-could-reach-7-billion-in-2027-i...
2•ndr42•10m ago•0 comments

Show HN: Chatroom with curl command (requires IPv6)

https://chat.est.im/ai/hackernews
1•est•12m ago•0 comments

Career coach based on psychological tests (desktop,BYOK)

https://yggdra.garden/
1•teslavr•12m ago•0 comments

SBCL: The Assembly Code Breadboard (2014)

https://pvk.ca/Blog/2014/03/15/sbcl-the-ultimate-assembly-code-breadboard/
1•yacin•12m ago•0 comments

From Skills Back to Tools: Why Our Dashboard Assistant Moved Off the Claude Code

https://blog.promptlayer.com/from-skills-back-to-tools-why-our-dashboard-assistant-moved-off-the-...
1•jonpon•13m ago•0 comments

StrangerLoops

https://strangerloops.com
1•alanbotts•13m ago•0 comments

The brain's code seems to be in constant flux. Neuroscientists are baffled

https://www.nature.com/articles/d41586-026-01554-0
2•Brajeshwar•16m ago•0 comments

AI-generated abandonware is hollowing out open source

https://leaddev.com/software-quality/ai-generated-abandonware-is-hollowing-out-open-source
2•chhum•17m ago•0 comments

Chat client for Meshtastic LoRa mesh networks in Emacs

https://git.andros.dev/andros/meshtastic.el
2•andros•18m ago•0 comments

Localgcp: LocalStack for GCP, emulating 14 Google Cloud services locally

https://github.com/slokam-ai/localgcp
1•linkdd•19m ago•0 comments

How much should we worry about secretly loyal AIs?

https://www.the-substrate.net/p/how-much-should-we-worry-about-secretly
2•erwald•24m ago•0 comments

Digitally stuck on an island with 30 people

https://isle31.com/
1•vincentlenoach•24m ago•0 comments

Formal Verification Gates for AI Coding Loops

https://reubenbrooks.dev/blog/structural-backpressure-beats-smarter-agents/
1•pyrex41•26m ago•1 comment

How Musk Might Defeat the Statute of Limitations Defense

https://chatlaw.substack.com/p/how-musk-might-defeat-the-statute
1•dsubburam•27m ago•0 comments

Alexander Grothendieck Revolutionized 20th-Century Mathematics

https://www.quantamagazine.org/how-alexander-grothendieck-revolutionized-20th-century-mathematics...
1•digital55•27m ago•0 comments