frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: yawac – a macOS client for WhatsApp, Swift, no Electron, no BS

https://github.com/vadika/yawac/
1•vadikas•6m ago•0 comments

We Built a Real-Time Implied Volatility Engine for Commodity Options

https://medium.com/@DolphinDB_Inc/how-we-built-a-real-time-implied-volatility-engine-for-commodit...
1•CrazyTomato•6m ago•0 comments

The Ideal Bestest Base Font Size That Everyone Is Keeping a Secret

https://adrianroselli.com/2024/03/the-ultimate-ideal-bestest-base-font-size-that-everyone-is-keep...
1•ravenical•7m ago•0 comments

Anthropic CEO Dario Amodei Has Only One Direct Report

https://www.bloomberg.com/news/articles/2026-06-10/anthropic-ceo-dario-amodei-is-a-manager-to-onl...
2•petethomas•12m ago•0 comments

Billions in Loans Didn't Make a Dent in Global Poverty

https://www.wsj.com/finance/banking/poverty-microfinancing-loans-entrepreneurs-de458ee8
1•JumpCrisscross•14m ago•0 comments

Show HW: nomd, HTML md editor

https://nomd.dev
1•pcald•15m ago•0 comments

Web Browsers on Video Game Consoles

https://vale.rocks/posts/game-console-browsers
2•robin_reala•16m ago•0 comments

Ex-Board Member Reveals Corruption and Dysfunction at Gnome Foundation

https://lunduke.substack.com/p/ex-board-member-reveals-corruption
2•MrJulia•16m ago•0 comments

Show HN: Corterm – self-hosted remote terminal that survives disconnects

https://github.com/monster-echo/CortexTerminal2
1•rwecho•16m ago•1 comments

V2 Editor (2025)

https://oktana.dev/blog/introducing-v2-editor/
2•rapnie•17m ago•0 comments

Getting Started with Datastar – Build a Rust and Axum Todo App

https://hamy.xyz/blog/2026-03_datastar-rust-todo
1•alex_hirner•17m ago•0 comments

Our AI-slop ad turned out weirdly good [video]

https://www.youtube.com/watch?v=FPgq4eopYcs
1•nxnze•17m ago•1 comments

I made a chess leaderboard that rewards cool checkmates instead of just Elo

https://chessranks.net/
1•nashrashal•20m ago•0 comments

Tiny wasp helps prevent first global bird extinction in Britain for 60 years

https://www.rspb.org.uk/whats-happening/news/tiny-wasp-helps-prevent-the-first-global-bird-extinc...
1•austinallegro•21m ago•2 comments

OT Segmentation: Why the Framework Matters Less Than the Discipline

https://www.emberot.com/resources/blog/ot-segmentation-discipline-framework/
1•TheWiggles•21m ago•0 comments

I added a prompt to future ASI – TLBIC Policy Proposal v5 now available

1•michikawa59•23m ago•0 comments

IBM's Spyre AI Accelerator Deep Dive – By Gavin Bonshor

https://morethanmoore.substack.com/p/ibms-spyre-ai-accelerator-deep-dive
1•rbanffy•25m ago•0 comments

Making a Vintage LLM from Scratch

https://crlf.link/log/entries/260525-1/
2•croqaz•25m ago•1 comments

Collaborative Memory Chrome Extension

https://chromewebstore.google.com/detail/xysq-memory-for-you-and-y/knpcnfdnahkinongbiedcllmigffodpm
1•ximihoque•26m ago•0 comments

Unmasking the Energy Transition Myth [video]

https://www.youtube.com/watch?v=H24Xzi7Xi5I
1•leonidasrup•27m ago•1 comments

The "steroid Olympics" were a circus–and a window into our culture

https://www.technologyreview.com/2026/06/10/1138670/enhanced-games-doping-steroids-hormones-suppl...
1•joozio•30m ago•0 comments

Show HN: SynCodeLive – code and talk with your team along with AI, live

https://syncodelive.com/
1•ketul_shah•32m ago•0 comments

Agentic Coding and Mental Models

https://philbooth.me/blog/agentic-coding-and-mental-models
1•philbo•34m ago•0 comments

Framework delays Laptop 13 Pro due to bugs, but there's a bonus

https://www.pcworld.com/article/3162530/framework-delays-laptop-13-pro-due-to-bugs-but-theres-a-b...
1•cassianoleal•34m ago•0 comments

Why Sell Lifetime Plans, in a Default Subscription World?

https://pketh.org/lifetime-plans.html
1•ZacnyLos•35m ago•1 comments

Extra Time – a retro-newspaper companion for the 2026 World Cup

https://extra-time-wc2026.netlify.app
1•regevaz•37m ago•0 comments

The vulnerability bottleneck has moved

https://evahill1.substack.com/p/the-vulnerability-bottleneck-has
2•evaXhill•38m ago•0 comments

Connected Notes and Writing from Curiosity Turned a Hobby into a Career

https://www.ssp.sh/blog/why-i-still-blog/
2•zazuke•39m ago•0 comments

Unintended Consequences of Video Surveillance

https://spectrum.ieee.org/unintended-consequences-video-surveillance
1•rbanffy•44m ago•0 comments

Intelligence Not Included

https://morrick.me/archives/10319
1•jandeboevrie•44m ago•0 comments