Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
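
To make the workflow above concrete, here is a minimal Python sketch of a single multi-step agent run: an orchestrator repeatedly picks a tool, observes the result, and stops once the tests pass. It is an illustration under stated assumptions, not Refact.ai's actual implementation. `State`, `choose_tool`, `run_agent`, the toy bug, and every tool body are hypothetical stand-ins, and no model is called; in the described system the orchestrator role is played by Claude 3.7 Sonnet and deep_analysis() is powered by o4-mini.

```python
import operator
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

OPS = {"+": operator.add, "-": operator.sub}


@dataclass
class State:
    """Toy task state (hypothetical); a real agent would track a repository."""
    op: str = "-"                 # the "bug": add() currently subtracts
    proposed_fix: str = ""        # filled in by the analysis step
    notes: List[str] = field(default_factory=list)
    tests_pass: bool = False


def explore(state: State) -> str:
    # Stand-in for repository exploration.
    state.notes.append("located add()")
    return "found"


def deep_analysis(state: State) -> str:
    # Stand-in for a reasoning-model call; here the answer is hard-coded.
    state.proposed_fix = "+"
    state.notes.append("add() should use +")
    return "diagnosed"


def modify(state: State) -> str:
    # Stand-in for code modification: apply the proposed fix.
    state.op = state.proposed_fix
    return "patched"


def run_tests(state: State) -> str:
    # Stand-in for a test run: add(2, 3) must equal 5.
    state.tests_pass = OPS[state.op](2, 3) == 5
    return "pass" if state.tests_pass else "fail"


TOOLS: Dict[str, Callable[[State], str]] = {
    "explore": explore,
    "deep_analysis": deep_analysis,
    "modify": modify,
    "run_tests": run_tests,
}

History = List[Tuple[str, str]]


def choose_tool(history: History) -> str:
    """Hypothetical rule-based stand-in for the orchestrator's decision."""
    if not history:
        return "explore"
    last_tool, last_result = history[-1]
    if last_tool == "run_tests" and last_result == "fail":
        return "deep_analysis"
    if last_tool == "deep_analysis":
        return "modify"
    return "run_tests"


def run_agent(state: State, max_steps: int = 10) -> History:
    """One multi-step run: pick a tool, observe, repeat until tests pass."""
    history: History = []
    for _ in range(max_steps):
        if state.tests_pass:
            break
        name = choose_tool(history)
        history.append((name, TOOLS[name](state)))
    return history
```

Calling `run_agent(State())` walks the toy task through exploration, a failing test, analysis, a patch, and a passing test. The step limit is the only guard against an endless loop in this sketch; a real agent would need richer stopping criteria.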

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
