frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

SLork (Stanford Laptop Orchestra)

https://slork.stanford.edu/
1•WorldPeas•1m ago•0 comments

A Collection of Hard to Find Pieces of Software

https://www.rarewares.org/rrw/programs.php
2•TigerUniversity•1m ago•0 comments

Decision Guardian: My first open source project

https://github.com/DecispherHQ/decision-guardian
4•poor_hustler•1m ago•0 comments

Show HN: HELmR – A runtime control layer for autonomous agents

https://github.com/helmr-labs/helmr-core
1•systems_arch•3m ago•0 comments

Testing Apple's 2026 16-inch MacBook Pro, M5 Max, and its new performance cores

https://arstechnica.com/gadgets/2026/03/testing-apples-2026-16-inch-macbook-pro-m5-max-and-its-ne...
2•rbanffy•3m ago•0 comments

So you want to write an "app" (2025)

https://arcanenibble.github.io/so-you-want-to-write-an-app.html
1•jmusall•3m ago•0 comments

Smooth UI animations on server-rendered HTML

https://blog.siami.fr/smooth-ui-animations-on-server-rendered-html
1•ksec•4m ago•0 comments

Binex – Debuggable runtime for AI agent pipelines (YAML, trace, replay, diff)

https://github.com/Alexli18/binex
1•alexli1807•5m ago•1 comments

Formalizing Data Structures and Algorithms with Agents

https://risemsr.github.io/blog/2026-03-06-autoclrs/
1•alpaylan•7m ago•0 comments

Is the AI Compute Crunch Here?

https://martinalderson.com/posts/is-the-ai-compute-crunch-here/
1•gmays•8m ago•0 comments

Show HN: A tool that automatically installs Python and common dev libraries

1•Alexpan_dev•8m ago•0 comments

Show HN: AI-Proof Careers Leaderboard

https://github.com/yoyothesheep/ai-resilient-occupations-data
1•yoyothesheep•8m ago•0 comments

LeRobot v0.5.0: Scaling Every Dimension

https://huggingface.co/blog/lerobot-release-v050
2•ibobev•12m ago•0 comments

Ulysses Sequence Parallelism: Training with Million-Token Contexts

https://huggingface.co/blog/ulysses-sp
1•ibobev•12m ago•0 comments

In the '90s Germany's air traffic control ran on Emacs

https://old.reddit.com/r/emacs/comments/lly7po/comment/gnvzisy/
3•clyfe•12m ago•0 comments

Simulating Queueing 2

https://buttondown.com/jaffray/archive/simulating-queueing-2/
2•ibobev•13m ago•0 comments

Trump says Iran 'war is complete,' talks to Putin

https://www.cnbc.com/2026/03/09/trump-iran-war-end.html
4•kamaraju•13m ago•0 comments

Feed Palestine

1•alpple•13m ago•0 comments

Skill to slim down your bloated AGENTS.md file

https://mheadd.github.io/agent-slimmer/
1•mjheadd•13m ago•0 comments

I wrote a OpenClaw Operators Field Guide for operating multi-agent AI systems

https://bethegorilla.com/
2•pathowlett•14m ago•1 comments

Snice – 130 web components and a decorator-based framework

1•hedzer•15m ago•0 comments

Some skills become second nature

https://news.mit.edu/2026/how-some-skills-become-second-nature-0304
1•rbanffy•15m ago•0 comments

One Year with Hyprland

https://www.whileforloop.com/en/blog/2026/03/09/one-year-with-hyprland/
2•wookashh•16m ago•0 comments

Oracle is building yesterday's data centers with tomorrow's debt

https://www.cnbc.com/2026/03/09/oracle-is-building-yesterdays-data-centers-with-tomorrows-debt.html
10•spenvo•18m ago•0 comments

Show HN: Making Codex stop rediscovering the same repository over and over

1•oldskultxo•18m ago•1 comments

Setting Up a Debug Environment for QEMU PCI Device Exploitation

https://varik.dev/blog/htb/nftdrm/debug-env-for-qemu-pwn
1•varik77•19m ago•0 comments

Taara Beam

https://taaraconnect.com/product/beam
3•rglover•20m ago•0 comments

Talking Face Animation Using a Learned Kalman Filter on Mobile Devices

https://www.mdpi.com/1424-8220/26/4/1377
2•PaulHoule•20m ago•0 comments

Show HN: DevToolbox – 13 browser-based dev tools, privacy-first

https://geld-verdienen-app-kbpcmxfq.devinapps.com
1•DevToolboxApp•21m ago•1 comments

Thomas Selfridge: The First Airplane Fatality

https://www.amusingplanet.com/2026/03/thomas-selfridge-first-airplane-fatality.html
6•Hooke•22m ago•1 comments