Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
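
As a rough illustration of the "one multi-step run, single solution" idea described above, here is a minimal sketch of a propose/test/self-correct loop. This is not Refact.ai's actual code: `run_agent`, `RunResult`, `propose_patch`, `run_tests`, and the toy stand-ins are all hypothetical names made up for this example, and the real agent picks tools dynamically rather than following a fixed loop.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class RunResult:
    patch: Optional[str]          # the single candidate solution, or None
    steps: int                    # how many iterations the run used
    log: List[str] = field(default_factory=list)


def run_agent(
    task: str,
    propose_patch: Callable[[str, List[str]], str],
    run_tests: Callable[[str], Optional[str]],
    max_steps: int = 5,
) -> RunResult:
    """One multi-step run: propose, test, feed failures back, repeat."""
    feedback: List[str] = []
    log: List[str] = []
    for step in range(1, max_steps + 1):
        patch = propose_patch(task, feedback)
        error = run_tests(patch)      # None means the tests passed
        if error is None:
            log.append(f"step {step}: tests passed")
            return RunResult(patch=patch, steps=step, log=log)
        log.append(f"step {step}: {error}")
        feedback.append(error)        # input for the next self-correction
    return RunResult(patch=None, steps=max_steps, log=log)


# Toy stand-ins for the model call and the test runner (hypothetical).
def toy_propose(task: str, feedback: List[str]) -> str:
    return "fix-v2" if feedback else "fix-v1"


def toy_tests(patch: str) -> Optional[str]:
    return None if patch == "fix-v2" else "test_foo failed"


result = run_agent("resolve issue", toy_propose, toy_tests)
print(result.patch, result.steps)  # fix-v2 2
```

The loop returns exactly one patch per task (or none if the step budget runs out), which mirrors the single-solution claim. In practice the model call and the test runner would replace the toy functions.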

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

The Death of OpenAI's Whistleblower Makes No Sense: What Happened to Suchir [video]

https://www.youtube.com/watch?v=v5WgQHCPB8Q
1•Imustaskforhelp•49s ago•0 comments

We're burning the future to simulate intelligence. Aether is the alternative

https://github.com/stillsilent22-spec/Aether-
1•Trybetter•5m ago•0 comments

OCP – Use your Claude Pro/Max subscription as an OpenAI-compatible API ($0 extra)

https://github.com/dtzp555-max/openclaw-claude-proxy
1•dtzp555-max•6m ago•1 comments

PicoZ80 Is a Drop-In Replacement for Everyone's Favorite Zilog CPU

https://hackaday.com/2026/03/23/picoz80-is-a-drop-in-replacement-for-everyones-favorite-zilog-cpu/
1•neomech•12m ago•0 comments

March, 19-21: God is a comedian

https://no01.substack.com/p/march-19-21-god-is-a-comedian
1•tastyface•19m ago•0 comments

Show HN: Knitting – shared-memory function calls for JavaScript workers

https://knittingdocs.netlify.app/
1•mimiMonads•20m ago•0 comments

MagicAudio – Free Noise, Echo and Background Music Remover

https://magicaudio.pro/
2•polayan•21m ago•0 comments

Mixing Post-Quantum KEMs into Noise

https://runxiyu.org/comp/nkem1/
1•runxiyu•25m ago•0 comments

Modular 26.2: Image Generation and Upgraded AI Coding with Mojo

https://www.modular.com/blog/modular-26-2-state-of-the-art-image-generation-and-upgraded-ai-codin...
1•tosh•26m ago•0 comments

A Billionaire-Backed Startup Wants to Grow 'Organ Sacks' to Replace Animal Test

https://www.wired.com/story/a-billionaire-backed-startup-wants-to-grow-organ-sacks-to-replace-ani...
1•joozio•26m ago•0 comments

Women Are Falling in Love with A.I. It's a Problem for Beijing

https://www.nytimes.com/2026/02/26/technology/china-ai-dating-apps.html
1•Markoff•31m ago•0 comments

What Does the AGPL Require?

https://runxiyu.org/comp/agpl/
3•runxiyu•33m ago•1 comments

A simple feed to keep up with AI drops

https://www.a2i.now/
1•markke•33m ago•0 comments

Low-level Git plumbing library in pure Go

https://github.com/runxiyu/furgit
1•runxiyu•34m ago•0 comments

AI Proteomics Competition 2026 – $13K Prize, Internships and Compute Support

https://www.bohrium.com/competitions/9813928053?tab=introduce
1•choubao•35m ago•1 comments

Opera: Rewind The Web to 1996 (Opera at 30)

https://www.web-rewind.com
2•thushanfernando•36m ago•0 comments

CDP Alternatives for Startups and Small Teams

https://www.sentohq.com/posts/cdp-alternatives-small-teams
2•adrved•37m ago•0 comments

The Death of Character in Game Console Interfaces

https://vale.rocks/posts/game-console-interfaces
1•rockstar2001•39m ago•0 comments

Papal-American Tax Problems and a Solution

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5340391
1•nxobject•40m ago•1 comments

Nvidia CEO Jensen Huang says 'I think we've achieved AGI'

https://www.theverge.com/ai-artificial-intelligence/899086/jensen-huang-nvidia-agi
3•iugtmkbdfil834•40m ago•1 comments

Show HN: ProofShot – Give AI coding agents eyes to verify the UI they build

https://proofshot.argil.io/
4•jberthom•43m ago•0 comments

Helios: 14B open source video model, real time at 19.5fps, runs on 6GB VRAM

https://github.com/PKU-YuanGroup/Helios
1•steveharing1•44m ago•0 comments

How good is Claude, really?

https://alinpanaitiu.com/blog/how-good-is-claude-really/
1•imaq•45m ago•0 comments

Show HN: Kern – One agent. One folder. One mind. Every channel

https://github.com/oguzbilgic/kern-ai
1•obilgic•46m ago•0 comments

Liquid Glass Is Permanent

https://mjtsai.com/blog/2026/03/23/liquid-glass-is-permanent/
2•imaq•46m ago•0 comments

A Minimal NixOS Config That Still Feels Premium

https://slicker.me/nixos/premium_minimal.html#premium
1•weatherlight•49m ago•0 comments

JP Morgan Monitors Employees' Keystrokes and Meetings, for Their Wellbeing

https://www.inc.com/moses-jeanfrancois/jp-morgans-junior-banker-tech-monitoring/91319918
3•tuananh•50m ago•2 comments

Show HN: Prompts Directory for Data Analyst

https://mljar.com/ai-prompts/data-analyst/
1•pplonski86•53m ago•0 comments

Ask HN: How are you monitoring what OpenClaw does when it runs autonomously?

1•jialu1•53m ago•0 comments

Tech Founders Can Access Investors and What Davos and Tulum Have to Do with It

https://irishtechnews.ie/how-tech-founders-can-access-investors/
1•ybelkin•54m ago•0 comments