frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

A catastrophic climate event is upon us

https://www.theguardian.com/commentisfree/2026/apr/23/catastrophic-climate-event-scientists-atlan...
1•yrcyrc•1m ago•0 comments

Show HN: Clawd Penguin – a virtual hangout for when Claude goes down

https://clawdpenguin.com
1•ossa-ma•3m ago•0 comments

Specsmaxxing

https://acai.sh/blog/specsmaxxing
1•brendanmc6•4m ago•0 comments

Microsoft Offers Voluntary Retirement to About 7% of US Workers

https://www.bloomberg.com/news/articles/2026-04-23/microsoft-offers-voluntary-retirement-to-about...
1•helsinkiandrew•5m ago•0 comments

Freak Heat Spikes Pay Big on Polymarket, Rousing Weather Nerds' Suspicion

https://www.wsj.com/business/unusual-weather-bets-on-polymarket-spur-french-investigation-b799bec8
1•julienchastang•5m ago•0 comments

Show HN: We're building Apache spark for agents with Rust and Datafusion

https://github.com/SkardiLabs/skardi
1•btnokami•6m ago•0 comments

Google TPU 8i for Inference and TPU 8T for Training Announced

https://www.servethehome.com/google-tpu-8i-for-inference-and-tpu-8t-for-training-announced/
1•teleforce•7m ago•0 comments

Show HN: Code garden deep-dive: my Forth C64 tetromino game

https://github.com/ekipan/sss/blob/share-hn/Design.md
1•ekipan•7m ago•0 comments

The Price of AI Is the Internet

https://vanilla.sh/blog/price-of-ai/
2•speckx•8m ago•0 comments

The NCSC's AI threat warning and the gap in AI agent security

https://agentshield.pro/blog/ncsc-perfect-storm
1•eigenart•8m ago•0 comments

Engineering Architecture: A Syllabus?

https://www.argmin.net/p/engineering-architecture-a-syllabus
1•sebg•8m ago•0 comments

America Cannot Lose the Robotics Race

https://a16z.com/america-cannot-lose-the-robotics-race/
1•nowflux•9m ago•0 comments

The Sleeper in the Payment Stack

https://franktyoung.substack.com/p/the-sleeper-in-the-payments-stack
1•manojr13•9m ago•0 comments

Atlassian to begin using customer metadata and and in-app data to train AI

https://www.atlassian.com/trust/ai/data-contribution/faqs
1•AaronM•11m ago•2 comments

Malicious Checkmarx Artifacts Found in Official KICS Docker Repository

https://socket.dev/blog/checkmarx-supply-chain-compromise
1•darkwater•11m ago•0 comments

Google Workspace Intelligence

https://workspace.google.com/blog/product-announcements/introducing-workspace-intelligence
1•oscarfr•12m ago•0 comments

Math Is Hard

http://miod.online.fr/software/openbsd/stories/vaxfp.html
1•signa11•12m ago•0 comments

Gemini Enterprise for the agentic task force

https://cloud.google.com/blog/products/ai-machine-learning/whats-new-in-gemini-enterprise
1•oscarfr•12m ago•0 comments

Gemini Enterprise Agent Platform

https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise-agent-pl...
1•oscarfr•13m ago•0 comments

Why AI coding speed does not translate into engineering speed

https://blog.reqproof.com/p/ai-writes-your-code-nobody-verifies
1•LeonidBugaev•17m ago•0 comments

Web debugging proxy in your coding agent

https://www.telerik.com/blogs/when-your-coding-assistant-finally-got-x-ray-vision
1•zlatkov•17m ago•0 comments

Netflix Was Held Together with Duct Tape

https://marcrandolph.substack.com/p/netflix-was-held-together-with-duct
2•theorchid•17m ago•0 comments

The Declining Driver's License: Good, Bad, or Both?

https://maxmautner.com/2026/04/21/teen-drivers-license-decline.html
1•mslate•18m ago•0 comments

What Is AI Share of Voice? and Why You Should Care

https://liatbenzur.com/2026/04/22/what-is-ai-share-of-voice/
1•AISupportTeam•18m ago•0 comments

Is AI an Expensive Hobby?

https://adandai.wordpress.com/2026/04/23/is-ai-an-expensive-hobby/
2•allessa•18m ago•0 comments

Lossless Image Compression Architecture for Deep-Space CMOS Cameras

https://www.mdpi.com/2076-3417/16/6/2873
2•PaulHoule•19m ago•0 comments

Original Hello World in "B" Programming Language - Computerphile [video]

https://www.youtube.com/watch?v=cYS57xJuRP8
1•em-bee•19m ago•0 comments

High-Dose Flu Vaccine Linked to Lower Dementia Risk

https://www.medscape.com/viewarticle/high-dose-flu-vaccine-linked-lower-dementia-risk-2026a1000cjf
4•kieranmaine•19m ago•0 comments

Show HN: Sable Found a SQL Injection in a Legacy Financial Portal

https://blog.vulnetic.ai/how-sable-found-a-sql-injection-in-a-legacy-financial-portal-58a329b96b0b
1•danieltk76•21m ago•0 comments

Evolving Distributed Tracing at Uber Engineering

https://www.uber.com/in/en/blog/distributed-tracing/
1•sebg•22m ago•0 comments