frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Little-known Bay Area city will keep San Francisco in its airport's name

https://sfstandard.com/2026/04/28/oak-sfo-reach-naming-settlement/
1•mikhael•58s ago•0 comments

Before GitHub

https://lucumr.pocoo.org/2026/4/28/before-github/
1•mlex•1m ago•0 comments

The Sad Story of Heisenberg's Doctoral Oral Exam (1998)

https://www.aps.org/archives/publications/apsnews/199801/heisenberg.cfm
1•downbad_•1m ago•1 comments

Hundred Rabbits

https://en.wikipedia.org/wiki/Hundred_Rabbits
1•quijoteuniv•2m ago•0 comments

Critical Vulnerability with CPanel and WHM Login Authentication

https://support.cpanel.net/hc/en-us/articles/40073787579671-Critical-Vulnerability-with-cPanel-WH...
3•ollybee•2m ago•0 comments

AI Wellbeing: Measuring and improving the functional pleasure and pain of AIs

https://www.ai-wellbeing.org/
1•anematode•4m ago•0 comments

Walmart Upstream Facility Services: What We Do

https://public.walmart.com/content/wp/upstream-home/WhatWeDo.html
1•petethomas•5m ago•0 comments

Donating Agent Payments Protocol to the Fido Alliance

https://blog.google/products-and-platforms/platforms/google-pay/agent-payments-protocol-fido-alli...
1•pentagrama•5m ago•0 comments

Preparing for the AI-Enhanced Attacker and the Impact on CISOs

https://www.armadin.com/blog-posts/prepare-for-the-ai-enhanced-attacker
1•wslh•7m ago•0 comments

Does it work, or does it keep working?

https://shippingbytes.com/2026/04/22/does-it-work-or-does-it-keep-working/
2•gianarb•8m ago•0 comments

Anglo-Persian Oil Company

https://en.wikipedia.org/wiki/Anglo-Persian_Oil_Company
2•notepad0x90•8m ago•0 comments

MeetingEscape – vibe-coded panic button for escaping meetings

https://meetingescape.net/
1•Enginehead_•8m ago•0 comments

Confectionary Satellites

https://zenodo.org/records/19799778
1•svnt•9m ago•0 comments

Denuvo has been cracked in all single-player games it previously protected

https://www.tomshardware.com/video-games/pc-gaming/denuvo-has-been-bypassed-in-all-single-player-...
2•oceansky•10m ago•0 comments

Vompeccc from Scratch: Picking Produce with ICR in Emacs

https://www.chiply.dev/post-vompeccc-fruits
1•chiply•11m ago•1 comments

Router has been narrating your browsing habits this whole time

https://www.makeuseof.com/your-isp-watching-your-browsing-whole-time-heres-windows-11-fix/
1•galaxyLogic•12m ago•0 comments

CATL secures 60 GWh sodium-ion battery order with Hyperstrong

https://www.pv-magazine.com/2026/04/28/catl-secures-worlds-largest-sodium-ion-battery-order-with-...
3•konschubert•13m ago•1 comments

How well does S3 checkpointing hold up when running Airflow on spot?

https://spot.rackspace.com/blog/building-fault-tolerant-airflow-pipelines-on-spot-infrastructure
1•aleroawani•14m ago•1 comments

Why the Chicken Crossed the Road, According to Various Entities

https://dynomight.net/chicken/
1•webninja•15m ago•0 comments

First Amendment advocates blast the FCC's early review of ABC broadcast licenses

https://www.nbcnews.com/news/us-news/first-amendment-advocates-blast-fccs-early-review-abc-broadc...
3•ceejayoz•15m ago•0 comments

AMD's Lemonade SDK 10.3 Now 10x Smaller by Getting Rid of Electron

https://www.phoronix.com/news/AMD-Lemonade-10.3
3•canucker2016•15m ago•1 comments

Ask HN: What happens when you paste a screenshot, and ask questions in LLM?

1•orsenthil•16m ago•2 comments

Building simulations and/or digital twins with AI

https://github.com/plugboard-dev/plugboard
1•tjc45•18m ago•1 comments

Show HN: PeopleMesh, Semantic Search for People

https://github.com/francescopace/peoplemesh
1•francescopace•21m ago•0 comments

BP profits more than double as Iran war sends oil prices higher

https://www.bbc.com/news/articles/c2eveyvgn9no
19•breve•28m ago•3 comments

Trillions per Second

https://github.com/c4chaos-io/trillions-per-second
1•kordlessagain•29m ago•1 comments

Japan Airlines trials humanoid robots as ground handlers

https://www.bbc.com/news/articles/cpwp87j1llvo
2•breve•32m ago•0 comments

FDA turns up heat on Amgen, proposing to rescind approval of Tavneos

https://www.fiercepharma.com/pharma/fda-puts-more-heat-amgen-proposing-rescind-approval-tavneos
1•randycupertino•32m ago•1 comments

Reviving Teletext for Ham Radio

https://spectrum.ieee.org/reviving-teletext-for-ham-radio
1•austinallegro•33m ago•0 comments

How electronic warfare is sowing confusion in cockpits

https://www.cnn.com/2026/04/28/science/gps-jamming-plane-navigation-problems
2•breve•34m ago•0 comments