Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Trap Street

https://en.wikipedia.org/wiki/Trap_street
1•ivanjermakov•2m ago•0 comments

Why are some people mosquito magnets? Clues are emerging

https://phys.org/news/2026-05-people-mosquito-magnets-clues-emerging.html
1•jnord•2m ago•0 comments

Show HN: I made a screen recording app to make demos like an Apple commercial

https://shotglass.app
4•jakemanger•5m ago•0 comments

Satteri – High-Performance Markdown and MDX Processing for the JavaScript

https://github.com/bruits/satteri
2•todotask2•6m ago•0 comments

GitHub Launchpad Proposal

https://github.com/liamromanis101/Github_Strategic_Proposal-2026
2•lromanis•7m ago•0 comments

FaceTime Without the Internet [video]

https://www.youtube.com/watch?v=0b1XL91-q48
2•marklit•8m ago•0 comments

OpenAI, Microsoft and Friends Build a Better, More Scalable Ethernet

https://www.nextplatform.com/connect/2026/05/12/openai-microsoft-and-friends-build-a-better-more-...
3•rbanffy•10m ago•0 comments

Cuckooland – Tom Burgis on the abuse of power and influence (2024)

https://www.ft.com/content/b82c15dd-0c63-4657-a2e7-488ce30b8afc
3•robtherobber•13m ago•0 comments

Google's Android-powered laptops are called Googlebooks, and coming this year

https://arstechnica.com/gadgets/2026/05/googles-android-powered-laptops-are-called-googlebooks-an...
3•rbanffy•14m ago•0 comments

Hysteria: A QUIC-Based Proxy Designed to Resist Censorship

https://github.com/apernet/hysteria
4•steveharing1•16m ago•0 comments

Bun is being ported to Rust using Claude. Here's a code review using GPT

https://github.com/Swival/security-audits/blob/main/bun-rust/README.md
2•jedisct1•18m ago•0 comments

Alien

https://typesetinthefuture.com/2014/12/01/alien/
3•tosh•18m ago•0 comments

AllSkyKamera: A Citizen-Science Network for Global Night Sky Monitoring

https://allskykamera.space/index.php?lang=en
3•ptrsrtp•20m ago•0 comments

Show HN: Recursant, a mesh-based control plane for AI agents

https://github.com/ajensenwaud/recursant
2•hestefisk•22m ago•0 comments

OpenCL 3.1

https://www.khronos.org/blog/opencl-3.1-is-here
3•tosh•24m ago•0 comments

Valve snuck a Wilhelm scream Easter egg into the new Steam Controller [video]

https://www.youtube.com/watch?v=tw5Luf_7F8c
2•HelloUsername•27m ago•0 comments

Terence Tao: New mathematical workflows [video]

https://www.youtube.com/watch?v=Uc2zt198U_U
1•energy123•30m ago•0 comments

Codex Computer Use

https://developers.openai.com/codex/app/computer-use
1•tr33house•30m ago•1 comment

AluminiumOS, by Google: Android Reimagined for the Desktop

https://aluminium-os.com/
9•brysonreece•34m ago•14 comments

Urlsify.com Made This Free to Use URL Shortener with Indepth Analytics

https://old.reddit.com/r/sideprojects/comments/1tabelm/finished_making_this_url_shortener_complet...
1•godlymod•35m ago•0 comments

Hantavirus Map

https://hantavirusmap.net/
1•leonvonblut•39m ago•0 comments

Wrote this for humans, now I use it as a prompt

https://x-x.codes/posts/supplementary-guide-to-code-reviews
1•alex_x•40m ago•1 comment

Sick of Ads on Free QR Generator

https://miqr.mx/
1•rubiocanino•41m ago•2 comments

Genera OS

https://wiki.c2.com/?GeneraOs
2•tosh•43m ago•0 comments

Leak reveals Google's Aluminium OS with a 16-minute video

https://www.androidauthority.com/google-aluminium-os-leak-3665979/
4•thunderbong•43m ago•0 comments

The Role of HubSpot in Driving Successful CRM Adoption in Africa

https://amdan.pro/the-role-of-hubspot-in-driving-successful-crm-adoption-in-africa/
1•amdanmerit•45m ago•0 comments

Tep: A Sinatra-flavoured framework that compiles to a native binary via Spinel

https://github.com/oripekelman/tep
1•futurecat•48m ago•0 comments

Show HN: Design posters showcasing your country's electrical grid

https://github.com/open-energy-transition/grid2poster
1•lyoncy•53m ago•0 comments

European governments: 3.000 tracking sites, 1.000 phpMyAdmins, and 99% poorly

https://internetcleanup.foundation/2026/05/european-governments-3000-tracking-sites-1000-phpmyadm...
44•aequitas•57m ago•6 comments

A Brazilian Space Launch System for the Small Satellite Market

https://www.mdpi.com/2226-4310/6/11/123
1•rbanffy•58m ago•0 comments