Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
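
As a rough illustration of the plan / execute / test / self-correct loop described above, here is a minimal sketch. It is not Refact.ai's actual code: the `Agent` class, the tool names (`explore`, `patch`, `run_tests`), and the scripted orchestrator are all hypothetical stand-ins. In a real agent the orchestrator would be an LLM call and the tools would touch a real repository.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration only; not Refact.ai's API.
Tool = Callable[[str], str]
Step = Tuple[str, str, str]  # (tool name, argument, result)


@dataclass
class Agent:
    tools: Dict[str, Tool]
    # Stand-in for the LLM orchestrator: given the history so far, it picks
    # the next (tool name, argument), or ("done", final answer) to stop.
    orchestrator: Callable[[List[Step]], Tuple[str, str]]
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self) -> str:
        """Run one multi-step attempt and return a single final answer."""
        for _ in range(self.max_steps):
            name, arg = self.orchestrator(self.history)
            if name == "done":
                return arg
            result = self.tools[name](arg)
            self.history.append((name, arg, result))
        raise RuntimeError("step budget exhausted without a solution")


def scripted_orchestrator(history: List[Step]) -> Tuple[str, str]:
    """Toy policy: explore, patch, test, and re-patch while tests fail."""
    if not history:
        return ("explore", "find failing module")
    last_tool, _, last_result = history[-1]
    if last_tool in ("explore", "patch"):
        if last_tool == "explore":
            return ("patch", "attempt 1")
        return ("run_tests", "")
    if last_tool == "run_tests":
        if last_result == "pass":
            return ("done", "patch accepted")
        attempts = sum(1 for step in history if step[0] == "patch")
        return ("patch", f"attempt {attempts + 1}")
    raise ValueError(f"unexpected tool in history: {last_tool}")


def demo() -> Tuple[str, List[str]]:
    """Wire up fake tools where the tests only pass after the second patch."""
    patches: List[str] = []

    def explore(arg: str) -> str:
        return "module.py"

    def patch(arg: str) -> str:
        patches.append(arg)
        return "applied"

    def run_tests(arg: str) -> str:
        return "pass" if len(patches) >= 2 else "fail"

    agent = Agent(
        tools={"explore": explore, "patch": patch, "run_tests": run_tests},
        orchestrator=scripted_orchestrator,
    )
    answer = agent.run()
    return answer, [step[0] for step in agent.history]


if __name__ == "__main__":
    print(demo())
```

In this toy run the first patch fails the tests, so the loop issues a second patch before finishing. That mirrors the self-correction idea only in shape, not in substance.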

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
