frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Texas Space Boom Requires Lots of Lawyers in Boost for Firms

https://news.bloomberglaw.com/business-and-practice/texas-space-boom-requires-lots-of-lawyers-in-...
1•mooreds•4m ago•0 comments

Microsoft Excel Conquered Corporate America

https://www.bloomberg.com/news/articles/2025-12-04/how-microsoft-excel-is-navigating-ai-new-compe...
1•mooreds•5m ago•0 comments

Ask HN: Do you write a technical doc first or just vibe code?

1•brihati•5m ago•0 comments

Advanced Spray Drone and Precision AG Technology

https://agrispraydrones.com
1•mooreds•5m ago•0 comments

Core War

https://en.wikipedia.org/wiki/Core_War
1•simonebrunozzi•8m ago•0 comments

Revealing Traces in Printouts and Scans

https://dys2p.com/en/2022-09-print-scan-traces.html
1•cryzinger•8m ago•0 comments

How to position your shower curtain to reduce mold risk

https://www.washingtonpost.com/wellness/2025/12/02/shower-curtain-humidity-mold/
1•bookofjoe•9m ago•1 comments

People Are Taking This Unapproved New Weight-Loss Drug [Retatrutide]

https://www.wired.com/story/people-are-already-taking-this-unapproved-new-weight-loss-drug-triple...
2•toomuchtodo•11m ago•1 comments

GRIN2A null variants increase early-onset schizophrenia and other disorders

https://www.nature.com/articles/s41380-025-03279-4
1•wjb3•12m ago•0 comments

Ask HN: How do you handle release notes for multiple audiences?

7•glidr_dev•14m ago•1 comments

What is a build system, anyway?

https://jyn.dev/what-is-a-build-system-anyway/
1•todsacerdoti•20m ago•0 comments

More than 9M US borrowers miss student loan payments as delinquencies rise

https://www.ft.com/content/b6ca2ab2-2d3a-40d7-9a61-12a6fda0625d
4•mikhael•22m ago•2 comments

The Drosophila of Decision Science

https://jtpeterson.substack.com/p/the-drosophila-of-decision-science
1•surprisetalk•24m ago•0 comments

We Lost Something: 1970s REPLs Were Better Than Modern Development Environments

https://programmingsimplicity.substack.com/p/we-lost-something-1970s-repls-were
2•surprisetalk•24m ago•0 comments

Energy Predictions 2025

https://caseyhandmer.wordpress.com/2025/12/08/energy-predictions-2025/
2•surprisetalk•24m ago•0 comments

A Multimedia Sketchpad

https://beyondloom.com/blog/sketchpad.html
1•surprisetalk•24m ago•0 comments

Unofficial Advent of Code 2025 Survey Results (with "Emotions" Added)

https://jeroenheijmans.github.io/advent-of-code-surveys/?y=2025
1•jeroenheijmans•25m ago•1 comments

Metagenomic profiling of microbial communities from aircraft filters, face masks

https://link.springer.com/article/10.1186/s40168-025-02276-7
1•PaulHoule•29m ago•0 comments

AI: A Dedicated Fact-Failing Machine, Or, yet Another Reason Not to Trust It

https://whatever.scalzi.com/2025/12/13/ai-a-dedicated-fact-failing-machine-or-yet-another-reason-...
2•calcifer•29m ago•0 comments

ChatGPT – GuardPrompt – PII

https://github.com/guardprompt/GuardPrompt
1•vlkc•32m ago•1 comments

Validate your software architecture before writing code

https://www.simuladordearquitetura.com.br/
1•alexsandronl•32m ago•1 comments

VPN location claims don't match real traffic exits

https://ipinfo.io/blog/vpn-location-mismatch-report
5•mmaia•33m ago•1 comments

Where are we going, IndieWeb?

https://hamatti.org/posts/where-are-we-going-indieweb/
1•freediver•33m ago•0 comments

Curio – AI Toys

https://heycurio.com/
1•domrdy•34m ago•0 comments

Show HN: Befa.ke – Destroy Instagram

https://befa.ke
1•anandbaburajan•35m ago•0 comments

Type Stripping with Zero Dependencies

https://termer.net/blog/type-stripping-with-zero-dependencies/
2•qwm•35m ago•0 comments

PowerLattice Voltage Regulator Boosts AI Energy Efficiency

https://spectrum.ieee.org/voltage-regulator
1•rbanffy•36m ago•0 comments

Everybody but Nvidia and TSMC Has to Make It Up in Volume with AI

https://www.nextplatform.com/2025/12/12/everybody-but-nvidia-and-tsmc-has-to-make-it-up-in-volume...
1•rbanffy•38m ago•0 comments

A trajectory-based approach to recommendation and search for creative content

https://zenodo.org/records/17847529
1•abhi_bhartiya•39m ago•1 comments

Sketch of Ideas in Geometry and Computing

https://nigelvr.github.io/post-1.html
1•nigelvr•41m ago•0 comments