frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

I Thought 2025 Was Cloudy. 26 Years of Data Proved Me Wrong

https://olekwrites.com/cloudy-year-perception-vs-reality/
1•olek•1m ago•0 comments

FAFO: How we stopped worrying and bought an Electron Microscope [video]

https://www.youtube.com/watch?v=zfPYij2-ry0
2•hmelder•1m ago•0 comments

Show HN: Toy Browser Update (January 2026) [video]

https://www.youtube.com/watch?v=4xdIMmrLMLo
1•logicallee•2m ago•0 comments

New maps reveal post-flood migration patterns across the US

https://kinder.rice.edu/urbanedge/fema-buyouts-vs-risky-real-estate-new-maps-reveal-post-flood-mi...
1•toomuchtodo•2m ago•1 comments

Swapping two blocks of memory inside a larger block, in constant memory

https://devblogs.microsoft.com/oldnewthing/20260101-00/?p=111955
1•ingve•3m ago•0 comments

China drafts strictest rules to end AI-encouraged suicide, violence

https://arstechnica.com/tech-policy/2025/12/china-drafts-worlds-strictest-rules-to-end-ai-encoura...
1•thunderbong•4m ago•0 comments

Show HN: Memoria – Spam exists, but can't climb (A local-first protocol)

https://github.com/Kusaneko-Memoria/memoria-protocol
1•Kusaneko•6m ago•1 comments

The peace of a nation no longer besieged by the third world

https://twitter.com/DHSgov/status/2006472108222853298
2•SilverElfin•6m ago•1 comments

NJ buying flood prone properties

https://dep.nj.gov/blueacres/
1•bnolan001•6m ago•2 comments

Public Domain Day 2026 in Literature

https://standardebooks.org/blog/public-domain-day-2026
2•robin_reala•8m ago•0 comments

WireGuard packet relay for NAT traversal

https://github.com/weiiwang01/wpex
1•progval•9m ago•0 comments

Show HN: Guess the Move Chess App

https://guessthemove.app
1•travelhead•10m ago•0 comments

Amy Schumer Moves On, Selling 'Moonstruck' House in Brooklyn

https://www.nytimes.com/2025/12/31/realestate/amy-schumer-moves-on-selling-moonstruck-house-in-br...
1•whack•14m ago•0 comments

TimesWire

https://donohoe.dev/timeswire/
1•donohoe•14m ago•0 comments

A Sparse Transformer with Tunable Emergent Subnetworks

https://github.com/wwes4/ResonanceTransformer
1•wwes369•16m ago•0 comments

Which Power Plant Does My Electricity Come From? [video]

https://www.youtube.com/watch?v=sH1PVVJuBtE
1•keepamovin•17m ago•0 comments

Every LLM hallucinates that std:vector deletes elements in LIFO order

https://am17an.bearblog.dev/every-llm-hallucinates-stdvector-deletes-elements-in-a-lifo-order/
2•am17an•18m ago•0 comments

The Miracle of Microfinance? Evidence from a Randomized Evaluation

https://www.aeaweb.org/articles?id=10.1257/app.20130533
1•haltingproblem•20m ago•0 comments

Ask HN: How Are You Handling Auth in 2026?

3•joshcsimmons•20m ago•0 comments

Amoskeag: F/OSS DSL for business rules - functional language inspired by Ruby

https://github.com/durable-oss/amoskeag
1•djb-at-durable•20m ago•0 comments

AI is not neutral. It judges you [video]

https://www.youtube.com/watch?v=LqiActVUm4Q
1•shine1697•23m ago•0 comments

Show HN: Downmark – Turn webpages into distraction-free Markdown

https://downmark.fly.dev/https%3A%2F%2Fgithub.com%2Fadhipk%2Fdownmark
1•AdhipKashyap•25m ago•0 comments

Back to basics: The foundations that shape everything we design

https://designexplained.substack.com/p/back-to-basics-the-foundations-that
1•kaizenb•26m ago•0 comments

MHC: Manifold-Constrained Hyper-Connections

https://arxiv.org/abs/2512.24880
2•tamnd•29m ago•0 comments

The Entry-Level Hiring Process Is Breaking Down

https://www.theatlantic.com/ideas/2025/12/grade-inflation-ai-hiring/685157/
1•cebert•30m ago•1 comments

AI Window Opportunity

https://www.localghost.ai/inflection
1•0xbrayo•30m ago•0 comments

European Space Agency hit again as cybercriminals claim 200 GB data up for sale

https://www.theregister.com/2025/12/31/european_space_agency_hacked/
12•smurda•32m ago•1 comments

Desugaring the Relationship Between Concrete and Abstract Syntax

https://thunderseethe.dev/posts/desugar-base/
1•PaulHoule•33m ago•0 comments

The Four-Language Waltz: A Tale of Allocators and Regret

https://xlii.space/eng/the-four-language-waltz-a-tale-of-allocators-and-regret/
1•xlii•33m ago•0 comments

FDA clears Caliway's trial of CBL-514 for localised fat reduction

https://www.clinicaltrialsarena.com/news/caliway-cbl-514/
1•daoboy•34m ago•1 comments