frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Cisco open sources toolkit for tracing AI model lineage

https://blogs.cisco.com/ai/model-provenance-kit
1•hsanthan•4m ago•0 comments

Swival: A coding agent for any model

https://swival.dev/
1•handfuloflight•4m ago•0 comments

Uantifying Voter Biases in Online Platforms: An Instrumental Variable Approach

https://arxiv.org/abs/1910.00757
1•smooke•5m ago•0 comments

Steep fertilizer and fuel prices could squeeze US farmers for months to come

https://www.wpr.org/news/steep-fertilizer-fuel-prices-squeeze-us-farmers-months-come
1•_tk_•7m ago•0 comments

Show HN: Vanilla-scroll-sky: CSS-only modern scroll-driven storytelling sections

https://github.com/ulrischa/vanilla-scroll-sky
1•ulrischa•7m ago•0 comments

Migrating from Supabase

https://blog.val.town/blog/migrating-from-supabase/
2•gurjeet•10m ago•0 comments

Do we even need a better GitHub?

https://www.aviator.co/blog/do-we-even-need-a-better-github/
2•tonkkatonka•10m ago•0 comments

Stable Specialization in Rust

https://goldstein.lol/posts/stable-specialization/
1•PaulHoule•10m ago•0 comments

Claude will use all SpaceX Colossus datacenter capacity

https://twitter.com/NVIDIAAI/status/2052082412994383936
4•kristianpaul•12m ago•1 comments

When do we know someone has died

https://blog.computationalcomplexity.org/2026/05/when-do-we-know-someone-has-died.html
1•speckx•13m ago•0 comments

Multipath Reliable Connection spec published

https://www.opencompute.org/documents/ocp-mrc-1-0-pdf
1•jabl•13m ago•0 comments

Olaf: Bringing an Animated Character to Life in the Physical World

https://arxiv.org/abs/2512.16705
1•programd•13m ago•0 comments

Bell Laboratories Record (August 1941) [pdf]

https://www.worldradiohistory.com/Archive-Bell-Laboratories-Record/40s/Bell-Laboratories-Record-1...
2•zuhayeer•15m ago•0 comments

MIT’s virtual violin offers luthiers a new design tool

https://arstechnica.com/science/2026/05/mits-virtual-violin-offers-luthiers-a-new-design-tool/
2•smushy•15m ago•0 comments

Free and Simple Chess Analysis

https://www.g6chess.com/
1•mantegna•15m ago•1 comments

Supercomputer networking to accelerate large scale AI training

https://openai.com/index/mrc-supercomputer-networking/
1•dataking•16m ago•0 comments

xAI will be dissolved as a separate company

https://twitter.com/elonmusk/status/2052105373621121284
1•break_the_bank•17m ago•0 comments

Learning Advanced JavaScript (2008)

https://johnresig.com/apps/learn/
1•downbad_•19m ago•1 comments

Gypsy Woman Hardware Live Jam (2023) [video]

https://www.youtube.com/watch?v=_SSXALxZ3Hs
1•elvis70•20m ago•0 comments

Mainframe modernization is no longer optional for the AI-driven enterprise

https://thenewstack.io/open-mainframe-enterprise-modernization/
2•rbanffy•21m ago•0 comments

Let's Get EFF to Accept Monero Donations

https://monerocoalition.org/lets-get-eff-to-accept-monero-donations/
5•Cider9986•22m ago•0 comments

You can make more money buying MTG cards than the lottery

https://meadow.cafe/blog/0073-you-can-make-more-money-buying-mtg-cards-than-the-lottery/
2•speckx•25m ago•0 comments

Go-joker – a much faster Clojure interpreter written in Go and WASM

https://rcarmo.github.io/projects/go-joker/
7•rcarmo•26m ago•0 comments

ZAYA1-8B: Frontier intelligence density, trained on AMD

https://www.zyphra.com/post/zaya1-8b
3•mseri•27m ago•0 comments

Shadow – find which prompt change broke your AI agent

https://github.com/manav8498/Shadow
2•manav8498•27m ago•0 comments

Upcoming El Niño: The World Is About to Get a Preview of Life in 2035

https://www.nytimes.com/2026/05/06/opinion/el-nino-climate.html
5•puttycat•30m ago•0 comments

Planting Trees and Dreaming of Software

https://jerodsanto.net/2026/05/planting-trees-software-dreams/
1•herbertl•30m ago•0 comments

Hackers Hate AI Slop More Than You Do

https://www.wired.com/story/cybercriminals-are-complaining-about-ai-slop-flooding-their-forums/
9•aledevv•32m ago•1 comments

An aggregate of payment usage data released by businesses that accept Monero

https://monerostats.org/
1•Cider9986•32m ago•0 comments

A Fundamental FX Factor Model

https://dm13450.github.io/2026/04/19/A-Fundamental-FX-Factor-Model.md.html
1•dm13450•32m ago•0 comments