frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

10x More Selective (2013)

https://yosefk.com/blog/10x-more-selective.html
1•tosh•3m ago•0 comments

Superficie – Surface Syntax for Clojure to Help Exposition/Onboarding

https://github.com/replikativ/superficie
1•TheWiggles•3m ago•0 comments

Mistral AI Acquires EU Physics AI Startup Emmi AI

https://www.reuters.com/business/autos-transportation/mistral-ai-buys-austrian-physics-ai-startup...
1•ottomengis•7m ago•0 comments

Claude Code in a Loop

https://github.com/quazardous/aiball
1•quazarzero•11m ago•0 comments

Portability Is a Myth: Why the Best AI Stacks Will Never Be Hardware-Agnostic

https://twitter.com/PatrickToulme/status/2055709800986780028
2•pretext•11m ago•0 comments

Show HN: Viberia – Civ/Polytopia-like command center for AI agents (BYOK/BYOS)

https://getviberia.com/
1•bhu8•12m ago•0 comments

Pizza Hut's AI system caused 'cascading' problems and $100M in damages

https://www.businessinsider.com/pizza-hut-ai-system-dragontail-lawsuit-franchisee-2026-5
1•kombookcha•16m ago•2 comments

Free crypto toolbox with 900 online tools

https://sevendiv.com/
1•m_mgkh•18m ago•0 comments

The Definitive Guide to Helping People Trapped in a Cult

https://www.psychologytoday.com/us/blog/freedom-mind/202104/the-definitive-guide-helping-people-t...
1•jacquesm•19m ago•0 comments

The cut in the Mixture of Experts compute graph

https://idlemachines.co.uk/essays/mixture-of-experts
1•smaddrellmander•25m ago•0 comments

State of AI 2026 survey results

https://2026.stateofai.dev/en-US
1•mycroft_4221•26m ago•0 comments

Evolving specs (2025)

https://github.com/github/spec-kit/discussions/152
1•pramodbiligiri•27m ago•0 comments

CISA Admin Leaked AWS GovCloud Keys on GitHub

https://krebsonsecurity.com/2026/05/cisa-admin-leaked-aws-govcloud-keys-on-github/
2•LelouBil•29m ago•0 comments

Hyperpolyglot ML Dialects and Haskell: SML, OCaml, F#, Haskell

https://hyperpolyglot.org/ml
1•lambertsimnel•33m ago•0 comments

AI agents for regulatory use cases: competition on the EU AI Act

https://regenold.com/landing/ai/eu-ai-act-competition
1•marco_virgolin•34m ago•2 comments

The just-say-no engineer was a ZIRP phenomenon

https://www.seangoedecke.com/the-just-say-no-engineer-was-a-zirp-phenomenon/
2•theanonymousone•41m ago•0 comments

Governments are ruining the internet to protect kids

https://www.neowin.net/editorials/governments-are-ruining-the-internet-to-protect-kids-but-there-...
3•bundie•41m ago•0 comments

Mounting Git commits as folders with NFS

https://jvns.ca/blog/2023/12/04/mounting-git-commits-as-folders-with-nfs/
2•pvtmert•42m ago•1 comments

Korean bill seeks strict watermark mandate on AI-generated content

https://www.koreatimes.co.kr/business/tech-science/20260517/korean-bill-seeks-strict-watermark-ma...
3•01-_-•44m ago•0 comments

A new EDIT tool for LLM agents

https://antirez.com/news/166
4•surprisetalk•45m ago•1 comments

Darwin Family: MRI-Trust-Weighted Evolutionary Merging

https://arxiv.org/abs/2605.14386
1•cheuv•45m ago•0 comments

Brain "Bypass" Technology Could Transform Treatment for Neurological Disorders

https://scitechdaily.com/new-brain-bypass-technology-could-transform-treatment-for-neurological-d...
2•01-_-•46m ago•0 comments

My domain got abused on GitHub Pages

https://meertens.dev/blog/github-enables-domain-abuse/
3•rmeertens•46m ago•0 comments

How We Hacked Our Way to Free 4.0s and Took Over a uWaterloo & UofT Grading Tool

https://xtra.sh/blog/markus/
1•xtra1•47m ago•0 comments

How to Select a Mobile OS

https://blog.gridranger.dev/mobile-oses-featuring-fairphone-5/
1•vinhnx•48m ago•0 comments

Why Elon Musk lost his suit against OpenAI

https://www.technologyreview.com/2026/05/18/1137488/elon-musk-suit-openai-verdict/
3•joozio•49m ago•0 comments

Topological Relativity Theory: A Quantum Gauge Field Framework

https://zenodo.org/records/20262720
1•kisnorbert•50m ago•0 comments

Meta lays out details of May 20 restructuring in internal document

https://www.reuters.com/world/meta-lays-out-plans-may-20-layoffs-restructuring-internal-document-...
3•theonionspeaks•51m ago•0 comments

My 40-liter backpack travel guide

https://vitalik.eth.limo/general/2022/06/20/backpack.html
2•bushwart•55m ago•2 comments

Kerf (Kerf1)

https://github.com/kevinlawler/kerf1
1•tosh•56m ago•0 comments