frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Microsandbox: OCI Filesystem 47x Faster

https://microsandbox.dev/blog/oci-filesystem-47x-faster
1•bkfh•1m ago•0 comments

Read-only developer endpoint scanner for on-disk package, extension

https://github.com/perplexityai/bumblebee
1•taubek•2m ago•0 comments

Scotland Yard can keep using live facial recognition on people in London- judges

https://www.theregister.com/security/2026/04/22/high-court-approves-met-polices-facial-recog-afte...
1•gnabgib•4m ago•0 comments

AI Translate All Formats

1•cadic2603•6m ago•0 comments

Cisco Foundry Security Spec: Open specification for agentic security evaluation

https://github.com/CiscoDevNet/foundry-security-spec
1•cpard•6m ago•0 comments

Why Japan has abandoned houses

https://thehustle.co/newsletters/13-05-2026
1•stephsmithio•9m ago•1 comments

Google vs. Perplexity Chrome Extension

https://github.com/sarons/dual-ai-chat
1•cybermango•9m ago•1 comments

Quantum Dynamics Breakthrough Overturns Claim of 'Quantum Supremacy'

https://www.simonsfoundation.org/2026/05/21/quantum-dynamics-breakthrough-overturns-claim-of-quan...
4•SiempreViernes•16m ago•0 comments

Free admission and discounted overnight stays with Parks Canada

https://parks.canada.ca/voyage-travel/conseils-tips/choisis-canada-choose/admission-camping
2•bookofjoe•19m ago•0 comments

Marimo: A Reactive Python Notebook

https://marimo.io
1•pmaddams•19m ago•0 comments

Why Most Senior Devs Plateau, and What to Do

https://stackandscale.substack.com/p/why-most-senior-developers-plateau
3•lucyb0207•22m ago•0 comments

Onfim

https://en.wikipedia.org/wiki/Onfim
3•Michelangelo11•25m ago•0 comments

You will not be a member of the permanent underclass

https://thingofthings.substack.com/p/you-will-not-be-a-member-of-the-permanent
1•paulpauper•28m ago•1 comments

Why reviewing AI-generated code is devilishly hard

https://www.spinellis.gr/blog/20260523/
2•DSpinellis•33m ago•0 comments

The Forgotten Art of the LAN Party (2023)

https://www.superjumpmagazine.com/the-forgotten-art-of-the-lan-party/
1•susam•35m ago•0 comments

Italian authorities shut down major streaming piracy network

https://www.engadget.com/2180075/italian-authorities-shut-down-major-streaming-piracy-network-cin...
3•01-_-•40m ago•0 comments

ANCI: The Agent Infrastructure for Scheduling

https://meetanci.com
1•rajl•40m ago•0 comments

What's in a Codebase?

https://www.moderndescartes.com/essays/codebase_spec/
2•brilee•40m ago•0 comments

Elon, stop trying to make Grok happen

https://www.theverge.com/ai-artificial-intelligence/936219/elon-stop-trying-to-make-grok-happen
4•01-_-•41m ago•2 comments

Verytis – shared error memory for AI coding agents (MCP)

https://www.verytis.com
1•TychiqueY•41m ago•0 comments

Show HN: A satirical idle game about running an AI startup

https://game.trae.academy/
3•haebom•42m ago•0 comments

Show HN: Running BitNet b1.58 inside DRAM by breaking DDR4 timing rules

1•pcdeni•43m ago•0 comments

A Mysterious Children's Search Engine Is Misleading Kids

https://www.city-journal.org/article/kiddle-search-engine-kids
3•bushwart•44m ago•0 comments

NeuralNote

https://github.com/DamRsn/NeuralNote
1•hyperific•46m ago•0 comments

Kanban board web app powered by the Redmine API

https://ricardoborges.github.io/RedKanban/
1•r2ob•46m ago•0 comments

Diátaxis: A systematic approach to technical documentation authoring

https://diataxis.fr/
2•ZeroCool2u•47m ago•0 comments

The Banal Horror of Jimmy Fallon

https://www.currentaffairs.org/news/the-banal-horror-of-jimmy-fallon
3•ZeroCool2u•49m ago•2 comments

User Story

https://beyondloom.com/blog/userstory.html
1•tosh•50m ago•0 comments

It's time to talk about my writerdeck

https://veronicaexplains.net/my-first-writerdeck/
27•hggh•51m ago•11 comments

SafeDB MCP – safer read-only database access for AI agents

https://github.com/narekmalk/safedb-mcp
2•Narek88•51m ago•0 comments