frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Netflix Wiz creates app to slash AI bills, then open sources it

https://www.theregister.com/ai-ml/2026/05/31/netflix-wiz-creates-app-to-slash-ai-bills-then-open-...
2•pseudolus•6m ago•0 comments

What building payment products taught me about scalable financial infrastructure

https://www.solvimon.com/blog/five-lessons-on-building-scalable-financial-infrastructure
1•arnon•6m ago•0 comments

Australia's far-right party leads in national poll for first time

https://www.reuters.com/world/asia-pacific/australias-far-right-party-leads-national-poll-first-t...
1•KnuthIsGod•7m ago•1 comments

Are API keys too much friction for AI tools in teams?

https://intrascope.app/
1•Intrascopeapp•8m ago•0 comments

Fine-tuning an LLM to write docs like it's 1995

https://passo.uno/fine-tuning-docs-llm/
1•theletterf•9m ago•0 comments

Nordstjernen Web Browser

https://nordstjernen.org/
2•andreasrosdal•12m ago•1 comments

Run your first marathon in your 50s, get chased by zombies

https://www.theguardian.com/games/2026/may/29/run-first-marathon-50s-zombies-run-game
1•6LLvveMx2koXfwn•13m ago•0 comments

Student astronomer discovers 'Rosetta Stone' for mysterious cosmic signals

https://phys.org/news/2026-05-student-astronomer-rosetta-stone-mysterious.html
2•pseudolus•16m ago•0 comments

What is the biggest problem you face as a software developer today?

https://docs.google.com/forms/d/e/1FAIpQLSf1M5d2y-0RXEIhrbDBtS5gC900YuzWl43cJCxGUrU38MyeDQ/viewfo...
2•cybertarr•18m ago•0 comments

Memo: Memory as a Model

https://arxiv.org/abs/2605.15156
1•melvinroest•19m ago•0 comments

Slowing Down

https://www.ssp.sh/brain/slowing-down/
2•eigenBasis•20m ago•0 comments

Valve, Gaming's Anticorporate Hero, Has Its Antitrust Moment

https://www.bloomberg.com/news/features/2026-06-01/valve-s-antitrust-reckoning-over-steam-has-ech...
5•igortru•22m ago•0 comments

Proveyouragent: Cryptographic identity for AI agents (Ed25519 and DPoP)

https://github.com/lujainkhalil/proveyouragent
1•lujainkhalil•24m ago•0 comments

Releases Its First World Model–Project Eden

https://twitter.com/tripoai/status/2061307584817385960
3•764261457•28m ago•0 comments

Two LLM UI Patterns That Aren't Chat

https://poyo.co/note/20260525T094605/
4•minikomi•28m ago•0 comments

Colorize Your Kubectl Output

https://github.com/kubecolor/kubecolor
2•ankitg12•30m ago•0 comments

Training a Simple World Model with Jax

https://www.alexinch.com/blog/simple-world-model
1•ainch•33m ago•1 comments

Security and Protection of Data in the IBM System/38

https://dl.acm.org/doi/pdf/10.1145/800053.801932
1•rbanffy•36m ago•1 comments

AI fiction is the new fast food

https://www.washingtonpost.com/opinions/2026/05/31/ai-fiction-is-fast-food-human-mind/
2•KnuthIsGod•38m ago•0 comments

Scientists discover lost range of 'supermountains' 3x longer than the Himalayas

https://www.space.com/supermountains-drove-evolution-on-earth
3•thunderbong•39m ago•0 comments

two strangers. one call. no names

https://just2voices.com/
2•whatis1215•42m ago•1 comments

Why are large language models so terrible at video games?

https://spectrum.ieee.org/ai-video-games-llms-togelius
10•sxx0•44m ago•7 comments

Meta Fixes Instagram AI Flaw Used in Account Takeovers

https://sqmagazine.co.uk/meta-fixes-instagram-ai-flaw-account-takeovers/
1•mmsc•44m ago•0 comments

Intention – The Next Layer of Abstraction

https://intelligence.bearblog.dev
1•GlowingFern•45m ago•0 comments

Rainbow Query Language

https://rbql.org/
2•shakna•52m ago•0 comments

Exec into Node via Kubectl

https://github.com/kvaps/kubectl-node-shell
1•ankitg12•53m ago•0 comments

An AI native hedge fund

https://github.com/achaljhawar/1rok
2•satoshiclad•53m ago•0 comments

The Seven-Action Documentation Model

https://passo.uno/seven-action-model/
1•eigenBasis•54m ago•0 comments

Package Manager for Kubectl Plugins

https://krew.sigs.k8s.io/plugins/
1•ankitg12•54m ago•0 comments

Tongan Castaways

https://en.wikipedia.org/wiki/Tongan_castaways
2•mpweiher•54m ago•0 comments