Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
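
To make the plan / execute / test / self-correct idea concrete, here is a minimal sketch of what a single multi-step run could look like. It is illustrative only and not taken from the Refact.ai codebase: run_agent, RunResult, toy_propose, toy_tests, and the max_steps limit are all hypothetical names and assumptions, and the model and test calls are replaced with toy stand-ins.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class RunResult:
    patch: Optional[str]          # the single final patch, or None if no attempt passed
    attempts: int                 # how many propose/test steps were used
    log: List[str] = field(default_factory=list)


def run_agent(
    task: str,
    propose_patch: Callable[[str, str], str],
    run_tests: Callable[[str], Tuple[bool, str]],
    max_steps: int = 5,           # assumed budget; the real limit is not stated in the post
) -> RunResult:
    """One multi-step run that yields at most one final patch."""
    log: List[str] = []
    feedback = ""                 # test output from the previous step, empty at first
    for step in range(1, max_steps + 1):
        patch = propose_patch(task, feedback)   # plan + execute
        ok, feedback = run_tests(patch)         # test
        log.append(f"step {step}: {'pass' if ok else 'fail'}")
        if ok:
            return RunResult(patch, step, log)  # deliver the first passing patch
        # otherwise loop again with the failure feedback (self-correct)
    return RunResult(None, max_steps, log)


# Toy stand-ins for the model call and the test runner (hypothetical).
def toy_propose(task: str, feedback: str) -> str:
    return "fix-v1" if not feedback else "fix-v2"


def toy_tests(patch: str) -> Tuple[bool, str]:
    if patch == "fix-v2":
        return True, ""
    return False, "1 test failed"


if __name__ == "__main__":
    print(run_agent("demo issue", toy_propose, toy_tests))
```

The point of the sketch is the shape of the loop: failure output is fed back into the next proposal, and the run stops at the first passing patch rather than sampling several candidates and picking one.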

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
