Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
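
For readers who want a concrete picture of the plan, execute, test, self-correct cycle described above, here is a minimal Python sketch. It is not Refact.ai's code: `run_agent`, `scripted_policy`, and `make_toy_tools` are hypothetical names, the policy is a fixed rule set standing in for the orchestrator model, and the tools are toy stubs rather than real repository or test tooling.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical shapes: a tool maps a string argument to a string observation,
# and each history entry records (tool name, argument, observation).
Tool = Callable[[str], str]
Step = Tuple[str, str, str]


def run_agent(task: str,
              choose_action: Callable[[str, List[Step]], Tuple[str, str]],
              tools: Dict[str, Tool],
              max_steps: int = 10) -> Optional[str]:
    """Drive one multi-step run; return the final answer, or None if the step budget runs out."""
    history: List[Step] = []
    for _ in range(max_steps):
        name, arg = choose_action(task, history)
        if name == "finish":
            return arg
        observation = tools[name](arg)
        history.append((name, arg, observation))
    return None


def scripted_policy(task: str, history: List[Step]) -> Tuple[str, str]:
    """Stand-in for the orchestrator model: fixed rules, no LLM call."""
    if not history:
        return ("explore", task)
    last_name, _, last_obs = history[-1]
    if last_name == "explore":
        return ("patch", "attempt-1")
    if last_name == "patch":
        return ("test", "")
    if last_name == "test" and last_obs == "fail":
        # Self-correct: try a new patch after a failing test run.
        attempts = sum(1 for name, _, _ in history if name == "patch")
        return ("patch", f"attempt-{attempts + 1}")
    # Tests passed: deliver the patch applied just before the test step.
    return ("finish", history[-2][1])


def make_toy_tools() -> Dict[str, Tool]:
    """Toy tools: the 'tests' pass only once the second patch attempt is applied."""
    state = {"patch": ""}

    def explore(query: str) -> str:
        return f"files relevant to: {query}"

    def patch(label: str) -> str:
        state["patch"] = label
        return f"applied {label}"

    def test(_: str) -> str:
        return "pass" if state["patch"] == "attempt-2" else "fail"

    return {"explore": explore, "patch": patch, "test": test}


if __name__ == "__main__":
    result = run_agent("fix failing test", scripted_policy, make_toy_tools())
    print(result)  # attempt-2
```

In this toy run the first patch fails the test, the loop retries, and the second patch is returned as the single final solution. A real agent would replace `scripted_policy` with a model call and the stubs with actual exploration, editing, and test tools.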

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
