frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Wifärt Gallery

https://wifartgallery.com/
1•jasoncartwright•2m ago•0 comments

Good Design Disappears

https://hari.computer/good-design-disappears
1•andytratt•4m ago•0 comments

DVD-JEPA – a JEPA world model that dreams a bouncing DVD logo

https://dvd-jepa.vercel.app
1•mandarwagh•8m ago•0 comments

Cleve Moler (Developer of Matlab) Dies at 86

https://www.nytimes.com/2026/06/11/science/cleve-moler-dead.html
1•atan2•11m ago•0 comments

You Can Make Free Money on Polymarket. If You Know Math

https://www.nytimes.com/interactive/2026/06/12/upshot/kalshi-polymarket-prediction-markets-arbitr...
2•cainxinth•13m ago•0 comments

Introduction to the experience of rendering Arabic typography&its technical debt

https://lr0.org/blog/p/arabic/
2•bookofjoe•17m ago•0 comments

Show HN: Untyped – Voice to Email

https://play.google.com/store/apps/details?id=com.cronenka.untyped&hl=en_US
1•Cronenka•18m ago•0 comments

Resetting the Immune System

https://www.bbc.co.uk/news/articles/c4gy2d9y5z3o
1•pppone•20m ago•0 comments

Fedora 45 Considering a Lightened Grub Bootloader for Confidential Compute

https://www.phoronix.com/news/Fedora-45-Light-GRUB-For-CoCo
2•Bender•22m ago•0 comments

Nearly Everyone, Everywhere, Veers Left When Walking

https://www.nytimes.com/2026/06/10/science/humans-walking-veer-left-counterclockwise.html
2•mhb•23m ago•0 comments

Why do we need AI in project management software

https://evergantt.com/blog/2026-ai-doesnt-need-to-be-in-everything/
2•lb_john•23m ago•0 comments

Ask HN: What is the biggest pain of shipping improved versions of agents safely

1•prashar32•25m ago•0 comments

Show HN: AgentNexus – coordinate LLM agents by service boundary, not role

https://github.com/dugubuyan/agent-nexus
4•dugubuyan•33m ago•0 comments

Thoughts on AI and Jobs

https://blog.keyvan.net/p/thoughts-on-ai-and-jobs
12•k1m•36m ago•11 comments

An Interview with Intel's Kira Boyko: Xeon 6's Product Director

https://chipsandcheese.com/p/an-interview-with-intels-kira-boyko
6•lumpa•38m ago•0 comments

Show HN: A fully native offline location based music journal app app

https://apps.apple.com/us/app/reverie-fm/id6777534020
3•jeff-edmondson•38m ago•0 comments

DOE wants to build a single national platform for doing science with AI

https://cacm.acm.org/news/from-manhattan-to-genesis/
1•pseudolus•39m ago•0 comments

Sam Bankman-Fried loses bid to appeal against fraud conviction in FTX case

https://www.theguardian.com/business/2026/jun/12/sam-bankman-fried-loses-appeal
4•pseudolus•40m ago•0 comments

Nuclear clocks tick for the first time

https://phys.org/news/2026-06-nuclear-clocks.html
3•haeseong•42m ago•0 comments

Yann LeCun: World Models: Enabling the Next AI Revolution

https://www.youtube.com/watch?v=72Xj8k5WQX4
2•root-parent•44m ago•0 comments

Claude Corps

https://www.anthropic.com/claude-corps
1•hmokiguess•45m ago•1 comments

Show HN: 2 Weeks of Hallucinate – The Photo Gallery

https://hallucinate.site/gallery
22•stagas•46m ago•5 comments

AI OSS tool repo goes archived over night after raising $7.3M Seed

https://github.com/tensorzero/tensorzero
2•hek2sch•47m ago•1 comments

Implementing dark mode with standard CSS

https://olliewilliams.xyz/blog/dark-mode/
2•llcooliovice•49m ago•0 comments

Apple Lisa Emulator in Rust/WebAssembly

https://old.reddit.com/r/ClaudeCode/comments/1u4opmn/apple_lisa_emulator_in_rust_and_webassembly_...
2•adam_jesion•50m ago•0 comments

0xSero on X: "Open Source must win

https://twitter.com/0xSero/status/2035022588439581076
1•bilsbie•50m ago•0 comments

The Telematico NMS3000

https://celso.io/posts/2026/06/13/telematico/
1•celso•54m ago•0 comments

Tearing into ChatGPT's Container Environment

https://pncnmnp.github.io/blogs/chatgpt-containers.html
3•pncnmnp•55m ago•0 comments

Cerebras chips rival Nvidia GPUs for AI [video]

https://www.youtube.com/watch?v=qC_lCFTOJU0
1•binyu•56m ago•0 comments

SkipBeat – Uptime monitoring with Telegram alerts for indie developers

https://skipbeat.dev/
1•pavan_g_dev•59m ago•0 comments