frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Robots that spare warehouse workers the heavy lifting

https://news.mit.edu/2025/robots-spare-warehouse-workers-heavy-lifting-1205
1•meysamazad•11m ago•0 comments

Dockge: Self-hosted – Docker compose.yaml – Stack-oriented Manager

https://dockge.kuma.pet/
1•thunderbong•21m ago•0 comments

US regulators open Tesla probe after reports of children trapped in cars

https://www.bbc.com/news/articles/c203q2ywn88o
2•belter•22m ago•0 comments

Memory research: How respiration shapes remembering

https://www.lmu.de/en/newsroom/news-overview/news/memory-research-how-respiration-shapes-remember...
1•XzetaU8•24m ago•0 comments

Ask HN: A dumb game of dimensional analysis

1•egoism•31m ago•0 comments

Opus 4.5 Review (Custom Plan)

1•tactics6655•33m ago•0 comments

Cloudflare Has Blocked 416B AI Bot Requests Since July 1

https://www.wired.com/story/big-interview-event-matthew-prince-cloudflare/
3•aspenmayer•39m ago•2 comments

A Quiet Chinese Mobile Giant in Africa [video]

https://www.youtube.com/watch?v=PiXEJ6qe_Cg
1•mgh2•39m ago•0 comments

U.S. Unauthorized Immigrant Population Reached a Record 14M in 2023

https://www.pewresearch.org/race-and-ethnicity/2025/08/21/u-s-unauthorized-immigrant-population-r...
2•frasermarlow•40m ago•0 comments

Pete Hegseth is unfit to lead The Pentagon

https://www.ft.com/content/5dd72971-79e9-4ac8-8f2d-e230fb25c3b9
3•petethomas•42m ago•0 comments

The story of Mr DeepFakes – the world’s most notorious AI porn site

https://www.theguardian.com/society/ng-interactive/2025/dec/05/it-was-about-degrading-someone-com...
2•c420•43m ago•1 comments

Twine, the Video-Game Technology for All (2014)

https://www.nytimes.com/2014/11/23/magazine/twine-the-video-game-technology-for-all.html
2•sogen•49m ago•0 comments

CME Outage Shows Challenge of Keeping Data Centers Cool

https://www.bloomberg.com/news/articles/2025-11-28/cme-outage-how-are-data-centers-cooled-what-ha...
1•gmays•49m ago•0 comments

AI Predictions for 2026

https://www.aithings.dev/blog/2026-ai-predictions
1•irere123•50m ago•0 comments

Volcanic eruption might have helped bring the Black Plague to Europe

https://www.sciencenews.org/article/volcanic-eruption-black-plague-europe
2•mzs•50m ago•1 comments

PromptPwnd: Prompt Injection Vulnerabilities in GitHub Actions Using AI Agents

https://www.aikido.dev/blog/promptpwnd-github-actions-ai-agents
2•devy•52m ago•1 comments

Civic Nationalism

https://en.wikipedia.org/wiki/Civic_nationalism
1•CGMthrowaway•59m ago•0 comments

The economics of Pantone and its colours

https://finshots.in/archive/the-economics-of-pantone-color-of-the-year-cloud-dancer/
1•vismit2000•1h ago•0 comments

November CVEs Fell 25% YoY, Driven by Slowdowns at Major CNAs

https://socket.dev/blog/november-cves-fell-25-yoy-driven-by-slowdowns-at-major-cnas
1•feross•1h ago•0 comments

Reverse Benchmarking

https://www.dominiknitsch.com/reverse-benchmarking/
1•wseqyrku•1h ago•0 comments

On the trail of Borneo's bay cat, one of the most mysterious felines

https://news.mongabay.com/2024/04/on-the-trail-of-borneos-bay-cat-one-of-the-worlds-most-mysterio...
2•thunderbong•1h ago•0 comments

Intellivision Sprint by Atari

https://atari.com/products/intellivision-sprint
2•evo_9•1h ago•0 comments

QtkTest: Go-To Human Benchmark Tool

https://qtktest.com/
1•yimiqidage001•1h ago•0 comments

Patents and Open Source: Understanding the Risks and Available Solutions

https://opensource.org/blog/patents-and-open-source-understanding-the-risks-and-available-solutio...
1•gslin•1h ago•0 comments

How to speed up the Rust compiler in December 2025

https://nnethercote.github.io/2025/12/05/how-to-speed-up-the-rust-compiler-in-december-2025.html
3•todsacerdoti•1h ago•0 comments

What I Learned from Vibe-Coding Auth with AI

https://fusionauth.io/blog/vibe-coding-authentication
2•mooreds•1h ago•0 comments

Trustworthy software through non-profits?

https://www.more-magic.net/posts/trustworthy-software-through-non-profits.html
2•sjamaan•1h ago•0 comments

Show HN: Who is hiring" search tool with chat / other features

https://nthesis.ai/public/hn-who-is-hiring
2•osigurdson•1h ago•0 comments

Speed vs. Safety: Building developer experience in a MedTech startup

https://bradleybeddoes.com/posts/building-developer-experience-in-medtech
1•vedlin•1h ago•0 comments

Walks in Rotation Spaces Return Home When Doubled and Scaled

https://arxiv.org/abs/2502.14367
1•nomilk•1h ago•1 comments