frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: Securing Docker Builds

https://github.com/avkcode/buildkit-nsjail-sandbox-blog
1•KyleVlaros•1m ago•0 comments

Show HN: AI in SolidWorks

https://www.trylad.com
2•WillNickols•3m ago•0 comments

Show HN: Woid – 3x Faster Runtime Polymorphism. C++23

https://github.com/akopich/woid
1•akopich•4m ago•0 comments

Chesspire: "Slay the Spire" but for Chess

https://lykrast.com/chesspire
1•gaws•6m ago•0 comments

What If ASI Leads to Stasis?

https://thinking.luhar.org/2026/01/what-if-asi-leads-to-stasis/
1•rluhar•6m ago•1 comments

Enterprise Integration Patterns: Process Manager

https://james-carr.org/posts/2026-01-05-advent-of-eip-day-10-process-manager/
1•carrja99•6m ago•0 comments

Pwning Claude Code in 8 Different Ways

https://flatt.tech/research/posts/pwning-claude-code-in-8-different-ways/
1•kschaul•6m ago•0 comments

Cursor vs. antigravity after a week of real use

1•okaris•8m ago•0 comments

Show HN: Senlo - self-hosted open-source email management system

https://github.com/IgorFilippov3/senlo
1•igorfilippov3•9m ago•0 comments

The truth behind the 2026 J.P. Morgan Healthcare Conference

https://www.owlposting.com/p/the-truth-behind-the-2026-jp-morgan
1•crescit_eundo•10m ago•0 comments

PutHouse – Earn income automatically with risk management built-in

https://puthouse.com
1•jansonlau•11m ago•0 comments

Apple Foundation Models will be based on Gemini

https://blog.google/company-news/inside-google/company-announcements/joint-statement-google-apple/
2•spott•12m ago•0 comments

Deft: A new replacement for Clojure objects using plain maps

https://github.com/sstraust/deft
2•sammy0910•13m ago•1 comments

Framework: Memory and Storage Pricing Updates

https://frame.work/at/en/blog/in-stock-on-framework-desktop-and-updates-on-the-industry-wide-sili...
3•tosh•13m ago•0 comments

I spent my winter break teaching an LLM to play Diplomacy with RL

https://www.benglickenhaus.com/blog/diplomacy_rl_part_1
1•bglick13•14m ago•0 comments

Forget about crop diseases and try this

https://apps.apple.com/us/app/agrisense-field/id6738309189
1•dasorto•15m ago•0 comments

Yellowstone Bison Herd

https://en.wikipedia.org/wiki/Yellowstone_bison_herd
1•thunderbong•16m ago•0 comments

Show HN: Image0.dev – image tools that run in the browser

https://image0.dev/
1•ayushpawar•16m ago•0 comments

Telegram recovery model allows permanent lockout after phishing

https://bugs.telegram.org/c/58477
6•saloed•17m ago•1 comments

1X World Model – From Video to Action: A New Way Robots Learn

https://www.1x.tech/discover/world-model-self-learning
2•yusufozkan•17m ago•0 comments

Apple picks Google's Gemini AI for its big Siri upgrade

https://www.theverge.com/news/860521/apple-siri-google-gemini-ai-personalization
3•erex78•18m ago•1 comments

All the rovers heading to the Moon over the next 10 years

https://jatan.space/moon-monday-issue-256/
1•freediver•19m ago•0 comments

Mac OLM File to PST Converter"

https://apps.microsoft.com/detail/9n7jk7z3546j?hl=en-US&gl=US
1•tieanderson•20m ago•1 comments

iPhone 4 is having a TikTok revival

https://appleinsider.com/inside/iphone/tips/iphone-4-is-having-a-tiktok-revival-heres-how-to-use-...
1•ksec•20m ago•0 comments

They Want You to "Quit Demonstrating"

https://www.motherjones.com/politics/2026/01/trump-renee-good-ice-roger-williams-wesley-hunt-firs...
1•wahnfrieden•20m ago•0 comments

In Defense of the New York City Transit Strike

https://jacobin.com/2026/01/nyc-2005-twu-strike-toussaint
1•wahnfrieden•21m ago•0 comments

Pi Monorepo: AI agent toolkit

https://github.com/badlogic/pi-mono
2•pretext•21m ago•0 comments

How to Build a Habit

https://dogdogfish.com/blog/2026/01/12/building-a-habit/
1•matthewsharpe3•22m ago•0 comments

Show HN: Java In-Memory search using Forage

https://livetheoogway.github.io/forage/
2•tusharnaik•23m ago•0 comments

Kavia AI now supports Bitbucket (agent-driven code analysis and regression diff)

https://www.youtube.com/watch?v=r3la8vo_G0E
1•kavitha_kavia•24m ago•1 comments