frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Morgan Stanley predicts AI won't let you retire early

https://fortune.com/2026/02/26/morgan-stanley-predicts-ai-wont-let-you-retire-early-instead-youll...
1•geox•39s ago•0 comments

Show HN: Yaw: terminal, SSH/database connections, AI Chat and optimized AI CLI

https://yaw.sh
1•tkjef•51s ago•0 comments

From Skill.md to Shell: A Security Audit Guide for AI Agent Skills

https://aguarascan.com/blog/skill-md-security-audit-guide/
1•garagon•1m ago•0 comments

Platforms Not Applications

http://ngnghm.github.io/blog/2015/12/25/chapter-7-platforms-not-applications/
1•Antibabelic•1m ago•0 comments

NASA's Webb Telescope Locates Former Star That Exploded as Supernova

https://science.nasa.gov/missions/webb/nasas-webb-telescope-locates-former-star-that-exploded-as-...
1•bookofjoe•2m ago•0 comments

I built a personal AI assistant with security as a first-class citizen

https://www.provos.org/p/ironcurtain-secure-personal-assistant/
1•nielsprovos•2m ago•1 comments

Chaldean Lessons

https://learnchaldean.org/chaldean-neo-aramaic-lessons/
1•marysminefnuf•4m ago•0 comments

Two New Functions (=Sheet, =Sheets) in Google Sheets

https://workspaceupdates.googleblog.com/2026/02/two-new-functions-in-google-sheets.html
1•gnabgib•5m ago•0 comments

OpenClaw Is the Right Idea with the Wrong Defaults

https://simpleminded.bot/blog/openclaw-security-defaults
1•infotropy•6m ago•1 comments

Nano Banana 2 – Free AI Image Generator and Editor – Gemini

https://nan0banana2.com
1•AI_kid1412•6m ago•1 comments

China developed by defying free trade – not embracing it

https://www.theglobalcurrents.com/p/china-developed-by-defying-free-trade
2•atomicnature•6m ago•0 comments

The Remote-Work Dream Isn't Dead, but It's Slipping Away

https://www.wsj.com/lifestyle/careers/the-remote-work-dream-isnt-dead-but-its-slipping-away-a19ae9e8
2•apparent•8m ago•0 comments

Can you survive on Mars? What science fiction gets wrong

https://www.space.com/entertainment/space-movies-shows/can-you-really-survive-on-mars-what-scienc...
1•macbookaries•9m ago•0 comments

Red Dwarf Creator Rob Grant Has Died

https://www.beyondthejoke.co.uk/content/17193/red-dwarf-rob-grant
1•rwmj•10m ago•1 comments

What Is Limerence?

https://livingwithlimerence.com/what-is-limerence/
1•dsego•11m ago•0 comments

How visible is your app to AI?

https://www.llmcheck.app/ai-visibility/
1•sansreal•11m ago•0 comments

Show HN: OpenBrowserClaw: Browser-native Claw assistant, zero infrastructure

https://www.openbrowserclaw.com/
1•sachaa•13m ago•0 comments

Show HN: Blazor Developer Tools: React DevTools-Style for Blazor

https://github.com/joe-gregory/blazor-devtools
1•joe-gregory•13m ago•0 comments

Ask HN: Why is Claude Code so much larger than Codex on Mac OS?

1•VWWHFSfQ•13m ago•0 comments

Fibermaxxing Is a Diet Trend Even Nutritionists Can Love

https://www.bloomberg.com/news/articles/2026-02-11/high-fiber-is-latest-health-trend-hitting-star...
1•brandonb•14m ago•0 comments

Show HN: Made First Android game using Codespaces and AI, now in AdMob purgatory

1•apocryphal•14m ago•0 comments

Welcome, Heterogeneous Intelligence

https://www.callosum.com/blog/welcome-heterogeneous-intelligence
1•jasondavies•16m ago•0 comments

Looks Like an Insider Bet on Aliens

https://www.theatlantic.com/technology/2026/02/kalshi-aliens-insider-trading/686144/
2•breve•17m ago•0 comments

Show HN: WeDoDev – SaaS development subscription for startups

https://www.wedodev.co/#pricing
2•AdHelpAI•18m ago•0 comments

iPhone and iPad Are First Consumer Devices Cleared for NATO Classified Data

https://www.macrumors.com/2026/02/26/nano-classified-data-iphone-ipad/
5•stalfosknight•18m ago•0 comments

Quo Vadis, LLM Benchmarks?

https://florianbrand.com/posts/benches-2026
3•Davidzheng•20m ago•0 comments

Nano Banana 2 Partially Passes the Seven-Legged Spider Test

https://will-keleher.com/posts/nano-banana-2-partially-passes-the-spider-test/
2•gcmeplz•22m ago•0 comments

FastFlowLM (FLM) – Unlock Ryzen AI NPUs

https://github.com/FastFlowLM/FastFlowLM
3•jakogut•25m ago•1 comments

Prepaid vs. Postpaid Mobile: The cost breakdown nobody talks about

2•huntsmans•25m ago•3 comments

Banks decline to finance LNG project in Papua New Guinea

https://news.mongabay.com/short-article/2026/02/banks-decline-to-finance-lng-project-in-papua-new...
3•PaulHoule•26m ago•0 comments