frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

LLM-generated skills work, if you generate them afterwards

https://www.seangoedecke.com/generate-skills-afterwards/
1•niraj-agarwal•33s ago•0 comments

R. A. T. A Upgrade

1•Allenboyy•1m ago•0 comments

Show HN: SharpSkill – AI-tool built to increase success in tech Interviews

https://sharpskill.fr/en
1•CocoZozo•3m ago•0 comments

Ask HN: Why is there a negative sentiment towards crypto?

1•millarh3•6m ago•0 comments

How the book review became book list slop

https://thebaffler.com/after-the-fact/list-and-shout-kiesling
3•wawayanda•6m ago•0 comments

Looking for Founding Engineers – Esbern

1•mochi_codes•7m ago•1 comments

Show HN: PhoneClaw – Replace a $20/mo VPS with an old Android phone

https://github.com/phoneclaw/phoneclaw
1•8dazo•9m ago•1 comments

Show HN: Piggzy – Turn your inbox into a universal 'Buy' button

https://www.piggzy.com
1•contactreddyk•11m ago•2 comments

Americans are unleashing their anger on food-delivery robots

https://www.economist.com/united-states/2026/02/16/americans-are-unleashing-their-anger-on-food-d...
3•andsoitis•13m ago•0 comments

Manus AI launched 24/7 Agent via Telegram and got suspended

https://www.testingcatalog.com/manus-ai-launched-24-7-agent-via-telegram-and-got-suspended/
1•gmays•14m ago•0 comments

Google is killing authentic websites and I made it worse [video]

https://www.youtube.com/watch?v=II2QF9JwtLc
1•dataflow•18m ago•0 comments

XSDR, a single-sided M.2 software-defined radio with 2×RX/TX up to 3.8 GHz

https://www.crowdsupply.com/wavelet-lab/xsdr
2•iamnothere•20m ago•0 comments

Show HN: Stop Losing LangGraph Progress to 429 Errors

https://www.ezthrottle.network/blog/stop-losing-langgraph-progress
1•rjpruitt16•21m ago•0 comments

Show HN: Purely Vibe Coded Asmongold Simulator

https://spirofloropoulos.com/asmongold_simulator/
1•spirodonfl•30m ago•0 comments

The Consequences of the Epstein Document Release Start to Pile Up

https://www.nationalreview.com/the-morning-jolt/the-consequences-of-the-epstein-document-release-...
7•petethomas•31m ago•1 comments

Major PC OEMs Reportedly Exploring Chinese CXMT Memory Amid Shortages

https://www.techpowerup.com/346035/major-pc-oems-reportedly-exploring-chinese-cxmt-memory-amid-sh...
3•walterbell•31m ago•0 comments

Agent-evals: Overlap, boundary, and metacognitive scoring for coding agents

https://thinkwright.ai/agent-evals
1•oceanwaves•32m ago•1 comments

Why Affordability and the Vibecession Are Real Economic Problems

https://newsletter.mikekonczal.com/p/why-affordability-and-the-vibecession
2•NomNew•33m ago•0 comments

Hard Drive Prices Unexpectedly Rise in 2026

https://gettingwin.com/industry-information/592.html
3•AndrejXY•34m ago•0 comments

Manage Your Dotfiles with Stow

https://www.gnu.org/software/stow/manual/stow.html
1•ddtaylor•37m ago•0 comments

Show HN: The first financial intelligence MCP server live trading signals Claude

https://web-production-71423.up.railway.app/mcp-server
1•Shmungus•42m ago•0 comments

Show HN: Forage – MCP server that lets AI agents find and install their own MCPs

https://github.com/isaac-levine/forage
1•DoomedWheel1027•43m ago•1 comments

AI as Exoskeleton

https://clabs.org/blog/AiAsExoskeleton
2•the_chrismo•46m ago•1 comments

A.I. Salaries Are Causing Couples to Rethink Money in Relationships

https://www.nytimes.com/2026/02/14/business/artificial-intelligence-relationships-income-gap.html
3•mooreds•50m ago•1 comments

Sub-second volumetric 3D printing by synthesis of holographic light fields

https://www.nature.com/articles/s41586-026-10114-5
4•westurner•53m ago•0 comments

EU bans AI use on government work devices

https://www.neowin.net/news/eu-parliament-bans-ai-use-on-government-work-devices/
4•bundie•54m ago•1 comments

Filkoll – The fastest command-not-found handler (2025)

https://vorpal.se/posts/2025/mar/25/filkoll-the-fastest-command-not-found-handler/
1•crispinh•55m ago•0 comments

The Death of Traditional Testing

https://engineering.fb.com/2026/02/11/developer-tools/the-death-of-traditional-testing-agentic-de...
1•manveerc•58m ago•0 comments

Apple Begins Testing End-to-End Encryption for RCS Messages in iOS 26.4 Beta

https://www.macrumors.com/2026/02/16/ios-26-4-rcs-encryption-testing/
6•contact9879•59m ago•0 comments

Meta is wrong to try to sneak into facial recognition with Ray-Ban glasses

https://www.bloomberg.com/opinion/articles/2026-02-16/meta-is-wrong-to-try-to-sneak-into-facial-r...
4•socialcommenter•1h ago•4 comments