Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
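
To make the plan / execute / test / self-correct loop described above concrete, here is a minimal sketch in Python. It is not Refact.ai's code: every name in it (`Step`, `AgentRun`, `scripted_orchestrator`, the stub tools) is hypothetical, the orchestrator is a hard-coded script standing in for a model, and the tools return canned strings. It only illustrates the shape of a single multi-step run that ends in one final patch.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Step:
    tool: str       # name of the tool to call, or "finish" to stop
    argument: str   # a single string argument, kept simple for the sketch


@dataclass
class AgentRun:
    tools: Dict[str, Callable[[str], str]]
    choose_step: Callable[[List[str]], Step]  # stands in for the orchestrator model
    max_steps: int = 10
    history: List[str] = field(default_factory=list)

    def run(self, task: str) -> str:
        """Run one multi-step attempt and return a single final answer."""
        self.history.append(f"task: {task}")
        for _ in range(self.max_steps):
            step = self.choose_step(self.history)
            if step.tool == "finish":
                return step.argument
            result = self.tools[step.tool](step.argument)
            self.history.append(f"{step.tool}({step.argument}) -> {result}")
        raise RuntimeError("step budget exhausted without a final patch")


def scripted_orchestrator(history: List[str]) -> Step:
    """Hard-coded policy (assumption): a real agent would ask a model here."""
    last = history[-1]
    if last.startswith("task:"):
        return Step("explore", "src/")
    if last.startswith("explore("):
        return Step("deep_analysis", "why does the test fail?")
    if last.startswith("deep_analysis("):
        return Step("edit", "first attempt")
    if last.startswith("edit("):
        return Step("run_tests", "")
    if last.startswith("run_tests(") and last.endswith("failed"):
        return Step("edit", "second attempt")  # self-correct after a failure
    return Step("finish", "patch.diff")


def make_test_runner() -> Callable[[str], str]:
    """Stub test tool that fails on the first call and passes afterwards."""
    calls = {"n": 0}

    def run_tests(_: str) -> str:
        calls["n"] += 1
        return "failed" if calls["n"] == 1 else "passed"

    return run_tests


def run_demo() -> Tuple[str, List[str]]:
    tools = {
        "explore": lambda path: "found bug.py",
        "deep_analysis": lambda question: "off-by-one in loop bound",
        "edit": lambda note: "applied",
        "run_tests": make_test_runner(),
    }
    agent = AgentRun(tools=tools, choose_step=scripted_orchestrator)
    result = agent.run("fix the failing test")
    return result, agent.history


if __name__ == "__main__":
    final, log = run_demo()
    print("\n".join(log))
    print("final:", final)
```

In this toy run the first test call fails, the scripted policy edits again, the second test call passes, and the loop returns one patch. The step budget (`max_steps`) is an assumption of the sketch, not something stated in the post.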

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

The Perceptron

https://blog.engora.com/2026/02/the-perceptron.html
1•Vermin2000•1m ago•0 comments

Show HN: Open Benchmarks Grants– a $3M commitment to close the AI eval gap

https://benchmarks.snorkel.ai/closing-the-evaluation-gap-in-agentic-ai/
1•vincentschen•1m ago•0 comments

Self-Employment: Ownership Is Not Freedom

http://charleshughsmith.blogspot.com/2026/02/self-employment-series-2-ownership-is.html
2•dxs•2m ago•0 comments

My WordPress

https://my.wordpress.net
1•GavinAnderegg•2m ago•0 comments

Polymarket to Offer Attention Markets

https://www.forbes.com/sites/aliciapark/2026/02/10/polymarket-to-offer-attention-markets-in-partn...
1•m-hodges•2m ago•0 comments

Show HN: Project Promo – startup and project promotion platform

https://project.promo/
2•AlesBeg•3m ago•0 comments

Decapod: A local, daemonless control plane for AI agents

https://github.com/DecapodLabs/decapod
1•alexhr•3m ago•1 comments

How I built Fluxer, a Discord-like chat app

https://blog.fluxer.app/how-i-built-fluxer-a-discord-like-chat-app/
1•pr337h4m•6m ago•0 comments

Are ads the only way to scale AI to mainstream users?

https://nanonets.com/blog/openai-ads-vs-claude-real-fight-is-business-model/
1•nobsagents•6m ago•0 comments

The LLM Context Tax: Best Tips for Tax Avoidance

https://www.nicolasbustamante.com/p/the-llm-context-tax-best-tips-for
1•nbstme•7m ago•0 comments

Linux 7.0 Brings an EFI Framebuffer Quirk for Valve's Steam Deck

https://www.phoronix.com/news/Linux-7.0-EFI
3•Bender•8m ago•0 comments

Supercomputer simulations test turbulence theories at 35T grid points

https://phys.org/news/2026-02-supercomputer-simulations-turbulence-theories-trillion.html
2•mikhael•8m ago•0 comments

Add voice support for terminal coding assistants on Apple Silicon

https://github.com/shreyaskarnik/voice-mcp
1•shreyask•10m ago•1 comments

Geoff's Projects – ASCII Video Terminal

https://geoffg.net/terminal.html
2•rbanffy•10m ago•0 comments

Ask HN: Freelance Dev Available – Discord Bots, Web Scraping, GitHub Automation

1•deepakbot•12m ago•0 comments

Majutsu, Magit for Jujutsu

https://github.com/0WD0/majutsu
2•todsacerdoti•13m ago•0 comments

Evidence for the earliest hominin use of wooden handheld tools found in Greece

https://www.pnas.org/doi/10.1073/pnas.2515479123
1•bikenaga•14m ago•1 comments

Writing a Lisp JIT Interpreter with GraalVM Truffle

https://kyo.iroiro.party/en/posts/emacs-lisp-interpreter-with-graalvm-truffle/
1•PaulHoule•15m ago•0 comments

macOS Tahoe 26.3

https://www.macrumors.com/2026/02/11/apple-releases-macos-tahoe-26-3/
2•tosh•16m ago•0 comments

iOS 26.3

https://www.macrumors.com/2026/02/11/apple-releases-ios-26-3-and-ipados-26-3/
2•tosh•16m ago•0 comments

Chrome 146 Now in Beta with WebNN Origin Trial for Neural Networks in Browser

https://www.phoronix.com/news/Chrome-146-Beta
1•Bender•16m ago•0 comments

Preparing Your Website for LLMs

https://www.speakeasy.com/blog/prepare-your-website-for-llms
2•ndimares•17m ago•0 comments

The $6 Bug

https://campedersen.com/idle
1•ecto•17m ago•0 comments

Show HN: Open-source monitoring for AI agents (MCP-compatible)

1•yohanpoul•19m ago•0 comments

ChatGPT: The "Are You Sure?" Problem

https://www.randalolson.com/2026/02/07/the-are-you-sure-problem-why-your-ai-keeps-changing-its-mind/
1•doener•19m ago•0 comments

How Did the FBI Get Nancy Guthrie's Nest Doorbell Footage?

https://lifehacker.com/tech/how-did-the-fbi-get-nancy-guthries-doorbell-footage
6•daft_pink•19m ago•2 comments

Reverse cicd with GitHub and self hosted Forgejo

https://gist.github.com/melezhik/5f3f482c38ed9ab59626cc19c6bbbada
1•melezhik•20m ago•1 comments

Hackable Software

https://blog.abdellatif.io/hackable-software
1•tifa2up•20m ago•0 comments

Ask HN: If agentic AI is the future, why is every startup shipping a dashboard?

1•ATechGuy•22m ago•0 comments

Winter Olympic athletes are rightfully taking Covid-19 precautions

https://thesicktimes.org/2026/02/10/winter-olympic-athletes-are-rightfully-taking-covid-19-precau...
2•DustinEchoes•23m ago•0 comments