frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Beautiful Food Art Creator

https://yumoo.vercel.app/
1•amangousa•1m ago•0 comments

JetStream 3: A modern benchmark for high-performance, compute-intensive Web apps

https://blog.chromium.org/2026/03/jetstream-3-a-modern-benchmark.html
1•robin_reala•1m ago•0 comments

Paper Review: LeWorldModel

https://twitter.com/ID_AA_Carmack/status/2039046172799578122
1•tosh•1m ago•0 comments

DevOps Agent: Clanker CLI

https://github.com/bgdnvk/clanker
1•tekbog•2m ago•0 comments

I Saw Something New in San Francisco

https://www.nytimes.com/2026/03/29/opinion/ai-claude-chatgpt-gemini-mcluhan.html
1•platzhirsch•5m ago•1 comments

Personified Agentic Software

https://github.com/NascentCore/personified_agentic_repository
1•agenticsoftware•5m ago•1 comments

Biker gangs and hired hands: how Iran is increasingly outsourcing its terrorism

https://www.theguardian.com/world/2026/mar/27/golders-green-ambulances-firebomb-iran-involvement-...
1•pinewurst•6m ago•0 comments

Quote #75514

https://mako.cc/copyrighteous/quote-75514
1•jruohonen•10m ago•0 comments

Show HN: Browserbeam – a browser API built for AI agents

https://browserbeam.com/
1•nyku•10m ago•0 comments

You can generate flowcharts through AI chat now

1•allen2peace•11m ago•0 comments

Perplexity AI Machine Accused of Sharing Data with Meta, Google

https://www.bloomberg.com/news/articles/2026-04-01/perplexity-ai-machine-accused-of-sharing-data-...
1•doctaj•14m ago•1 comments

Cacheless Browser

1•lukasfischer•15m ago•0 comments

Show HN: Calx – track and compile corrections humans make with AI agents

https://github.com/getcalx
1•spenceships•15m ago•0 comments

JPMorgan and Pimco Warn Bond Markets Miss Slowdown Risks

https://catenaa.com/markets/global-markets/bond-market-slowdown-risks/
1•malindasp•18m ago•0 comments

NASA's asteroid Bennu sample reveals a hidden chemical patchwork

https://www.sciencedaily.com/releases/2026/03/260331231739.htm
1•doctaj•20m ago•0 comments

After 40 years NEW mario glitch discovered [video]

https://www.youtube.com/watch?v=bNulp6cDqUU
1•bawolff•21m ago•0 comments

OnlyOffice suspends Nextcloud partnership over 'illegal' Euro-Office fork

https://www.neowin.net/news/onlyoffice-suspends-nextcloud-partnership-over-unapproved-euro-office...
1•bundie•22m ago•0 comments

We built a 60-page ERP knowledge base in 24 hours using AI

https://www.professionalslobby.com/news/erpedia-ai-knowledge-platform-launch
2•MerinJo•22m ago•0 comments

Russia goes after VPNs as 'great crackdown' gathers pace

https://www.yahoo.com/news/articles/russia-goes-vpns-great-crackdown-082317909.html
1•TMWNN•23m ago•0 comments

The p-Adic Numbers of Hensel (1938)

https://www.jstor.org/stable/2303739
2•measurablefunc•24m ago•0 comments

Improve Coding Agents' Performance with Gemini API Docs MCP and Agent Skills

https://blog.google/innovation-and-ai/technology/developers-tools/gemini-api-docsmcp-agent-skills/
1•doctaj•24m ago•0 comments

President signs order to restrict mailin ballots in likely unconstitutional move

https://www.theguardian.com/us-news/2026/mar/31/trump-executive-order-restrict-mail-in-ballots
1•Jimmc414•26m ago•0 comments

Adaptive Block-Scaled Data Types

https://arxiv.org/abs/2603.28765
1•matt_d•27m ago•0 comments

Show HN: ADBC for COBOL – modern database access meets 1959

https://columnar.tech/blog/adbc-cobol//
3•ianmcook•27m ago•0 comments

AC4A: Access Control for Agents

https://arxiv.org/abs/2603.20933
1•matt_d•28m ago•0 comments

ARM Makes Chips

https://thechipletter.substack.com/p/arm-makes-chips
2•oopsiremembered•30m ago•0 comments

InferenceFS

https://github.com/philipl/inferencefs
1•philzdev•36m ago•0 comments

Washington state's 'historic' millionaire tax takes aim at super-rich

https://www.theguardian.com/us-news/2026/mar/31/washington-state-millionaire-tax-wealth
3•MilnerRoute•40m ago•0 comments

Non-US founders residential address problem with Brex, Mercury?

3•Barazutti629•42m ago•0 comments

Mercor Hit by Cyberattack

https://techcrunch.com/2026/03/31/mercor-says-it-was-hit-by-cyberattack-tied-to-compromise-of-ope...
2•jackson-mcd•46m ago•0 comments