frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Military Snipers Are Being Put Out of a Job by Drones

https://www.wsj.com/world/europe/military-snipers-are-being-put-out-of-a-job-by-drones-ae85a271
1•Michelangelo11•1m ago•0 comments

A cheap fix that saves the AI $400M dollars a year and brings 4B people online

https://codecai.net/
1•Zombwaffle•5m ago•0 comments

SIMD, cache and CPU internals with the expert Daniel Lemire [video]

https://www.youtube.com/watch?v=gqdFvYeMW5o
1•tosh•7m ago•0 comments

Anthropic just admitted AI is bullshit [video]

https://www.youtube.com/watch?v=juHv_Vi4giU
1•kshri24•9m ago•0 comments

Privacy Policy Changelog

https://www.fsf.org/about/free-software-foundation-privacy-policy/privacy-policy-changelog
1•toluc•10m ago•0 comments

Show HN: PathFinder – Map every path to your goal, then execute it step by step

https://pathfinderofficial.vercel.app/
2•SidVikJay•11m ago•0 comments

Musk vs. Altman week 3: Elon Musk and Sam Altman traded blows over each other's

https://www.technologyreview.com/2026/05/15/1137357/musk-v-altman-week-3/
3•joozio•27m ago•0 comments

Palantir's SaaS is dead claim is a warning shot for founders

https://startupfortune.com/palantirs-saas-is-dead-claim-is-a-warning-shot-for-founders/
2•01-_-•29m ago•0 comments

The US Is Using AI to Hunt Down Insider Trading on Polymarket

https://www.wired.com/story/polymarket-insider-trading-cftc-michael-selig-interview/
3•01-_-•31m ago•1 comments

Heroes of Might and Magic: Olden Era

https://store.steampowered.com/app/3105440/Heroes_of_Might_and_Magic_Olden_Era/
5•doener•32m ago•0 comments

Old English Pronunciation: A Comprehensive Reconstruction [video]

https://www.youtube.com/watch?v=WNQo54Ddte8
1•hnlyman•35m ago•0 comments

Team-memory – your team's shared brain, auto-built from Claude Code CLI or UI

https://github.com/AndrewSkea/team-memory
1•aski_dev•39m ago•0 comments

Abseil Common Libraries (C++)

https://github.com/abseil/abseil-cpp
1•tosh•46m ago•0 comments

Gaussian Splatting for Dummies

https://darshanmakwana412.github.io/2026/04/gaussian-splatting/
1•martianvoid•47m ago•0 comments

AI Playground – Let AI agents play safely

https://gitlab.com/cryptomilk/ai-playground
1•cryptomilk•51m ago•1 comments

PyCon US 2026 Packaging Summit Recap

https://discuss.python.org/t/packaging-summit-at-pycon-us-2026/106911
1•gaborbernat•55m ago•1 comments

Show HN: KoalaNews – how big is this story, really?

https://koalanews.app
1•koala-news•1h ago•1 comments

AI-generated code is 'pain waiting to happen'

https://www.theregister.com/ai-ml/2026/05/16/ai-generated-code-is-pain-waiting-to-happen/5241574
5•abdelhousni•1h ago•0 comments

We Are All Rankers Now: Or Why the Internet Has Turned to Shit

https://grumpywelshman.com/we-are-all-rankers-now-or-why-the-internet-has-turned-to-shit/
4•dave-x•1h ago•0 comments

Base64 encoding and decoding at almost the speed of a memory copy

https://arxiv.org/abs/1910.05109
1•tosh•1h ago•0 comments

Voltaire, the Entrepreneur

https://www.linkandth.ink/p/voltaire-the-entrepreneur
2•helsinkiandrew•1h ago•0 comments

Mozilla to UK regulators: VPNs are essential privacy and security tools

https://blog.mozilla.org/netpolicy/2026/05/15/mozilla-to-uk-regulators-vpns-are-essential-privacy...
29•WithinReason•1h ago•1 comments

Killswitch: Add per-function short-circuit mitigation primitive

https://lore.kernel.org/all/20260507070547.2268452-1-sashal@kernel.org/
2•Tomte•1h ago•0 comments

The Applicability of Spaced Repetition

https://borretti.me/article/the-applicability-of-spaced-repetition
4•Tomte•1h ago•0 comments

Linux Latest Vulnerability Allows Reading Root-Owned Files by Unprivileged Users

https://www.phoronix.com/news/Linux-ssh-keysign-pwn
3•tjek•1h ago•0 comments

At Cannes, filmmakers shift toward cautious acceptance of AI

https://www.reuters.com/lifestyle/cannes-filmmakers-shift-towards-cautious-acceptance-ais-inevita...
3•sahar_builds•1h ago•0 comments

CAFleet – open-source Agent Teams reinvented, both for Claude Code and Codex

https://github.com/himkt/cafleet
2•himkt•1h ago•0 comments

The Uncomfortable Truth About AI "Reasoning"

https://www.youtube.com/watch?v=iFYF_e1GSGI
3•tcp_handshaker•1h ago•0 comments

TypedMemory – long-term memory and reflection for AI agents

https://github.com/canis-minor/typedmem
2•ruxiz•1h ago•0 comments

Should you move to Silicon Valley? [video]

https://www.youtube.com/watch?v=QHJkUw31YX8
3•nomilk•1h ago•1 comments