Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
