Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
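
To make the plan, execute, test, self-correct cycle concrete, here is a minimal toy sketch of such a loop. It is not Refact.ai's implementation: the tool functions, the AgentState type, and the scripted choose_tool() orchestrator are all hypothetical stand-ins (only the deep_analysis name is borrowed from the post), and no model is actually called. The first candidate patch is simulated to fail so that the retry path is visible.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class AgentState:
    """Hypothetical record of a single agent run."""
    task: str
    history: List[Tuple[str, str]] = field(default_factory=list)  # (tool, observation)
    attempts: int = 0
    patch: str = ""
    tests_pass: bool = False


# Hypothetical tools; real ones would touch a repository and call models.
def explore_repo(state: AgentState) -> str:
    return "located suspicious code"


def deep_analysis(state: AgentState) -> str:
    return "drafted a fix plan"


def modify_code(state: AgentState) -> str:
    state.attempts += 1
    state.patch = f"candidate patch #{state.attempts}"
    return "patch applied"


def run_tests(state: AgentState) -> str:
    # Simulation only: pretend the first candidate patch fails.
    state.tests_pass = state.attempts >= 2
    return "tests passed" if state.tests_pass else "tests failed"


TOOLS: Dict[str, Callable[[AgentState], str]] = {
    "explore_repo": explore_repo,
    "deep_analysis": deep_analysis,
    "modify_code": modify_code,
    "run_tests": run_tests,
}


def choose_tool(state: AgentState) -> str:
    """Scripted stand-in for the orchestrator model's next-tool decision."""
    used = [name for name, _ in state.history]
    if "explore_repo" not in used:
        return "explore_repo"
    if "deep_analysis" not in used:
        return "deep_analysis"
    # Self-correct: a failed test run sends the agent back to editing.
    if state.history[-1] == ("run_tests", "tests failed") or not state.patch:
        return "modify_code"
    return "run_tests"


def run_agent(task: str, max_steps: int = 10) -> AgentState:
    """One multi-step run that stops at the first patch passing the tests."""
    state = AgentState(task)
    for _ in range(max_steps):
        name = choose_tool(state)
        state.history.append((name, TOOLS[name](state)))
        if state.tests_pass:
            break
    return state
```

Calling run_agent("fix the failing test") walks through explore, analyze, modify, test, then modifies and tests once more after the simulated failure, and returns a single final patch. The max_steps cap is an assumed safeguard against an endless retry loop.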

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
