frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

macOS Notifications for Claude Code and AeroSpace

https://kulikalov.com/claude-code-aerospace-notifications/
1•kulikalov•2m ago•0 comments

Up There with Carnegie

https://superconnectorbook.com/
1•Chrisszz•2m ago•0 comments

Show HN: Deadline.email – a daily reminder that you'll die

https://deadline.email
1•onesandofgrain•3m ago•0 comments

The Government Blacklisted the Best AI. It Came Back with the Same Red Lines

https://liminaldr.substack.com/p/the-government-tried-to-blacklist
1•BlendedPanda•4m ago•1 comments

Ask HN: What's Your Daily Routine?

1•chistev•6m ago•0 comments

Show HN: Anya – Offline static malware analysis (Rust)

https://github.com/elementmerc/anya
1•ElementMerc•11m ago•0 comments

Anchormd – Generate AI coding agent context files from any GitHub repo

https://anchormd.dev
1•aretedriver•13m ago•0 comments

The LMAX Architecture

https://martinfowler.com/articles/lmax.html
1•tosh•19m ago•1 comments

Why insects aren't huge: a new challenge to a decades-old idea

https://www.nature.com/articles/d41586-026-00976-0
2•marojejian•21m ago•1 comments

Hardware Is Hard?

https://prdpx7.github.io/posts/hardware-is-hard/
2•prdpx7•21m ago•1 comments

Show HN: JSON-logic-path – JSON logic with jsonpath multi-value resolution

https://github.com/bayinfosys/json-logic-path
1•anax32•22m ago•0 comments

Corporate Profits Are at Record Highs. These 4 Factors Could Sink Them

https://www.nytimes.com/2026/04/18/business/dealbook/corporate-profits-record.html
2•jhonovich•22m ago•0 comments

Why Mechanical Sympathy? (2011)

https://mechanical-sympathy.blogspot.com/2011/07/why-mechanical-sympathy.html
1•tosh•24m ago•0 comments

Only Law Can Prevent Extinction

https://www.lesswrong.com/posts/5CfBDiQNg9upfipWk/only-law-can-prevent-extinction
2•namanyayg•24m ago•0 comments

How Long Can You Keep Peptides After Reconstitution?

https://lifeimprovementschemes.substack.com/p/how-long-can-you-keep-peptides-after
1•BenPace•24m ago•1 comments

The Fermi Paradox Is Nerdslop

https://monismos.substack.com/p/the-fermi-paradox-is-nerdslop
1•BenPace•24m ago•0 comments

I've Been Trying to Delay the Industrial Revolution (and I'm Failing)

https://lostfutures.substack.com/p/ive-been-trying-to-delay-the-industrial
1•BenPace•25m ago•0 comments

The intelligence illusion: why AI isn't as smart as it is made out to be

https://www.nature.com/articles/d41586-026-00882-5
1•gnabgib•25m ago•1 comments

Why Postgres wants NVMe on the hot path, and S3 everywhere else

https://thenewstack.io/postgres-nvme-s3-storage/
2•tanelpoder•25m ago•0 comments

Binary GCD

https://gmplib.org/manual/Binary-GCD
3•tosh•33m ago•0 comments

Young sons of legendary U.S. marshal ride horseback from Oklahoma to New York

https://texascooppower.com/the-astonishing-ride-of-the-abernathy-boys/
11•mhb•36m ago•1 comments

Thoughts and Feelings Around Claude Design

https://samhenri.gold/blog/20260418-claude-design/
2•cdrnsf•36m ago•0 comments

OpenAI Proposes a 'Social Contract' for the Intelligence Age

https://www.noemamag.com/openai-proposes-a-social-contract-for-the-intelligence-age/
1•Brajeshwar•37m ago•1 comments

Show HN: TTS.ai

https://tts.ai/
1•nadermx•37m ago•0 comments

My personal website – a start to my internet home

https://alexarias.me/
1•AlexArias•38m ago•0 comments

Vibe Genomics: Sequencing Your Whole Genome at Home

https://vibe-genomics.replit.app/
1•moozilla•38m ago•0 comments

Show HN: Trained a 12M transformer on an ML framework we built from scratch

https://github.com/mni-ml/framework
1•caliandbust•38m ago•0 comments

Trappsec – Deception as a Developer Tool

https://trappsec.dev
3•kyuradar•42m ago•1 comments

Open Source SaaS Is Dead, AI Killed It

https://nmn.gl/blog/open-source-killed-ai
2•namanyayg•42m ago•0 comments

Claude –dangerously-skip-permissions –model Claude-Opus-4-5-20251101

1•deofoo•44m ago•0 comments