frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Can I hear a difference between MP3s and uncompressed audio?

https://82mhz.net/posts/2026/03/can-i-hear-a-difference-between-mp3s-and-uncompressed-audio/
1•nomemory•2m ago•0 comments

Show HN: AI Roundtable – Let 200 models debate your question

https://opper.ai/ai-roundtable/
2•felix089•2m ago•0 comments

Global terrorism falls to a decade low but Western fatalities surge

https://www.visionofhumanity.org/global-terrorism-falls-to-a-decade-low-but-western-fatalities-su...
2•littlexsparkee•2m ago•0 comments

Show HN: Apfel - Apple Intelligence from the Command Line

https://github.com/Arthur-Ficial/apfel
1•franze•3m ago•0 comments

DeepResearch: Autonomous optimization beating greedy search by 28%

https://github.com/Cosmic-Game-studios/deepresearch
1•Cosmic_dev_ML•3m ago•0 comments

Multi-Architecture Continuous Testing at Google

https://hackthology.com/taming-the-variants-multi-architecture-continuous-testing-at-google.html
2•compiler-guy•4m ago•0 comments

How to activate the secret debug menu in The Adventures of Willy Beamish? [video]

https://www.youtube.com/watch?v=cYZMWYCSphM
1•zappatic•5m ago•0 comments

Localizee

https://apps.apple.com/us/app/localizee/id6759800223?mt=12
1•helaia•6m ago•0 comments

App Store for GitHub Releases

https://github.com/OpenHub-Store/GitHub-Store
1•synchrone•8m ago•0 comments

Prompt 3

https://panic.com/prompt/
2•bradley_taunt•8m ago•0 comments

Windows stack limit checking retrospective: ARM64, also known as AArch64

https://devblogs.microsoft.com/oldnewthing/20260320-00/?p=112154
1•ibobev•9m ago•0 comments

Event Schedule: Plan, Promote and Sell Tickets (Open-Source) [video]

https://www.youtube.com/watch?v=IL8Fj0p6Lz8
1•hillelcoren•9m ago•0 comments

Show HN: RoverBook – Moltbook for Your Website

https://github.com/rtrvr-ai/rover/tree/main/packages/roverbook
1•arjunchint•10m ago•0 comments

Solving Semantle with the Wrong Embeddings

https://victoriaritvo.com/blog/robust-semantle-solver/
1•evakhoury•12m ago•0 comments

Europe waged war on young people to pay for pensions

https://www.telegraph.co.uk/money/pensions/state-pensions/how-europe-went-to-war-with-its-pension...
3•throw0101d•12m ago•1 comments

World first: antimatter particles transported in Geneva

https://www.swissinfo.ch/eng/various/antimatter-particles-are-transported-in-geneva-for-the-first...
1•_____k•12m ago•0 comments

Claude Code: Auto Mode

https://twitter.com/claudeai/status/2036503582166393240
2•tosh•16m ago•0 comments

Fork You

https://www.humancode.us/2026/03/21/no-fork-you.html
2•speckx•17m ago•0 comments

Telling Your AI Agent It's an Expert Makes It Less Accurate

https://newclawtimes.com/articles/expert-persona-prompting-damages-llm-accuracy-prism-research/
2•alvivanco•17m ago•0 comments

Show HN: AWS for Idiots (webcomic)

https://awsforidiots.com
1•heythisischris•17m ago•0 comments

Gap says it will launch checkout within Google's Gemini

https://www.cnbc.com/2026/03/24/gap-google-gemini-checkout-ai-platform.html
1•johnbarron•18m ago•0 comments

How to tell when a potential freelancing client is delusional

https://b2bs.substack.com/p/op-note-the-5-habits-of-delusional
2•oopsiremembered•18m ago•0 comments

Show HN: TrailTool – open-source CLI for querying CloudTrail data with AI agents

https://github.com/engseclabs/trailtool
1•alexsmolen•18m ago•0 comments

Pascal – open 3D home design in the browser

https://editor.pascal.app
5•Rabot•19m ago•2 comments

Mezcal's popularity is booming in the US, with a growing env cost in MX

https://apnews.com/article/mezcal-mexico-environment-deforestation-agave-oaxaca-erosion-a53aa9d26...
1•littlexsparkee•19m ago•0 comments

Norway wealth fund moves towards some AI-driven decisions with humans in control

https://www.reuters.com/business/norway-wealth-fund-moves-towards-some-ai-driven-decisions-with-h...
1•_____k•20m ago•1 comments

"I'm talking to you with my mind"

https://twitter.com/neuralink/status/2036489073091580011
1•nailer•21m ago•0 comments

Asteroid Samples Found DNA's Full Chemical Alphabet in Space

https://modernengineeringmarvels.com/2026/03/23/asteroid-samples-found-dnas-full-chemical-alphabe...
1•Brajeshwar•22m ago•0 comments

Show HN: Clarity – An AI Slack coach for better work communication

https://clarity.rocktangle.com/
2•dhruvghulati•22m ago•0 comments

Ads on Apple Maps – Coming Soon

https://ads.apple.com/maps
2•dmitrygr•23m ago•1 comments