frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

The Hive Mind

https://jacquesmattheij.com/the-hive-mind/
1•BatFastard•21s ago•1 comments

Voice Agents Latency

https://substack.com/home/post/p-189696660
1•agentropy•2m ago•0 comments

Roblox Is Minting Teen Millionaires

https://www.bloomberg.com/news/articles/2026-03-06/roblox-s-teen-millionaires-are-disrupting-the-...
2•petethomas•8m ago•0 comments

Secure Snake Home (SSH)

https://snake.eieio.games
1•fratellobigio•9m ago•0 comments

How AI Is Turbocharging the War in Iran

https://www.wsj.com/tech/ai/how-ai-is-turbocharging-the-war-in-iran-aca59002
1•JumpCrisscross•13m ago•0 comments

Anthropic and The Pentagon

https://www.schneier.com/blog/archives/2026/03/anthropic-and-the-pentagon.htmll
1•benwen•14m ago•0 comments

British Columbia makes daylight saving time permanent

https://text.npr.org/nx-s1-5741076
1•bvanderveen•15m ago•0 comments

Will the U.S. confirm that aliens exist before 2027?

https://kalshi.com/markets/kxaliens/aliens/KXALIENS-27
1•pinkmuffinere•16m ago•0 comments

Metrics Make Us Miserable

https://www.derekthompson.org/p/how-metrics-make-us-miserable
1•gmays•17m ago•0 comments

Best Music Distributors in 2026

1•anonyxbiz•24m ago•0 comments

Pushing and Pulling: Three Reactivity Algorithms

https://jonathan-frere.com/posts/reactivity-algorithms/
1•frogulis•30m ago•0 comments

Science Fiction Is Dying. Long Live Post Sci-Fi?

https://www.typebarmagazine.com/science-fiction-is-dying-long-live-post-sci-fi/
3•KittenInABox•30m ago•0 comments

On the road to C4 rice: Advances and perspectives

https://onlinelibrary.wiley.com/doi/full/10.1111/tpj.14562
1•lawrenceyan•35m ago•0 comments

The Intelligence Monopoly Is Over

https://www.spatialintelligence.ai/p/the-intelligence-monopoly-is-over
1•beauzero•35m ago•1 comments

Why can't you just ask AI to find you a trading edge? You can now

https://github.com/augiemazza/varrd
1•varrd1•36m ago•1 comments

Cloud VM benchmarks 2026: performance/price for 44 VM types over 7 providers

https://devblog.ecuadors.net/cloud-vm-benchmarks-2026-performance-price-1i1m.html
7•dkechag•44m ago•0 comments

Human brain cells on a chip learned to play Doom in a week

https://www.newscientist.com/article/2517389-human-brain-cells-on-a-chip-learned-to-play-doom-in-...
3•doener•52m ago•0 comments

The San Francisco lunch that launched Silicon Valley 70 years ago

https://davidlaws.medium.com/the-san-francisco-lunch-that-launched-silicon-valley-70-years-ago-3b...
2•DavidLawsCHM•53m ago•0 comments

NexusMods (game modding application for Linux) code repo is now read-only

https://github.com/Nexus-Mods/NexusMods.App
1•wingmanjd•55m ago•1 comments

ClawPurse Micropayment Ecosystem

https://clawpurse.ai/
3•TheTikiCow•57m ago•0 comments

Ask HN: Last time you wrote code?

3•blinkbat•1h ago•1 comments

What's the deal with distributed SYN DOS attacks

2•xmddmx•1h ago•0 comments

PressPuzzler AI Crosswrod Puzzle Maker

https://presspuzzler.com/
1•aidevguy•1h ago•0 comments

Blocking a common brain gas reverses autism-like traits in mice

https://www.psypost.org/blocking-a-common-brain-gas-reverses-autism-like-traits-in-mice/
3•geox•1h ago•1 comments

MuJS: Lightweight JavaScript interpreter for embedding in other software

https://mujs.com
2•linkdd•1h ago•0 comments

I don't know if my job will still exist in ten years

https://www.seangoedecke.com/will-my-job-still-exist/
4•nomdep•1h ago•0 comments

AI Powered Exploit Kit

https://github.com/Ed1s0nZ/CyberStrikeAI
1•jwally•1h ago•0 comments

Hitchhiker's Guide to Hitchhiking

https://www.mikokacki.me/blog/hitchhikers-guide-to-hitchhiking
1•samiczy•1h ago•0 comments

Show HN: Scalisos – A privacy-first, ad-free passport photo layout tool

https://scalisos.com
2•theborat•1h ago•0 comments

My chief of staff, Claude Code

https://twitter.com/jimprosser/status/2029699731539255640
2•mji•1h ago•6 comments