frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Gizmodo breached, hosting malware and no action taken for hours

https://old.reddit.com/r/cybersecurity_help/comments/1uajrup/beware_gizmodo_captcha_malware_attempt/
1•mullingitover•4m ago•1 comments

Virtual whiteboard for sketching hand-drawn like diagrams

https://github.com/excalidraw/excalidraw
1•modinfo•10m ago•0 comments

Bun 1.4 coming July 7th

https://twitter.com/jarredsumner/status/2068059134952677887
1•sorenbs•13m ago•0 comments

Show HN: I wrote a Rust book ending with a Redis clone

https://shankhan3.gumroad.com/l/dnwmtp
1•zeeshanali0094•14m ago•0 comments

Ask HN: What's a simple app you'd build if you had a weekend?

1•akashwadhwani35•16m ago•0 comments

GenAIDojo

https://www.genaidojo.io/
1•aniketwattawmar•29m ago•0 comments

EAS Alarm at midnight caused by a hacker in Brazil

https://www.reddit.com/r/mildlyinfuriating/s/wLRv2s1u5k
2•davidkwast•30m ago•2 comments

Study: Conservatives are dying at higher rates than liberals

https://www.fastcompany.com/91561329/widening-health-gap-between-liberals-and-conservatives
1•MilnerRoute•33m ago•0 comments

The New Calculus of AI-Based Coding

https://blog.joemag.dev/2025/10/the-new-calculus-of-ai-based...
2•aabdi•38m ago•1 comments

Designing a backyard deck for my house

https://blog.cosmin.cloud/posts/diy-deck.html
1•spycraft•43m ago•0 comments

Compress tool outputs, logs, files, RAG chunks before LLM for 60-95% less tokens

https://github.com/chopratejas/headroom
1•gmays•43m ago•0 comments

Trapped in a Waymo: SF passenger recounts terrifying construction zone ordeal

https://abc7news.com/post/waymo-recall-san-francisco-man-recounts-terrifying-moment-when-robotaxi...
1•mikhael•46m ago•0 comments

Show HN: Stocks Porfolio in a URL and Favicon

https://stocksreader.com/
2•franciscop•46m ago•1 comments

Bureaulogy – formation, evolution, and perpetuation of bureaucracies

https://grantslatton.com/bureaulogy
1•thelastgallon•47m ago•1 comments

Chromium Embedded Framework (CEF)

https://github.com/chromiumembedded/cef
1•auraham•49m ago•0 comments

The Punctum and the Blind Field

https://theholbrookreport.com/reports/punctum-and-the-blind-field/
1•TyrunDemeg101•53m ago•0 comments

China will have a Fable 5-class AI model before next year

https://www.tomshardware.com/tech-industry/artificial-intelligence/elon-musk-says-that-china-will...
4•achow•56m ago•0 comments

TV Player for Rocket Launches

https://github.com/sighmon/SpaceX-TV
1•sighmon•1h ago•0 comments

We record because we have forgotten how to remember

https://pilgrimsage.substack.com/p/the-lowest-frequency
1•momentmaker•1h ago•0 comments

AI and the Great CMS Unbundling

https://dri.es/ai-and-the-great-cms-unbundling
2•christefano•1h ago•0 comments

This pole is worse than any Flock Camera [video][38 mins]

https://www.youtube.com/watch?v=wEfbhEVuvMM
1•Bender•1h ago•0 comments

Lawsuit against Amazon over suicides linked to chemical can go to trial

https://www.cbsnews.com/news/amazon-suicide-case-sodium-nitrite-washington-supreme-court/
2•hentrep•1h ago•0 comments

Satellite reveals immense scale of GPS signal tampering

https://www.space.com/space-exploration/satellites/its-quite-a-bit-more-than-we-expected-satellit...
8•y1n0•1h ago•0 comments

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

https://hustvl.github.io/Moebius/
2•ilreb•1h ago•0 comments

Spirit Crossing's AI Problem

https://blog.curiousquail.com/spirit-crossings-ai-problem/
1•zokiboy•1h ago•0 comments

A Practical Guide to SSH Tunnels: Local and Remote Port Forwarding

https://labs.iximiuz.com/tutorials/ssh-tunnels
1•signa11•1h ago•0 comments

How to lose a war in three easy steps

https://www.ft.com/content/0ba6b511-649e-4a29-bb99-f824584ec93d
9•Alien1Being•1h ago•1 comments

Show HN: Rundown - Niche Intelligence for YouTube Creators

https://getrundown.xyz
2•razasaad92•1h ago•0 comments

Show HN: AgentArk – open-source self-hosted AI agent OS

https://github.com/agentark-ai/AgentArk
2•debankad•1h ago•0 comments

Show HN: Western Highway Alerts

https://westernhighwayalerts.com/
1•BetaDeltaAlpha•1h ago•0 comments