frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Chernobyl at 40: Secret Stasi files reveal extent of Soviet misinformation

https://theconversation.com/chernobyl-at-40-secret-stasi-files-reveal-extent-of-soviet-misinforma...
1•1659447091•1m ago•0 comments

New Type of Neuroplasticity Rewires the Brain After a Single Experience

https://www.quantamagazine.org/a-new-type-of-neuroplasticity-rewires-the-brain-after-a-single-exp...
1•pseudolus•2m ago•0 comments

Start Blogging (Even If Nobody Will Read It)

https://nikola-breznjak.com/blog/devthink/why-you-should-start-blogging-even-if-nobody-will-read-it/
1•eigenBasis•6m ago•0 comments

Brazil Passes Landmark Law to Protect Children Online

https://www.hrw.org/news/2025/09/17/brazil-passes-landmark-law-to-protect-children-online
1•mooreds•8m ago•0 comments

GnuPG – post-quantum crypto landing in mainline

https://lists.gnupg.org/pipermail/gnupg-announce/2026q2/000504.html
1•zdkaster•9m ago•1 comments

Wedges and Control Points in Product Strategy

https://edwardhsu.substack.com/p/wedges-control-points-and-the-missing
1•rahimnathwani•17m ago•0 comments

Meetings Are Forcing Functions

https://www.mooreds.com/wordpress/archives/3734
1•zdw•22m ago•0 comments

The death of the American Dream is now official

https://thehill.com/opinion/finance/5846892-american-dream-debt-crisis/
8•Teever•23m ago•1 comments

Tell HN: Medvi (telehealth) hardcodes 999 patient emails in public JavaScript

1•g48ywsJk6w48•26m ago•0 comments

OWASP Top, Vibe Coding, and What Developers Miss with Tanya Janca [video]

https://www.youtube.com/watch?v=LSYkD-MKdmk
1•mooreds•27m ago•0 comments

How Meta used AI to map tribal knowledge in large-scale data pipelines

https://engineering.fb.com/2026/04/06/developer-tools/how-meta-used-ai-to-map-tribal-knowledge-in...
1•theorchid•28m ago•0 comments

Fruit Box

https://en.gamesaien.com/game/fruit_box/
2•downboots•31m ago•0 comments

Reviving Koken

https://www.bradleyboy.com/writings/reviving-koken/
1•bradleyboy•31m ago•0 comments

Discord Sleuths Gained Unauthorized Access to Anthropic's Mythos

https://www.wired.com/story/security-news-this-week-discord-sleuths-gained-unauthorized-access-to...
2•wyldfire•32m ago•0 comments

Starting with "Yes"

https://www.darthealth.com/blog/starting-with-yes
1•mooreds•32m ago•0 comments

The AI Compute Crunch Is Here (and It's Affecting the Economy)

https://www.404media.co/the-ai-compute-crunch-is-here-and-its-affecting-the-entire-economy/
2•gasull•38m ago•0 comments

Ukraine marks 40th anniversary of Chornobyl disaster under cloud of war

https://www.reuters.com/world/europe/ukraine-marks-40th-anniversary-chornobyl-disaster-under-clou...
4•onemoresoop•39m ago•0 comments

Harvard students call grading reform 'racist' in petition

https://www.campusreform.org/article/harvard-students-call-grading-reform-racist-petition/29761
3•ivewonyoung•46m ago•4 comments

LLMs Corrupt Your Documents When You Delegate

https://arxiv.org/abs/2604.15597
3•achrono•48m ago•1 comments

Reviving BrowserID in 2026

https://wakamoleguy.com/p/reviving-browserid-in-2026
2•wakamoleguy•56m ago•0 comments

Show HN: deterministic oracle for hardware designs with replayable proofs

https://suprastructure.net
1•suprastructure•57m ago•0 comments

Show HN: Draw Together Online

https://vidzert.com/draw-together
4•vidzert•58m ago•0 comments

Donald Trump is giving psychedelic medicines a welcome boost

https://www.economist.com/business/2026/04/23/donald-trump-is-giving-psychedelic-medicines-a-welc...
2•andsoitis•1h ago•0 comments

Claude Cowork Now Runs Any LLM. Test It Free

https://www.productcompass.pm/p/cowork-on-3p-any-llm
1•obilgic•1h ago•0 comments

Multi-Agent AI Systems Are Eating Single Agents

https://aistackinsights.ai/blog/multi-agent-ai-systems-langgraph-crewai-production-guide
2•aistackinsights•1h ago•0 comments

A Guide to CubeSat Mission and Bus Design

https://pressbooks-dev.oer.hawaii.edu/epet302/
2•o4c•1h ago•0 comments

I have officially retired from Emacs

https://nullprogram.com/blog/2026/04/26/
4•Fudgel•1h ago•0 comments

Reading Vault – free vocab tool that scores comprehension, not memory

https://www.readingvault.app
1•HimaghnaS•1h ago•0 comments

Modelling a Balanced Pencil(2014)

https://thatsmaths.com/2014/06/26/balancing-a-pencil/
1•o4c•1h ago•0 comments

Ask HN: What file sharing apps do you guys use?

3•samarthv•1h ago•3 comments