frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Lawnchair Larry Flight

https://en.wikipedia.org/wiki/Lawnchair_Larry_flight
1•anyonecancode•2m ago•0 comments

GitHub's Outages Since the Microsoft Acquisition

https://old.reddit.com/r/github/comments/1rnvhs9/githubs_historic_downtime_scraped_and_plotted/
2•MrBuddyCasino•2m ago•0 comments

Researchers say we're talking less

https://www.theverge.com/science/918753/researchers-talking-less
1•vednig•3m ago•0 comments

The Demonization of Male Ambition – Lisa Britton

https://lisabritton.substack.com/p/the-demonization-of-male-ambition
1•bilsbie•4m ago•0 comments

Meta buys AWS Graviton Arm cores in a CPU land grab

https://www.servethehome.com/meta-buys-tens-of-millions-of-aws-graviton-arm-cores-in-a-cpu-land-g...
3•teleforce•7m ago•0 comments

The Fall of the Theorem Economy

https://davidbessis.substack.com/p/the-fall-of-the-theorem-economy
2•mathgenius•13m ago•0 comments

Minimal offline Wikipedia with all text, and proper math rendering

https://github.com/ttsiodras/offline-wikipedia-via-slobby-and-mathjax
2•ttsiodras•14m ago•0 comments

Sabastian Sawe Shatters the 2-Hour Barrier at 2026 London Marathon

https://www.letsrun.com/news/2026/04/15930-sabastian-sawe-shatters-the-2-hour-barrier-at-2026-lon...
2•hasheddan•17m ago•0 comments

<hyper-frame> – embed the Internet, no restrictions

https://www.hyper-frame.art/?hit
1•keepamovin•21m ago•0 comments

Ask HN: What to Expect in 2030s?

3•alexander2002•23m ago•2 comments

A populist wave is rising to end the 'captive' repair economy

https://www.cnbc.com/2026/04/25/right-to-repair-consumer-prices-affordability-economy-elections.html
4•pseudolus•28m ago•0 comments

The 'smart wall' the US is building on the border

https://english.elpais.com/usa/2026-04-26/reinforced-walls-and-detection-technology-the-smart-wal...
2•geox•29m ago•0 comments

Ask HN: Has Claude Opus 4.7 nerfed?

2•souravroy78•30m ago•0 comments

Getting Started [with Retro Computing]

https://smallcomputercentral.com/articles/getting-started/
1•AlexeyBrin•30m ago•0 comments

Claude Code Hooks Reference

https://code.claude.com/docs/en/hooks
2•firasd•32m ago•0 comments

Two runners finish marathon in under 2 hours, a world first

https://www.dw.com/en/sawe-smashes-2-hour-mark-setting-record-at-london-marathon/a-76943171
2•keiferski•32m ago•0 comments

1B in, 20B out. Apple stays out of the war

https://alphasense.cc/signals/apple-ai/
1•langtang1996•35m ago•1 comments

Documented source code for The Sentinel on the BBC Micro

https://thesentinel.bbcelite.com
1•jimmcslim•35m ago•1 comments

The Half-Life of a Moat (Part 1)

https://semistructured.substack.com/p/the-half-life-of-a-moat-part-1
1•kmdupree•37m ago•0 comments

A 17th Century astrolabe once owned by Indian royalty heads for auction

https://www.bbc.com/news/articles/c8x7kw9lp8do
1•breve•38m ago•0 comments

Thoughts about Moments in Claude Mythos System Card

https://old.reddit.com/r/BetterOffline/comments/1sgxc77/thoughts_about_strange_moments_in_claude_...
3•kmdupree•38m ago•0 comments

EsoBench: Learning a Novel Esolang via Iterative Execution Feedback

https://caseys-evals.com/esobench
1•kmdupree•40m ago•0 comments

'I know what I saw' – Scotland's history of big cat sightings

https://www.bbc.com/news/articles/cdxk5525792o
1•breve•40m ago•0 comments

This is How We Get Moral A.I. Companies

https://www.nytimes.com/2026/04/26/opinion/ai-company-good-altruism.html
2•trauco•41m ago•0 comments

Trace Codex Session Easily

https://github.com/PixelPaw-Labs/codex-trace
1•ywian•44m ago•0 comments

Show HN: Ctxbrew – Ship and Use LLM-friendly library context

https://github.com/artem-mangilev/ctxbrew
1•mangilev•45m ago•0 comments

Is it worth parallelizing your GitLab/GitHub pipeline? (Not yet another AI post)

https://softwareefficiency.wordpress.com/2026/04/26/is-it-worth-parallelizing-your-gitlab-github-...
1•denshadeds•47m ago•1 comments

Tosijs-UI's new composable icon system

https://loewald.com/blog/2026/4/26/tosijs-icon-system
1•podperson•52m ago•1 comments

Is Fahrenheit 451 becoming relevant again?

https://kevinboone.me/fahrenheit451.html
2•AlexeyBrin•53m ago•1 comments

Beyond Silicon: Materials, Mechanisms, and Methods for Physical Neural Computing

https://arxiv.org/abs/2604.09833
2•Jazgot•54m ago•1 comments