
Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
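
As a rough illustration of the plan, execute, test, self-correct loop described above, here is a minimal Python sketch. It is not Refact.ai's implementation: `run_agent`, `scripted_policy`, the tool names, and the patch labels are all hypothetical, and the orchestrator model is replaced by a scripted stand-in so the example runs without any API access.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class AgentRun:
    """Record of a single multi-step run on one task."""
    task: str
    history: List[str] = field(default_factory=list)
    solved: bool = False


def run_agent(
    task: str,
    choose_action: Callable[[AgentRun], Tuple[str, str]],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 10,
) -> AgentRun:
    """Repeatedly ask the orchestrator for the next tool call until it finishes."""
    run = AgentRun(task)
    for _ in range(max_steps):
        name, arg = choose_action(run)  # hypothetical orchestrator decision
        if name == "finish":
            run.solved = True
            break
        result = tools[name](arg)
        run.history.append(f"{name}({arg}) -> {result}")
    return run


def scripted_policy(run: AgentRun) -> Tuple[str, str]:
    """Stand-in for the orchestrator model: picks the next tool from the last step."""
    if not run.history:
        return ("explore", "repo")
    last = run.history[-1]
    if last.startswith("explore"):
        return ("edit", "patch-1")
    if last.startswith("edit"):
        return ("test", "")
    if last.startswith("test") and last.endswith("fail"):
        # Self-correction: ask the (hypothetical) reasoning tool before retrying.
        return ("deep_analysis", "why did the test fail?")
    if last.startswith("deep_analysis"):
        return ("edit", "patch-2")
    return ("finish", "")


# Toy environment: only the second patch makes the test pass.
state = {"patch": None}


def edit(arg: str) -> str:
    state["patch"] = arg
    return "ok"


tools = {
    "explore": lambda arg: "found files",
    "edit": edit,
    "test": lambda arg: "pass" if state["patch"] == "patch-2" else "fail",
    "deep_analysis": lambda arg: "hint",
}

run = run_agent("fix failing test", scripted_policy, tools)
for step in run.history:
    print(step)
```

In this toy run the first patch fails its test, the stand-in policy calls the reasoning tool, and the second patch passes, so the run ends with a single final solution after one multi-step pass.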

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

OpenAI now wants ChatGPT to access your bank accounts

https://www.theverge.com/ai-artificial-intelligence/931122/openai-chatgpt-financial-accounts-plai...
1•ndr42•34s ago•0 comments

The Aperiodic Table

https://blog.jgc.org/2026/05/the-aperiodic-table.html
1•jgrahamc•43s ago•0 comments

What Color is Your Function? (2015)

https://journal.stuffwithstuff.com/2015/02/01/what-color-is-your-function/
1•tosh•5m ago•0 comments

Using a Nintendo Switch to Speed Up a 3D Printer

https://hackaday.com/2026/05/15/using-a-nintendo-switch-to-speed-up-a-3d-printer/
3•speckx•6m ago•0 comments

Where Did All the Soul Go?

https://arpl.dev/blog/where-did-all-the-sould-go
1•mooreds•7m ago•0 comments

Psyllium husk is being touted as nature's Ozempic

https://www.theguardian.com/wellness/2025/jun/11/what-is-psyllium-husk
1•rzk•9m ago•0 comments

Microsoft/Wil: Windows Implementation Library

https://github.com/microsoft/wil
1•Tomte•10m ago•0 comments

Playing Atari music on Amiga for free

https://arnaud-carre.github.io/2026-05-15-ym-fast-emu/
2•nopakos•10m ago•0 comments

JOOQ: The easiest way to write SQL in Java

https://www.jooq.org/
1•Tomte•10m ago•0 comments

Travelers on Air Force One ordered to throw away gifts, phones after China trip

https://techcrunch.com/2026/05/15/us-orders-travelers-on-air-force-one-to-throw-away-gifts-pins-a...
4•leopoldj•12m ago•0 comments

Azure Container Apps Express

https://techcommunity.microsoft.com/blog/appsonazureblog/introducing-azure-container-apps-express...
1•vyrotek•13m ago•0 comments

Trump leaves China with no agreement but cites 'good' talks with Xi

https://www.nbcnews.com/politics/donald-trump/trump-leaves-china-no-agreement-thorny-issues-cites...
1•kaycebasques•15m ago•1 comment

I'm Not Sorry

https://www.lrb.co.uk/the-paper/v48/n09/thomas-nagel/i-m-not-sorry
1•lermontov•15m ago•0 comments

The shift towards pay to play

https://rosie.land/posts/the-shift-towards-pay-to-play/
2•mooreds•16m ago•0 comments

The Slowest SR-71 Blackbird Fly-By (2017)

https://theaviationgeekclub.com/story-behind-famed-sr-71-blackbird-super-low-knife-edge-pass/
3•_Microft•17m ago•1 comment

YA3 – Yet Another TB-303 clone, that runs in the browser and as a DAW plugin

https://ya3.surge.sh/
1•stagas•17m ago•0 comments

Przybylski's Star: Still Bizarre After All These Years

https://www.centauri-dreams.org/2026/05/15/przybylskis-star-still-bizarre-after-all-these-years/
2•JPLeRouzic•18m ago•0 comments

Kairos: The ancient Greek art of knowing when to act

https://bigthink.com/mini-philosophy/kairos-the-ancient-greek-art-of-knowing-when-to-act/
2•lschueller•21m ago•0 comments

Waymo recalls 3,800 robotaxis after they drive into flood waters

https://www.cnbc.com/2026/05/12/waymo-recalls-3800-robotaxis-after-able-drive-into-standing-water...
4•drob518•23m ago•0 comments

Building a UMatrix Replacement

https://lock.cmpxchg8b.com/umatrix.html
2•taviso•23m ago•0 comments

Ghost of long-extinct ancestor lives on in people today

https://www.science.org/content/article/ghost-long-extinct-ancestor-lives-people-today
1•gmays•25m ago•0 comments

Build a Full-Featured Text Editor from Scratch (Rust)

https://0xkiire.com/build-text-editor-from-scratch/
3•jabits•28m ago•1 comment

Apple Sold Out of Mac Minis and Mac Studios

https://www.apple.com/shop/buy-mac/mac-mini
1•adgjlsfhk1•31m ago•1 comment

Git Is Not Fine

https://www.billjings.com/posts/title/git-is-not-fine/
2•steveklabnik•32m ago•0 comments

What Is Code?

https://martinfowler.com/articles/what-is-code.html
1•BerislavLopac•41m ago•2 comments

Bidirectional typechecking that does not stop

https://semantic-domain.blogspot.com/2026/05/bidirectional-typechecking-that-does.html
1•fanf2•41m ago•0 comments

Why Gemma-4 26B MoE works in HuggingFace but breaks in prod inference engines

https://github.com/maeddesg/vulkanforge/blob/main/docs/gemma4_26b_moe_solution.md
1•maeddesg•42m ago•0 comments

Ask HN: Can I take Meta to court for banning business Insta or FB account?

7•milanspeaks•47m ago•3 comments

Linus Torvalds declares AI-fueled code surges as the new normal

https://www.neowin.net/news/linus-torvalds-declares-massive-ai-fueled-code-surges-as-the-new-norm...
3•ell1e•49m ago•0 comments

Goodgallery: WebGL sprite engine that can load 100k thumbnails in 1 second

https://ggdemo.s80.me/demo-100000/#fit
3•thunderbong•49m ago•0 comments