frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

What Apple Knows About AI That Silicon Valley Won't Admit

https://www.thealgorithmicbridge.com/p/what-apple-knows-about-ai-that-silicon
1•gHeadphone•33s ago•0 comments

Netflix Wiz creates app to slash AI bills by pruning agent instructions

https://www.theregister.com/ai-ml/2026/05/31/netflix-wiz-creates-app-to-slash-ai-bills-then-open-...
1•beardyw•3m ago•1 comments

Open but Polished

https://mason.bearblog.dev/why-im-building-open-but-polished/
1•masoniamme•5m ago•0 comments

Abstruse Goose Comic Archive

https://github.com/s-macke/Abstruse-Goose-Archive
2•s-macke•5m ago•0 comments

The Road to Zig 1.0 (2019) [video]

https://www.youtube.com/watch?v=Gv2I7qTux7g
1•tosh•16m ago•0 comments

The dangerous delusion of modern warfare

https://economist.com/interactive/essay/2026/05/28/the-dangerous-delusion-of-modern-warfare
3•runeks•16m ago•1 comments

Show HN: CPU-only fast OCR for screenshots, images, PDFs, webpages

https://github.com/kouhxp/textsnap
2•mrkn1•19m ago•1 comments

A Beginner's Guide to Scaling to 11M+ Users on Amazon's AWS (2016)

https://highscalability.com/a-beginners-guide-to-scaling-to-11-million-users-on-amazons/
1•downbad_•20m ago•0 comments

Edgar Allan Poe's story that taught me about cryptography (2024)

https://robotsinplainenglish.com/e/2024-01-21-gold-bug.html
2•ripe•21m ago•0 comments

Build Agents, Not Pipelines

https://www.seangoedecke.com/build-agents-not-pipelines/
1•alexsanjoseph•27m ago•0 comments

An Interview with 100 Rabbits

https://sourcehut.org/blog/2021-12-08-100-rabbits-interview/
1•jruohonen•29m ago•0 comments

A New Design for Pretty Printer Implementations in Rust

https://blog.wybxc.cc/blog/pretty-printer-pye/
1•g0xA52A2A•30m ago•0 comments

Palo Alto GlobalProtect VPN auth bypass flaw now exploited in attacks

https://www.bleepingcomputer.com/news/security/palo-alto-globalprotect-vpn-auth-bypass-flaw-now-e...
2•throwa356262•31m ago•0 comments

AI Dark Output: The Visible Cost of Invisible Output

https://newsletter.semianalysis.com/p/ai-dark-output-the-visible-cost-of
2•qnleigh•31m ago•1 comments

Stop Micromanaging Your Agents

https://adlrocha.substack.com/p/adlrocha-stop-micromanaging-your
1•adlrocha•32m ago•0 comments

Multicore suppport for DOS is real – partly

https://www.vogons.org/viewtopic.php?t=111336
1•beebix•45m ago•0 comments

AI slop is hard to fork

https://00f.net/2026/05/31/ai-slop-is-hard-to-fork/
1•jedisct1•46m ago•0 comments

Your Electronics Depend on Dying Sensors. The Silicon Labs Incident Proves It

https://www.cambridge.org/engage/coe/article-details/6a054b304770e67d92e8c7a2
1•openrockets•47m ago•0 comments

Looking for paid reviewer of custom Sealed Sender construction

https://github.com/pontusva/spectre
1•nullsender•49m ago•0 comments

Untraceable Digital Cash, Information Markets, and BlackNet (1997)

https://osaka.law.miami.edu/froomkin/articles/tcmay.htm
1•greyface-•52m ago•1 comments

GOMS

https://en.wikipedia.org/wiki/GOMS
1•ustad•56m ago•0 comments

Pkl from Apple: configuration as code language with rich validation and tooling

https://github.com/apple/pkl
1•appwiz•1h ago•0 comments

ICE Authorized to Detain People Suspected of Emigration from Alternate Realities

https://medium.com/luminasticity/ice-authorized-to-detain-people-suspected-of-immigrating-from-al...
4•bryanrasmussen•1h ago•0 comments

QuantaTrader Demo – Ferramenta de análise e simulaçãO de trading

https://quantatrader-demo.streamlit.app/
1•gilthor•1h ago•0 comments

Enhancing Multi-Agent Communication Through Attention Steering

https://arxiv.org/abs/2605.30136
3•ankitg12•1h ago•0 comments

Open Letter to Steve Lemay

https://ilyabirman.net/meanwhile/all/dear-steve-lemay/
5•smagin•1h ago•0 comments

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

https://arxiv.org/abs/2510.12635
2•ankitg12•1h ago•0 comments

Five days in darkness left scientist Kiana Aran thinking about anything

https://www.abc.net.au/news/2026-05-31/darkness-retreat-sensory-deprivation-cave-kiana-aran/10668...
5•Towaway69•1h ago•1 comments

Show HN: AnyFrame, Platform for every agent your team builds

https://anyframe.dev
3•inishchith•1h ago•0 comments

How does a Mikrokator work?(2022) [video]

https://www.youtube.com/watch?v=2zEeAzJq-CQ
2•pillars•1h ago•0 comments