
Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
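To make the plan, execute, test, self-correct loop a bit more concrete, here is a minimal sketch in Python. It is not Refact.ai's actual code: the `Action` and `AgentState` types, the tool names other than `deep_analysis`, and the scripted `choose_action` policy (standing in for the orchestrator model) are all hypothetical, and the "repository" is a single in-memory file.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration only, not Refact.ai's API.
@dataclass
class Action:
    tool: str       # name of the tool the orchestrator wants to call
    argument: str   # free-form argument passed to that tool

@dataclass
class AgentState:
    task: str
    history: List[Tuple[Action, str]] = field(default_factory=list)
    patch: str = ""

def run_agent(task: str,
              choose_action: Callable[[AgentState], Action],
              tools: Dict[str, Callable[[str], str]],
              max_steps: int = 10) -> AgentState:
    """One multi-step run: pick a tool, observe the result, repeat until submit."""
    state = AgentState(task=task)
    for _ in range(max_steps):
        action = choose_action(state)
        if action.tool == "submit":
            state.patch = action.argument   # the single final solution
            return state
        observation = tools[action.tool](action.argument)
        state.history.append((action, observation))
    return state  # step budget exhausted without a submission

def demo() -> AgentState:
    # Toy in-memory repository with a deliberate bug.
    repo = {"calc.py": "def add(a, b): return a - b"}

    def read_file(path: str) -> str:
        return repo[path]

    def write_file(arg: str) -> str:
        path, _, content = arg.partition("\n")  # first line is the path
        repo[path] = content
        return "written"

    def run_tests(_: str) -> str:
        scope: dict = {}
        exec(repo["calc.py"], scope)
        return "pass" if scope["add"](2, 3) == 5 else "fail"

    def deep_analysis(question: str) -> str:
        # Stand-in for a call to a separate reasoning model.
        return "the operator in add() looks wrong"

    tools = {"read_file": read_file, "write_file": write_file,
             "run_tests": run_tests, "deep_analysis": deep_analysis}

    def choose_action(state: AgentState) -> Action:
        # Scripted policy standing in for the orchestrator model.
        if not state.history:
            return Action("read_file", "calc.py")
        last_action, last_obs = state.history[-1]
        if last_action.tool == "read_file":
            return Action("run_tests", "")
        if last_action.tool == "run_tests" and last_obs == "fail":
            return Action("deep_analysis", "why does add(2, 3) fail?")
        if last_action.tool == "deep_analysis":
            return Action("write_file", "calc.py\ndef add(a, b): return a + b")
        if last_action.tool == "write_file":
            return Action("run_tests", "")
        return Action("submit", repo["calc.py"])  # tests passed

    return run_agent("fix add()", choose_action, tools)
```

Calling `demo()` walks through read, test, analyze, edit, re-test, and then submits one patch. In a real agent the `choose_action` step would be a model call and the tools would act on an actual repository.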

You can read the technical details of our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

DriftNote – Podcast notes that talk back

https://www.driftnote.net/
1•LifeOfKP•1m ago•1 comments

Tests confirm super-fast charging for first solid-state-battery e-moto

https://newatlas.com/motorcycles/donut-lab-solid-state-battery-charge-test/
1•breve•1m ago•0 comments

The Clock

https://blog.senko.net/the-clock
1•senko•7m ago•1 comments

CDC and health groups spent millions in ads on sites flagged for misinformation

https://www.cidrap.umn.edu/anti-science/cdc-health-groups-spent-millions-buy-ads-websites-flagged...
1•giuliomagnifico•9m ago•0 comments

Angine de Poitrine – Full Performance [video]

https://www.youtube.com/watch?v=0Ssi-9wS1so
1•vitto_gioda•10m ago•0 comments

PDS Mac OLM File to PST Converter

https://www.perfectdatasolutions.com/en/olm/olm-to-pst-converter.html
1•tieanderson•11m ago•0 comments

A brief history of instant coffee

https://worksinprogress.co/issue/a-brief-history-of-instant-coffee/
1•admp•15m ago•0 comments

GPUs vs. TPUs: Decoding the Powerhouses of AI

https://www.savvycanary.com/gpus-vs-tpus-decoding-the-powerhouses-of-ai/
1•car•20m ago•0 comments

Being the Human in the Loop – Kevlin Henney

https://www.youtube.com/watch?v=vpYJMr1pJRY
1•mcp_•20m ago•0 comments

Emotion Concepts and Their Function in a Large Language Model

https://transformer-circuits.pub/2026/emotions/index.html
2•Anon84•22m ago•0 comments

Trinity-Large-Thinking: Open-source 398B MoE (13B active) for agentic tasks

https://firethering.com/trinity-large-thinking-open-source-agent-model/
2•steveharing1•24m ago•0 comments

Trying to Be Responsible

https://chatgpt.com/s/t_69d0ca9364ac8191868c2850d26305aa
1•aljgz•27m ago•1 comments

Every dependency you add is a supply chain attack waiting to happen

https://benhoyt.com/writings/dependencies/
1•ingve•29m ago•0 comments

Engineering a Better Java Build Tool [video]

https://www.youtube.com/watch?v=OtsJ902k458
1•lihaoyi•34m ago•0 comments

The Evolution of x86 SIMD: From SSE to AVX-512

https://bgslabs.org/blog/evolution-of-x86-simd/
1•jiehong•34m ago•1 comments

The global oil crisis is turning into an everything crisis

https://www.cnn.com/2026/04/04/business/global-oil-crisis-shortage-everything-intl-hnk-dst
1•iamflimflam1•35m ago•0 comments

Show HN: Deeplink – Go library for short links, click tracking, and OG previews

https://github.com/yinebebt/deeplink
1•yinebeb_sc•37m ago•0 comments

Show HN: I rebuilt search using physics instead of statistics. +18.5% NDCG 10

https://github.com/Razshy/resonance-search
1•KendallCBooker•38m ago•0 comments

European Commission cloud breach: a supply-chain compromise

https://cert.europa.eu/blog/european-commission-cloud-breach-trivy-supply-chain
2•Sandman•38m ago•0 comments

Show HN: LaneKeep - let your agent run within boundaries that you define

https://github.com/algorismo-au/lanekeep
2•mightymo1•39m ago•1 comments

Show HN: Prematrix

https://www.prematrix.dev/
1•thomasfromcdnjs•41m ago•0 comments

Excess mortality attributable to the 2025 Iberian Peninsula blackout

https://www.medrxiv.org/content/10.1101/2025.06.03.25328877v1
2•mpweiher•42m ago•0 comments

Should you change your life decisions if we're being watched by alien drones?

https://marginalrevolution.com/marginalrevolution/2026/04/how-should-you-change-your-life-decisio...
1•jger15•42m ago•0 comments

Mechanical Techno Updates

https://www.youtube.com/watch?v=sBhGbHVQYvI
2•ngcazz•44m ago•0 comments

Mixed Precision Quantization on mlx comes with TurboQuant implementation

https://twitter.com/thin_signal/status/2028412948167942334
2•jsilence•45m ago•1 comments

Does downloading a YT video through yt-dlp work after deploying?

1•sunill•53m ago•0 comments

LÖVE: 2D Game Framework for Lua

https://github.com/love2d/love
2•cl3misch•55m ago•0 comments

Talent is everywhere, opportunity is not. We are all losing out because of this

https://ourworldindata.org/talent-is-everywhere-opportunity-is-not
2•prakashqwerty•57m ago•2 comments

110k+ publications from 2025 might include hallucinated citations

https://www.nature.com/articles/d41586-026-00969-z
4•cyclecycle•59m ago•3 comments

SQLite in Production: Lessons from Running a Store on a Single File

https://ultrathink.art/blog/sqlite-in-production-lessons
3•thunderbong•59m ago•1 comments