frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Ben Goldacre: OpenSAFELY in Brief

https://www.bennett.ox.ac.uk/blog/2025/02/opensafely-in-brief/
1•anitil•1m ago•1 comments

Do those that deserve the world, get the world?

https://dreamingtooloud.substack.com/p/do-those-that-deserve-the-world-get
1•homebush•11m ago•0 comments

AI-powered knowledge assistant for sexual and reproductive health

https://chathrp.org/
1•salkahfi•14m ago•0 comments

Quest Browser 146.0 adds experimental support for WebGPU in WebXR

https://bsky.app/profile/cabanier.bsky.social/post/3mk6ugjtbw22y
1•juretriglav•14m ago•0 comments

The importance of stupidity in scientific research (2008) [pdf]

https://web.stanford.edu/~fukamit/schwartz-2008.pdf
1•the-mitr•16m ago•0 comments

How Europe regulated itself into American vassalage

https://www.economist.com/europe/2026/04/22/how-europe-regulated-itself-into-american-vassalage
1•RestlessMind•16m ago•1 comments

Iran War Has Drained U.S. Supplies of Critical, Costly Weapons

https://www.nytimes.com/2026/04/23/us/politics/iran-war-cost-military.html
2•vrganj•18m ago•0 comments

Dutch government secures deal with European cloud platform STACKIT

https://www.nldigitalgovernment.nl/news/dutch-government-secures-deal-with-european-cloud-platfor...
1•hvb2•19m ago•0 comments

PuzzleScript

https://www.puzzlescript.net/Documentation/rules101.html
1•azhenley•21m ago•0 comments

Contral AI

https://contral.ai
1•vednig•24m ago•0 comments

What Are Unix Domain Sockets?

https://docs.sweeting.me/s/sockets-101
1•nikisweeting•27m ago•0 comments

Paint But…

https://paintbut.netlify.app/
1•memalign•31m ago•0 comments

GitGuardian analysis of the bitwarden/CLI compromise

https://blog.gitguardian.com/bitwarden-cli-gitguardian-views-on-helloworm00/
2•cwinq•31m ago•0 comments

Rendezvous and Docking: A User's Guide for Non Rocket Scientists

https://www.baen.com/rendezvous
1•EvgeniyZh•31m ago•0 comments

Microsoft offers buyouts for longtime employees

https://www.seattletimes.com/business/microsoft/microsoft-offers-buyouts-for-longtime-employees/
4•divbzero•34m ago•0 comments

FujiNet Go 800 – Atari800 Emulator for Android

https://fujinet.online/2026/04/23/fujinet-go-800-atari800-emulator-for-android/
1•p0w3n3d•37m ago•2 comments

The Surveillance Accountability Act Full Text [pdf]

https://boebert.house.gov/sites/evo-subsites/boebert.house.gov/files/evo-media-document/surveilla...
4•Cider9986•39m ago•1 comments

OpenAI deprecates all GPT nano fine tuning

https://community.openai.com/t/deprecation-of-fine-tuned-models-but-still-cant-access-newer-ones/...
2•dandiep•40m ago•0 comments

Why Not Venus?

https://mceglowski.substack.com/p/why-not-venus
1•zdw•48m ago•0 comments

Running Bare-Metal Rust Alongside ESP-IDF on the ESP32-S3's Second Core

https://tingouw.com/blog/embedded/esp32/run_rust_on_app_core
1•MrBuddyCasino•52m ago•0 comments

The Budgeting Mistake That Cost Uber Its Annual AI Spend in 4 Months

https://www.productcurious.com/p/uber-ai-budget-mistake
5•umangsehgal93•53m ago•0 comments

Tremendous Iranian Invasion: A Text Misadventure

3•brooksc•57m ago•0 comments

Essential Voice by Nothing

https://nothing.community/d/56167-introducing-essential-voice
1•plun9•59m ago•0 comments

Familiarity is the enemy: On why Enterprise systems have failed for 60 years

https://felixbarbalet.com/familiarity-is-the-enemy/
2•adityaathalye•1h ago•0 comments

Intel Arc Pro B70 Review

https://www.pugetsystems.com/labs/articles/intel-arc-pro-b70-review/
1•zdw•1h ago•0 comments

ASML's latest chipmaking gear is too pricey, even for TSMC

https://theedgemalaysia.com/node/800964
2•jackyli02•1h ago•0 comments

Intel Arc Pro B70 benchmarks for LLMs and video generation

https://github.com/PMZFX/intel-arc-pro-b70-benchmarks
1•mroche•1h ago•0 comments

DeepSeek's Sequel Set to Extend China's Reach in Open-Source A.I

https://www.nytimes.com/2026/04/24/business/china-ai-deepseek-open-source.html
1•Cider9986•1h ago•0 comments

Ubuntu 26.04 LTS Released

https://lwn.net/Articles/1069399/
63•lxst•1h ago•17 comments

AI Resume Reviewer

https://www.thehumancapitalhub.com/ai-resume-reviewer
1•bisit•1h ago•0 comments