frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Google's AI Is Confused About Fish and the Days of the Week

https://opus.ing/p/google-ai-really-confused-about-fish-days-week
1•_____k•1m ago•0 comments

Rivian explains why they won't adopt Apple CarPlay

https://www.macrumors.com/2026/05/28/rivian-software-chief-on-carplay/
2•andsoitis•1m ago•0 comments

Testing products on AI-generated buyers

https://www.bain.com/insights/synthetic-customers-earn-their-stripes/
1•paulpauper•2m ago•0 comments

Soft Serve and Soft Power

https://listomania.substack.com/p/soft-serve-and-soft-power
1•paulpauper•2m ago•0 comments

Secure GCP Auth in Bitbucket Pipelines

https://emilytburak.net/posts/bitbucket-pipes-gap-gcp-oidc/
1•mooreds•3m ago•0 comments

US State Privacy Legislation Tracker

https://iapp.org/resources/article/us-state-privacy-legislation-tracker
1•mooreds•5m ago•0 comments

White House aliens website is scary

https://www.whitehouse.gov/aliens/
1•brisket_bronson•6m ago•0 comments

Everything We Know About OpenAI's Planned iPhone Rival

https://www.macrumors.com/2026/05/29/everything-we-know-about-openai-iphone-rival/
2•andsoitis•6m ago•0 comments

Dusklight – GC Twilight Princess Decompiled

https://twilitrealm.dev/
1•shepherdjerred•6m ago•0 comments

The First Killer App

https://dl.acm.org/doi/pdf/10.1145/2509224
1•jruohonen•8m ago•0 comments

Why is this text everywhere? (Lorem Ipsum) [video]

https://www.youtube.com/watch?v=kL1PDqzqhM4
1•fanfantm•9m ago•0 comments

Zerostack v1.3.4 released – Lightweight Unix-like coding agent

https://github.com/gi-dellav/zerostack/releases/tag/v1.3.4
1•gidellav•12m ago•0 comments

768GB Intel Optane DIMMs to run 1T-parameter LLM with single GPU at 4tps

https://www.tomshardware.com/tech-industry/artificial-intelligence/enthusiast-runs-1-trillion-par...
1•walterbell•13m ago•0 comments

Fortunes of Anthropic's Seven Cofounders More Than Double to $16.6B Each

https://www.forbes.com/sites/richardnieva/2026/05/29/anthropics-cofounders-worth/
1•andsoitis•15m ago•0 comments

1M Ancient Greek fragments soon to be translated with the help of AI

https://www.oeaw.ac.at/en/news/austrian-academy-of-sciences-is-developing-the-ancient-greek-ai-ap...
1•janandonly•17m ago•0 comments

For targeted assassination using a ricin-Lorazepam combination

https://vostoktechnicalbureau.substack.com/p/for-targeted-assassination-using
1•VostocBuraeu•17m ago•0 comments

Mental Health Benefits

https://samatahealth.com/
2•paulsiccha•18m ago•0 comments

One engine, many tools – Introducing Rubydex

https://railsatscale.com/2026-05-12-one-engine-many-tools/
1•weaksauce•18m ago•0 comments

The Value of Science – Poincare [pdf]

https://academicweb.nd.edu/~powers/ame.60611/poincare.pdf
1•mindcrime•19m ago•0 comments

Sycophantic AI decreases prosocial intentions and promotes dependence

https://www.science.org/doi/10.1126/science.aec8352
1•jg0r3•20m ago•0 comments

Byte, Vol. 7, No. 8 (1982) [pdf]

https://archive.org/download/byte-magazine-1982-08/1982_08_BYTE_07-08_Logo.pdf
1•susam•21m ago•0 comments

Net 11 introduces runtime-native async replacing compiler-gen. state machines

https://learn.microsoft.com/en-us/dotnet/core/whats-new/dotnet-11/runtime
1•polskibus•22m ago•0 comments

Looking Back at Lewis and Clark

https://www.newyorker.com/magazine/2026/06/01/this-vast-enterprise-craig-fehrman-book-review
1•bookofjoe•23m ago•1 comments

LocalEmu, a free AWS emulator (fork of archived LocalStack)

https://github.com/localemu/localemu
2•CloudHackerFr•25m ago•0 comments

A disappearing Service Processor (2025)

https://oxide.computer/blog/cosmo-sp
2•mooreds•25m ago•0 comments

Frona v2026.5.5 – self-hosted personal AI assistant

https://github.com/fronalabs/frona/releases/tag/v2026.5.5
1•syncerx•27m ago•0 comments

The price of the Manhattan Project (2013)

https://blog.nuclearsecrecy.com/2013/05/17/the-price-of-the-manhattan-project/
1•downbad_•27m ago•0 comments

Transfer Emotions to VR Avatar via Brain with PiEEG XR

https://www.notebookcheck.net/For-VR-PiEEG-XR-measures-brain-activity-in-real-time.1311211.0.html
2•Christiangmer•31m ago•0 comments

The Cost of AI: From the Perspective of a Game Developer

https://alextardif.com/AI.html
1•coinfused•33m ago•0 comments

Who Has the Hardest Fist in China's AI Valuation Race?

https://crossingriver.substack.com/p/who-has-the-hardest-fist-in-chinas
2•ramimac•35m ago•0 comments