frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Latvian government collapses after Ukrainian drones strike oil facility

https://www.theglobeandmail.com/world/article-latvian-government-collapses-after-ukrainian-drones...
1•petethomas•2m ago•0 comments

Musk accused of 'selective amnesia,' Altman of lying as OpenAI trial nears end

https://www.reuters.com/sustainability/society-equity/elon-musks-court-battle-against-openai-ente...
1•jnord•2m ago•0 comments

Details of the Daring Airdrop at Tristan Da Cunha

https://www.tristandc.com/government/news-2026-05-11-airdrop.php
1•kspacewalk2•2m ago•0 comments

White-collar workers report growing feelings of 'AI brain fry'

https://www.ft.com/content/0ba3bd4f-cc3a-4cad-8a8e-76925da2a711
2•1vuio0pswjnm7•7m ago•0 comments

How Do VPNs Protect Your Privacy? VPN Overview

https://www.privacyguides.org/en/basics/vpn-overview/
1•Cider9986•9m ago•0 comments

Secrets at Rest: SOPS and Age for Docker Compose Homelabs

https://pikemd.com/blog/sops-age-docker-compose/
2•pike00•11m ago•0 comments

Self-destructing $2k Nvidia chips for distributed solar data ctrs in lampposts

https://www.techradar.com/pro/self-destructing-usd2-000-nvidia-chips-will-soon-power-tens-of-thou...
2•toss1•13m ago•0 comments

I ran forensics on closed models and discovered no one is using dense attention

https://blog.0xmmo.co/forensics/post.html
1•mmoustafa•17m ago•0 comments

Countdown to Apophis Close Approach–Cascading Hazards from Asteroid Impacts

https://pubs.usgs.gov/publication/fs20253028/full
1•rolph•18m ago•0 comments

Systematically Auditing AI Agent Benchmarks with BenchJack

https://arxiv.org/abs/2605.12673
1•matt_d•20m ago•0 comments

Show HN: Trailmaps.app – Mobile maps that match the trail

https://trailmaps.app/
1•c0nsumer•23m ago•2 comments

Musk's China trip during OpenAI trial prompts apology from his lawyer

https://www.cnbc.com/2026/05/14/musk-lawyer-trial-jury-china-trip-openai-altman.html
1•1vuio0pswjnm7•26m ago•0 comments

How to Fix "DMARC Quarantine/Reject Policy Not Enabled"

https://dmarcguard.io/blog/dmarc-policy-not-enabled-fix/
1•meysamazad•27m ago•0 comments

How do you tell who's thinking?

https://willhackett.com/borrowed-cognition/
1•meysamazad•27m ago•0 comments

Ingest – Capture Anything from Anywhere

https://edleeman.co.uk/posts/ingest-capture-anything-from-anywhere/
1•meysamazad•28m ago•0 comments

Cowboy files plans for up to 20k orbital data centers

https://spacenews.com/cowboy-files-plans-for-up-to-20000-orbital-data-centers/
2•defrost•29m ago•0 comments

Bay Area customers may face warnings, fees under Recology's new camera system

https://www.sfgate.com/local/article/recology-cameras-22259377.php
1•turtlegrids•31m ago•0 comments

Water on Earth

https://www.scientificamerican.com/article/its-a-water-full-world/
2•soupspaces•32m ago•0 comments

Big tech is sacrificing its cashflows to prop up the AI boom

https://www.economist.com/business/2026/05/13/big-tech-is-sacrificing-its-cashflows-to-prop-up-th...
3•1vuio0pswjnm7•33m ago•1 comments

Possible Samsung strike puts more pressure on memory pricing

https://www.theregister.com/systems/2026/05/15/possible-samsung-strike-puts-even-more-pressure-on...
1•jnord•36m ago•0 comments

Beyond Git: Coordinating humans, agents, and automation in a repo with a ledger

https://www.mentu.ai/blog/beyond-git
2•rashidae•36m ago•0 comments

Audit of Serai's Substrate Blockchain

https://serai.exchange/2026/04/15/serai-blockchain-audited.html
1•Cider9986•37m ago•0 comments

The secretive and lucrative world of orchid breeding

https://www.bbc.com/news/articles/cly039rr2mgo
1•y1n0•37m ago•0 comments

Spam Resistant Forges

https://blog.feld.me/posts/2026/05/spam-resistant-forges/
1•y1n0•38m ago•0 comments

Untangling Communication (2001) [pdf]

https://dhemery.com/pdf/untangling_communication.pdf
1•mooreds•38m ago•0 comments

Don't let your old NVMe gather dust: It's the fastest USB stick you own

https://www.xda-developers.com/old-nvme-is-the-fastest-usb-stick-you-own/
2•y1n0•40m ago•0 comments

AI Wellbeing – Measuring and Improving the Functional Pleasure and Pain of AIs

https://www.ai-wellbeing.org/
1•xiaoyu2006•41m ago•1 comments

Heads up: new Google support scam uses a REAL email from Google: sysadmin

https://old.reddit.com/r/sysadmin/comments/1tdezhu/heads_up_new_google_support_scam_uses_a_real/
1•freediver•43m ago•0 comments

US plans to indict Cuba's Raul Castro, US DOJ official says

https://www.reuters.com/legal/government/us-plans-indict-cubas-raul-castro-us-doj-official-says-2...
1•tartoran•45m ago•0 comments

We Didn't Ask for This Internet

https://angelabenton.substack.com/p/what-a-post-social-media-internet
1•ethanplant•55m ago•0 comments