frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Fenris Creations (FKA CCP Games) Opens Carbon Engine to the World

https://fenris.com/news/2026/fenris-creations-opens-carbon-engine-to-the-world
1•lentil_soup•46s ago•0 comments

Amazon blames piracy apps with malware for killing new Fire Stick sideloading

https://arstechnica.com/gadgets/2026/06/exec-blames-malware-threat-for-amazon-blocking-sideloadin...
1•Brajeshwar•1m ago•0 comments

Workplace monitoring platforms are sharing your data

https://scholarship.law.columbia.edu/law_economy/5/
1•claudiacsf•1m ago•0 comments

A Real-World Law-Enforcement Hack: The Case of Encrochat

https://martinralbrecht.wordpress.com/2026/07/01/a-real-world-law-enforcement-hack-the-case-of-en...
1•u1hcw9nx•2m ago•0 comments

Ray Tracer in SQL

https://github.com/ClickHouse/RayTracer
1•kbumsik•3m ago•0 comments

Sony Deletes 551 Movies PlayStation Owners Paid For

https://reclaimthenet.org/sony-deletes-551-studiocanal-movies-playstation-owners-paid-for
3•bilsbie•5m ago•0 comments

Persistent memory for AI agents is three problems, not one

https://promptowl.ai/resources/persistent-memory-ai-agents/
1•sparkystacey•6m ago•0 comments

UK likely to intervene in Paramount takeover of Warner Bros Discovery

https://arstechnica.com/tech-policy/2026/07/uk-likely-to-intervene-in-paramount-takeover-of-warne...
1•rbanffy•6m ago•0 comments

Thomas Paine might have had to verify his identity before publishing this

https://twitter.com/FreeSpeech_AI/status/2072016572571435224
2•bilsbie•6m ago•0 comments

AI Sped Up Coding Faster Than It Sped Up Delivery

https://www.builder.io/blog/ai-sped-up-coding-faster-than-it-sped-up-delivery
1•jamdesk•7m ago•0 comments

Cloudflare to block cynical search-and-scrape bots from ad-supported web pages

https://www.theregister.com/ai-and-ml/2026/07/01/cloudflare-to-block-cynical-search-and-scrape-bo...
1•hedora•8m ago•1 comments

Why AI agents get canceled (and the 5 places they fail quietly)

https://www.brimtech.co/notes/why-agents-get-canceled/
1•semalba•9m ago•0 comments

For First Time, a Cell Built from Scratch Grows and Divides

https://www.quantamagazine.org/for-the-first-time-a-cell-built-from-scratch-grows-and-divides-202...
2•defrost•10m ago•0 comments

Heading OS – Run a company (as the CEO) from Claude Code, with data kept private

https://github.com/mishahanin/heading-os
1•mishahanin•12m ago•0 comments

Physical Disc Production to End for New Games Releasing on PlayStation Consoles

https://www.ign.com/articles/sony-just-killed-discs-physical-disc-production-to-end-january-2028-...
1•alanfranz•12m ago•0 comments

Soatok's Informal Guide to Threat Models

https://soatok.blog/2026/06/30/soatoks-informal-guide-to-threat-models/
1•birdculture•12m ago•0 comments

The Case for Sustainability Metrics (Or Don't Be Kennan Frost)

https://pawelbrodzinski.substack.com/p/the-case-for-sustainability-metrics
1•flail•13m ago•0 comments

They Don't Know How It Works

https://moai.studio/blog/posts/they-dont-know-how-it-works.html
1•ionwake•14m ago•0 comments

Abundance of Intelligence

https://magzimof.com/abundance-of-intelligence/
1•shaimagz•15m ago•0 comments

Mark Zuckerberg says a Meta cloud computing business 'definitely on the table'

https://www.cnbc.com/2026/05/27/mark-zuckerberg-says-meta-starting-cloud-business-on-the-table.html
1•BiraIgnacio•15m ago•0 comments

Watching for File Changes on macOS

https://alexwlchan.net/2026/watch-files-on-macos/
2•surprisetalk•16m ago•0 comments

CNN Weather

https://www.cnn.com/interactive/new_business/weather_app/index.html
1•ChaseRensberger•16m ago•0 comments

Monlite: The complete back end for AI agents – in one file

https://github.com/qataruts/monlite
2•emadjumaah•19m ago•0 comments

Meta looks to turn excess AI compute into cash

https://techcrunch.com/2026/07/01/meta-like-spacex-looks-to-turn-excess-ai-compute-into-cash/
2•bogdiyan•19m ago•0 comments

Show HN: Pinch-to-zoom tree navigation

https://www.delopsu.com/pinch-to-zoom-tree-navigation
3•delopsu•20m ago•2 comments

Mageia 10 keeps the 32-bit Linux flame alive

https://www.theregister.com/os-platforms/2026/06/29/mageia-10-keeps-the-32-bit-linux-flame-alive/...
1•Qem•20m ago•0 comments

FFmpeg 9.1's new AAC encoder

https://news.ycombinator.com/
2•ledoge•20m ago•4 comments

Prevented Mortality and Greenhouse Gas Emissions from Nuclear Power [pdf]

https://www.giss.nasa.gov/pubs/docs/2013/2013_Kharecha_kh05000e.pdf
1•rbanffy•22m ago•0 comments

Show HN: Osiris JSON generate private infrastructure snapshot without AI or SaaS

https://github.com/osirisjson/osiris-producers
1•skhell•23m ago•1 comments

Show HN: Loma – a self-hosted shared AI layer for your whole company

https://github.com/plotlinelabs/loma
1•tadarsh•24m ago•0 comments