Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
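
To make the plan, execute, test, self-correct loop described above concrete, here is a minimal sketch in Python. It is not Refact.ai's actual implementation: the `AgentRun` class, the tool names (`search`, `patch`, `run_tests`), and the scripted policy that stands in for the orchestrator model are all hypothetical, chosen only to illustrate one multi-step run that ends in a single final answer.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a tool maps a string argument to a string observation,
# and a step records (tool name, argument, observation).
Tool = Callable[[str], str]
Step = Tuple[str, str, str]


@dataclass
class AgentRun:
    """One multi-step run: ask the policy for an action, execute it, repeat."""
    tools: Dict[str, Tool]
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self, policy: Callable[[List[Step]], Tuple[str, str]]) -> str:
        for _ in range(self.max_steps):
            tool_name, arg = policy(self.history)
            if tool_name == "finish":
                return arg  # the single final solution
            tool = self.tools.get(tool_name)
            observation = tool(arg) if tool else f"unknown tool: {tool_name}"
            self.history.append((tool_name, arg, observation))
        return ""  # step budget exhausted without a solution


# Toy environment (an assumption for the demo): tests pass after two patches.
state = {"patches": 0}


def search(query: str) -> str:
    return f"found: {query}"


def patch(description: str) -> str:
    state["patches"] += 1
    return "patched"


def run_tests(_: str) -> str:
    return "pass" if state["patches"] >= 2 else "fail"


def scripted_policy(history: List[Step]) -> Tuple[str, str]:
    """Stand-in for the orchestrator model: picks the next tool from history."""
    if not history:
        return ("search", "bug location")
    last_tool, _, last_obs = history[-1]
    if last_tool == "search":
        return ("patch", "first attempt")
    if last_tool == "patch":
        return ("run_tests", "")
    if last_tool == "run_tests" and last_obs == "fail":
        return ("patch", "second attempt")  # self-correct after a failing test
    return ("finish", "final patch")


agent = AgentRun(tools={"search": search, "patch": patch, "run_tests": run_tests})
result = agent.run(scripted_policy)
print(result)
print([step[0] for step in agent.history])
```

In a real agent the policy would be a model call that chooses tools dynamically; the scripted version here only makes the control flow easy to follow and test.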

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

A device that revives eyeballs from dead donors could make eye transplants poss

https://www.technologyreview.com/2026/07/03/1140148/a-device-that-revives-eyeballs-from-dead-dono...
1•joozio•34s ago•0 comments

Show HN: FlipFlow turns PDF, Word, and images into flipbooks

https://flippingbooks.org
1•ceaserwang•1m ago•0 comments

Fable 5. Safety Taken to an Extreme

1•sergeysmirnov•2m ago•0 comments

HTML Is the New PowerPoint

https://sproutmarkup.com/
1•ginooliver•2m ago•0 comments

More Tailscale tricks for your jailbroken Kindle

https://tailscale.com/blog/jailbroken-kindle-proxy-tun-modes
1•Quizzical4230•3m ago•0 comments

Running Qwen 3.6 Locally on a Mac Mini M4 with 16GB RAM

https://maloyan.xyz/blog/running-qwen-locally-mac-mini-m4
1•mpweiher•5m ago•0 comments

What's new in Claude Sonnet 5

https://platform.claude.com/docs/en/about-claude/models/whats-new-sonnet-5
1•tosh•7m ago•0 comments

Is DentaBiome Legit? Full Oral Postbiotic Review 2026

https://gamma.app/embed/Is-DentaBiome-Legit-Full-Oral-Postbiotic-Review-2026-ngcts4rx0fndxw5?mode...
1•prepostseo•10m ago•0 comments

Accessible Math in PDF [video]

https://www.youtube.com/watch?v=yb5QElBAr-Q
1•anewhnaccount2•20m ago•0 comments

The LLVM Compiler Infrastructure

https://cacm.acm.org/federal-funding-of-academic-research/the-llvm-compiler-infrastructure/
1•tosh•21m ago•0 comments

Open source project for evaluating two models on specific task

1•mzubairtahir•25m ago•0 comments

Mir Little Mathematics Library

https://mirtitles.org/2011/06/02/little-mathematics-library/
1•vismit2000•25m ago•0 comments

Show HN: On99 – a no-signup Hong Kong Mark Six checker and stats explorer

https://on99.life/en/lottery
1•alex_foolsmart•29m ago•0 comments

Godot bans "vibe-coded" pull requests

https://theguptalog.blogspot.com/2026/07/godot-bans-ai-generated-code.html
2•guptalog•31m ago•2 comments

Show HN: Much – Local-first AI workspace with in-browser Python (WASM) sandbox

1•srinivasthalada•32m ago•0 comments

The bottleneck might be the air in the room

https://blog.mikebowler.ca/2026/07/03/co2-and-decision-making/
13•gslin•32m ago•1 comments

Borrowed Confidence

https://zhavaedhaemaed.substack.com/p/borrowed-confidence
1•vismit2000•37m ago•0 comments

The feature in OxCaml that more languages should steal

https://theconsensus.dev/p/2026/06/27/the-feature-in-oxcaml-more-languages-should-steal.html
1•tosh•39m ago•0 comments

Now at $50M: Ro Khanna "Why I Support a Billionaire Wealth Tax"

https://rokhannausa.substack.com/p/why-i-support-a-billionaire-wealth
2•g42gregory•43m ago•0 comments

OpenClaw just launched an official app for iPhone

https://9to5mac.com/2026/06/29/openclaw-just-launched-an-official-app-for-iphone-details-here/
1•TMWNN•44m ago•0 comments

I Helped Fact-Check the 1619 Project. The Times Ignored Me. (2020)

https://www.politico.com/news/magazine/2020/03/06/1619-project-new-york-times-mistake-122248
1•Tomte•44m ago•0 comments

Help Us Save MeshCore

https://blog.meshcore.io/2026/07/04/help-us-save-meshcore
1•ilreb•46m ago•0 comments

Here is how to change an AGODA ticket

https://drive.google.com/file/d/1jtjTxNKMww-bAiMbr1SqRwrP7Kwn5wk8/view?usp=drivesdk
1•akupadamu•48m ago•0 comments

Bloomberg Terminal Is Ugly and Clunky–Everyone Still Uses It

https://oztalking.com/en/issues/bloomberg-terminal-lock-in
10•haebom•57m ago•2 comments

Alibaba bans Claude Code as a security risk

https://www.scmp.com/tech/big-tech/article/3359375/alibaba-bans-staff-using-claude-code-over-anth...
2•5701652400•1h ago•0 comments

Scientists say they have built a cell from scratch for the first time

https://www.cnn.com/2026/07/01/science/synthetic-cell-research
1•giuliomagnifico•1h ago•0 comments

What Happens When Your Site Goes Down?

https://urlwatch.io/
2•mssblogs•1h ago•0 comments

How AI models would vote in Sweden

https://www.nordan.ai/research/which-swedish-party-do-llms-vote-for
2•urvader•1h ago•1 comments

California Bans 'Sell by' Labels, Hoping to Cut Food Waste

https://www.nytimes.com/2026/07/02/us/california-food-labels-sell-by.html
1•thunderbong•1h ago•1 comments

Movies I've watched. (623 recorded and counting)

https://artconnects.club/u/bora/movies
2•kaizenb•1h ago•3 comments