
Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
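
For readers who want a concrete picture of a plan / execute / test / self-correct loop, here is a minimal sketch. It is not Refact.ai's code: `run_agent`, `scripted_policy`, `TOOLS`, and the toy `add` bug are all hypothetical names made up for illustration, and a hard-coded policy stands in for the orchestrator model, so no LLM is called.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical tool signature: (shared state, argument) -> observation text.
Tool = Callable[[dict, str], str]


@dataclass
class AgentRun:
    steps: List[Tuple[str, str]] = field(default_factory=list)  # (tool, observation)
    solved: bool = False


def explore(state: dict, _arg: str) -> str:
    """Repository exploration stand-in: just return the current source."""
    return state["code"]


def edit(state: dict, new_code: str) -> str:
    """Code modification stand-in: overwrite the source with a patch."""
    state["code"] = new_code
    return "patched"


def run_tests(state: dict, _arg: str) -> str:
    """Testing stand-in: execute the source and check one toy assertion."""
    namespace: dict = {}
    exec(state["code"], namespace)
    ok = namespace["add"](2, 3) == 5
    state["tests_pass"] = ok
    return "pass" if ok else "fail"


TOOLS: Dict[str, Tool] = {"explore": explore, "edit": edit, "test": run_tests}


def scripted_policy(state: dict, history: List[Tuple[str, str]]) -> Tuple[str, str]:
    """Stand-in for the orchestrator: pick the next tool from the last observation."""
    if not history:
        return ("explore", "")
    last_tool, last_obs = history[-1]
    if last_tool == "explore":
        return ("test", "")
    if last_tool == "test" and last_obs == "fail":
        return ("edit", "def add(a, b):\n    return a + b\n")
    if last_tool == "edit":
        return ("test", "")
    return ("done", "")


def run_agent(policy, tools: Dict[str, Tool], state: dict, max_steps: int = 10) -> AgentRun:
    """One multi-step run: call tools until the policy says 'done' or steps run out."""
    run = AgentRun()
    for _ in range(max_steps):
        tool_name, arg = policy(state, run.steps)
        if tool_name == "done":
            run.solved = state.get("tests_pass", False)
            break
        run.steps.append((tool_name, tools[tool_name](state, arg)))
    return run


state = {"code": "def add(a, b):\n    return a - b\n"}  # toy bug to fix
result = run_agent(scripted_policy, TOOLS, state)
print(result.solved, [name for name, _ in result.steps])
# prints: True ['explore', 'test', 'edit', 'test']
```

The point of the sketch is only the shape of the loop: each step's observation feeds the next decision, and the run ends with a single candidate solution that has passed its own test.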

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
