frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Justice Delayed Is Justice Denied

https://en.wikipedia.org/wiki/Justice_delayed_is_justice_denied
1•barrister•49s ago•0 comments

The Homepage of Ron Goodwin

http://rongoodwin.co.uk/
1•ocfnash•50s ago•0 comments

Time Is of the Essence

https://docs.eventsourcingdb.io/blog/2026/01/12/time-is-of-the-essence/
1•goloroden•1m ago•0 comments

Show HN: Home Design AI

https://homedesign-ai.net
1•zoooey•2m ago•0 comments

Cosmotechnics and AI: Reading Hamid Ismailov's We Computers

https://seanvoisen.com/writing/cosmotechnics-and-ai/
1•tobr•2m ago•0 comments

Universal Commerce Protocol (UCP)

https://developers.googleblog.com/en/under-the-hood-universal-commerce-protocol-ucp/
1•topper00_raptor•4m ago•0 comments

US Nightmare Propaganda

https://twitter.com/i/status/2010826442725056648
1•barrister•5m ago•0 comments

Vibe Coding Debt: The Security Risks of AI-Generated Codebases

https://instatunnel.my/blog/vibe-coding-debt-the-security-risks-of-ai-generated-codebases
2•birdculture•7m ago•0 comments

Even Linus Torvalds is vibe coding now

https://www.zdnet.com/article/linus-torvalds-vibe-coding-ai/
2•isaacfrond•8m ago•0 comments

Working with Ruby Threads

https://workingwithruby.com/wwrt/intro
2•gmac•9m ago•0 comments

The Day AI Defeated Google (As Its Own Owner)

https://ai-404.medium.com/the-day-ai-defeated-google-as-its-own-owner-2fc1372cd2cc
2•martinambrus•9m ago•0 comments

Operation Tailwind War Crime

https://en.wikipedia.org/wiki/Operation_Tailwind
2•barrister•10m ago•0 comments

macOS 26's Cut Corners

https://daringfireball.net/2026/01/resizing_windows_macos_26
3•7777777phil•13m ago•0 comments

Burroughs B21 / Convergent AWS Vintage Computer Restoration – Dr. Scott M. Baker

https://www.smbaker.com/burroughs-b21-convergent-aws-vintage-computer-restoration
2•rbanffy•14m ago•0 comments

My AI resources packed together

https://mind-sculptor-engine.lovable.app/
2•tvali•15m ago•1 comments

I asked Opus 4.5 to make a Rust implementation of PyNNDescent

https://twitter.com/leland_mcinnes/status/2009738982712627433
2•tomthe•17m ago•1 comments

The Foundation Every Design System Gets Wrong

https://www.designsystemscollective.com/spacing-systems-the-foundation-every-design-system-gets-w...
3•vednig•20m ago•0 comments

Klarna boss backs interest rate cap on credit cards

https://www.thetimes.com/business/companies-markets/article/klarna-boss-backs-trump-10-percent-in...
2•petethomas•22m ago•0 comments

Show HN: Oubli – Persistent fractal memory for Claude Code

https://github.com/dremok/oubli
2•dremok•26m ago•0 comments

Helping promote the Lax programming language

2•Mavox-ID•38m ago•3 comments

Show HN: Stove – Kotlin-first E2E testing for JVM Back end apps(Ktor,SpringBoot)

https://github.com/Trendyol/stove
1•osoykan•38m ago•0 comments

In Memoriam: The Academic Journal

https://ieeexplore.ieee.org/document/11134631
1•jruohonen•38m ago•0 comments

Agnostic library without code, only specs and tests

https://github.com/dbreunig/whenwords
1•nesk_•39m ago•0 comments

State of DataHaskell Q1 2026

https://www.datahaskell.org/blog/2026/01/12/state-of-datahaskell-q1-2026.html
4•todsacerdoti•42m ago•0 comments

Show HN: Shorta – analyze a YouTube Short → generate a storyboard → re-film

https://shorta.ai
1•eguitarz•44m ago•0 comments

Ask HN: Are you paying for AWS support, and is it worth the cost?

1•oriettaxx•49m ago•1 comments

Agent-browser by Vercel: Browser automation CLI for AI agents

https://github.com/vercel-labs/agent-browser
1•handfuloflight•49m ago•0 comments

Norway reaches 97% EV sales as EVs now outnumber diesels on its roads

https://electrek.co/2026/01/02/norway-reaches-97-ev-sales-as-evs-now-outnumber-diesels-on-its-roads/
3•smurda•50m ago•0 comments

/R/Atlanta Has New Mods: Here's What Happened

https://old.reddit.com/r/Atlanta/comments/1qbabii/ratlanta_has_new_mods_heres_what_happened/
3•echelon•51m ago•1 comments

How and for Whom Using Generative AI Affects Creativity: A Field Experiment

https://psycnet.apa.org/fulltext/2026-29702-001.html
1•EagnaIonat•52m ago•0 comments