
Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•12mo ago

Comments

kate_at_refact•12mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
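
To make the plan, execute, test, self-correct cycle concrete, here is a minimal Python sketch of such a loop. It is not Refact.ai's implementation: `run_agent`, `toy_propose`, and `toy_tests` are hypothetical names invented for illustration, and the "model" is a hard-coded stub that stands in for the orchestrator and its tools.

```python
from typing import Callable, List, Tuple

# Hypothetical stand-ins: none of these names come from the Refact.ai codebase.
ProposeFn = Callable[[str, List[str]], str]
TestFn = Callable[[str], Tuple[bool, str]]


def run_agent(task: str, propose_patch: ProposeFn, run_tests: TestFn,
              max_steps: int = 5) -> Tuple[str, List[str]]:
    """Iterate propose -> test -> self-correct until tests pass or steps run out."""
    feedback: List[str] = []
    for step in range(1, max_steps + 1):
        patch = propose_patch(task, feedback)      # plan and execute
        ok, report = run_tests(patch)              # test
        if ok:
            return patch, feedback                 # deliver a single solution
        feedback.append(f"step {step}: {report}")  # feed the failure back in
    raise RuntimeError("no passing patch within the step budget")


def toy_propose(task: str, feedback: List[str]) -> str:
    # Stub "model": the first attempt is wrong, the retry after feedback is right.
    return "return a + b" if feedback else "return a - b"


def toy_tests(patch: str) -> Tuple[bool, str]:
    # Build add() from the proposed function body and check one case.
    scope: dict = {}
    exec(f"def add(a, b):\n    {patch}", scope)
    passed = scope["add"](2, 3) == 5
    return passed, "ok" if passed else "add(2, 3) != 5"


patch, history = run_agent("fix add()", toy_propose, toy_tests)
print(patch)
print(history)
```

In this toy run the first patch fails its test, the failure message is appended to the feedback list, and the second attempt passes, so the loop returns one final patch together with the history of failed attempts.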

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try the Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
