Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
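
For readers who want a concrete picture of the plan, execute, test, self-correct loop described above, here is a minimal sketch. It is not Refact.ai's actual code: `run_agent`, `choose_tool`, `AgentRun`, the tool names, and the toy "second patch passes" rule are all hypothetical, and a hand-written policy stands in for the LLM orchestrator.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical tool signature: reads/updates the working state, returns an observation.
Tool = Callable[[Dict[str, str]], str]


@dataclass
class AgentRun:
    """Record of one multi-step run (illustrative only)."""
    steps: List[str] = field(default_factory=list)
    solved: bool = False


def run_agent(choose_tool: Callable[[Dict[str, str], List[str]], str],
              tools: Dict[str, Tool],
              max_steps: int = 10) -> AgentRun:
    """Let the orchestrator pick one tool per step until the tests pass."""
    state: Dict[str, str] = {}
    run = AgentRun()
    for _ in range(max_steps):
        name = choose_tool(state, run.steps)  # tool chosen dynamically per step
        observation = tools[name](state)
        run.steps.append(f"{name}: {observation}")
        if name == "run_tests" and observation == "pass":
            run.solved = True  # a single solution is delivered; stop iterating
            break
    return run


def demo() -> AgentRun:
    """Toy task: the first patch fails the tests, the second one passes."""
    def explore(state: Dict[str, str]) -> str:
        state["target"] = "calc.py"  # made-up file name
        return "found calc.py"

    def edit(state: Dict[str, str]) -> str:
        state["attempts"] = str(int(state.get("attempts", "0")) + 1)
        return f"patch attempt {state['attempts']}"

    def run_tests(state: Dict[str, str]) -> str:
        return "pass" if int(state.get("attempts", "0")) >= 2 else "fail"

    def choose_tool(state: Dict[str, str], steps: List[str]) -> str:
        # Stand-in for the orchestrator model: explore once, then alternate
        # between editing and testing (self-correcting after a failed test).
        if "target" not in state:
            return "explore"
        if not steps[-1].startswith("edit"):
            return "edit"
        return "run_tests"

    tools = {"explore": explore, "edit": edit, "run_tests": run_tests}
    return run_agent(choose_tool, tools)
```

Calling `demo()` walks through explore, edit, test, edit, test and stops once the toy test reports a pass. In a real agent the policy would be a model call and the tools would touch an actual repository.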

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

149M Infostealer Credentials Exposed in Unsecured Database

https://thecyberedition.com/149m-infostealer-credentials-exposed-in-unsecured-database/
1•thehacknews•56s ago•1 comments

What If We Took Message-Passing Seriously?

https://worksonmymachine.ai/p/what-if-we-took-message-passing-seriously
1•thunderbong•2m ago•0 comments

Because Coordination Is Expensive

https://surfingcomplexity.blog/2026/01/24/because-coordination-is-expensive/
1•azhenley•9m ago•0 comments

Palantir has no place in UK public services

https://www.opendemocracy.net/en/zarah-sutlana-palantir-no-place-uk-public-services-ministry-of-d...
8•jethronethro•10m ago•0 comments

Canon and Compass into <Si Wu Zi> Scripture

https://jimiwen.substack.com/p/si-wu-zi
1•jimiwen•11m ago•0 comments

Eightfold AI sued for job candidate reports without their consent

https://www.hrdive.com/news/eightfold-ai-lawsuit-job-candidate-consumer-reports/810332/
1•felineflock•16m ago•0 comments

Debugging a "weird SIGSEGV" core dump with Codex and GDB inside Docker

https://medium.com/@haohang.shi/codex-in-docker-debugging-from-a-weird-sigsegv-core-dump-to-a-rea...
1•Sep142324•17m ago•1 comments

The Great Escape and Stalag Luft III

https://www.youtube.com/watch?v=D8nt3dt8vT4
1•fallinditch•17m ago•0 comments

Trump Family's $6.8B Fortune Is Increasingly Tied to Crypto

https://www.bloomberg.com/news/features/2026-01-20/donald-trump-family-net-worth-increasingly-com...
7•zerosizedweasle•21m ago•3 comments

Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Models

https://bytedance-seed.github.io/Stable-DiffCoder/
1•akshayt•25m ago•0 comments

TikTok is officially US-owned for American users, here's what's changing

https://9to5mac.com/2026/01/23/tiktok-is-officially-us-owned-for-american-users-heres-whats-chang...
7•WaitWaitWha•26m ago•0 comments

Training Medical AI to Think Like a Doctor [video]

https://www.youtube.com/watch?v=XaxGcb9qvTw
1•ssunboyy•29m ago•0 comments

Ask HN: A good Model to choose in Ollama to run on Claude Code

1•sujayk_33•29m ago•0 comments

Ercot Dashboard

https://www.ercot.com/gridmktinfo/dashboards
2•RyanShook•32m ago•0 comments

Show HN: Snowfall Progress

https://www.kmjn.org/snow/
4•mjn•35m ago•1 comments

Patterns, Predictions, and Actions – A story about machine learning [pdf]

https://mlstory.org/pdf/patterns.pdf
1•vinhnx•36m ago•0 comments

The Possessed Machines

https://possessedmachines.com/
1•laurex•37m ago•0 comments

Elixir: Gradual Set-Theoretic Types

https://hexdocs.pm/elixir/main/gradual-set-theoretic-types.html
1•sergiotapia•50m ago•0 comments

Show HN: EchoFluid – a video transcription without uploading full videos

https://www.echofluid.com
1•samwulol•55m ago•0 comments

ICE Launches "Operation Catch-of-the-Day" Across Maine

https://www.dhs.gov/news/2026/01/21/ice-launches-operation-catch-day-targeting-worst-worst-crimin...
10•SanjayMehta•55m ago•1 comments

Show HN: AsyncReview – An OSS DevinReview Using RLMs

https://github.com/AsyncFuncAI/AsyncReview
1•sashimikun•1h ago•0 comments

Why is it legal and ok when big corp. straight tries to scam a user?

40•kingleopold•1h ago•11 comments

Read Bondi's Letter to Minnesota's Governor

https://www.nytimes.com/interactive/2026/01/24/us/pam-bondi-walz-doc.html
8•duxup•1h ago•0 comments

Skijoring is taking travellers off Colorado's ski-resort circuit

https://www.bbc.com/travel/article/20260121-skijoring-colorados-wildest-winter-sport
1•1659447091•1h ago•0 comments

ICE Is Not Like the Brownshirts, the Brownshirts Identified Themselves

https://www.mcsweeneys.net/articles/ice-is-nothing-like-the-brownshirts-because-the-brownshirts-a...
16•bjourne•1h ago•0 comments

The Responsibility of Intellectuals (1967)

https://www.nybooks.com/articles/1967/02/23/a-special-supplement-the-responsibility-of-intelle/
26•andsoitis•1h ago•11 comments

US SEC: India refused to serve summons to Adani Group under Hague Convention

https://economictimes.indiatimes.com/news/company/corporate-trends/us-regulator-seeks-to-bypass-i...
2•alok-g•1h ago•1 comments

Alex Honnold completes Taipei 101 skyscraper climb without ropes or safety net

https://www.cnn.com/sport/live-news/taiwan-alex-honnold-climb-taipei-101-01-25-26-intl-hnk
56•keepamovin•1h ago•30 comments

The Bay Area Has Lost Its Soul. No Wonder Everyone's Leaving [video]

https://www.youtube.com/watch?v=9_bsaF5TTEE
1•keepamovin•1h ago•1 comments

Prototyping NixOS with FDE, tang and Podman using Claude.AI

https://claude.ai/public/artifacts/e7cdb55a-3960-4dc9-bd1a-3c73a3145fb9
2•cptnslow•1h ago•1 comments