frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Craigslist Charitable Fund

https://www.craigslistfund.org
1•Yctg•3m ago•0 comments

JGuard v0.4.0 – Capability-based security for the JVM (post-SecurityManager)

https://github.com/jguard-io/jguard
1•nknize•5m ago•0 comments

Quantum Computing Expert Explains One Concept in 5 Levels of Difficulty [video]

https://www.youtube.com/watch?v=OWJCfOvochA
1•gmays•5m ago•0 comments

Apple TV 12% market share, reaping benefits of stale content

https://appleinsider.com/articles/26/04/20/apple-tv-reaping-the-benefits-of-stale-content-on-bigg...
1•mgh2•7m ago•0 comments

NCEES discontinuing PE Software Engineering exam (2019)

https://ncees.org/ncees-discontinuing-pe-software-engineering-exam/
1•consumer451•15m ago•0 comments

AI for the Real World: A Conversation with Yann LeCun

https://twitter.com/AnneliesGamble/status/2054219457451733382
2•gmays•15m ago•0 comments

The Evolution of Team Appwrite

https://appwrite.io/blog/post/the-evolution-of-team-appwrite
1•gauravmeena95•16m ago•0 comments

Production Is a Compiler Input

https://aicoding.leaflet.pub/3mjx4erlboc2l
1•ankitg12•19m ago•0 comments

Two computers, one monitor, zero fiddling – Alex Plescan

https://alexplescan.com/posts/2025/08/16/kvm/
1•ankitg12•23m ago•0 comments

New Issue Tracker

https://lightningtrack.io/login
1•garyeterry•23m ago•0 comments

Terence Tao: My recollections on the early history of compressed sensing

https://mathstodon.xyz/@tao/114967650999562435
4•johnbarron•38m ago•2 comments

Used to manage a collection of AI workflows for a single vertical domain – Wasup

https://github.com/EdwardJoke/Wasup
1•EdwardXie•39m ago•1 comments

The IndieWeb Is Wonderfully Dionysian

https://brennan.day/the-indieweb-is-wonderfully-dionysian/
2•gm678•39m ago•0 comments

Fix pathological performance in trait solver

https://github.com/rust-lang/rust/pull/155355
2•Jyaif•48m ago•0 comments

Pinote – A lightweight floating Markdown scratchpad app

https://github.com/ImFeH2/pinote
2•indigodaddy•49m ago•0 comments

I built a machine that can make you rich with math [video]

https://www.youtube.com/watch?v=2UM4j1_xEs0
1•tzvc•50m ago•1 comments

Senior NIAID Official Indicted for Concealing Records During Covid Pandemic

https://www.justice.gov/opa/pr/former-senior-niaid-official-indicted-concealing-federal-records-d...
5•Jimmc414•52m ago•2 comments

YC startup Luel appears to have copied Kled

https://twitter.com/avipat_/status/2055384102409253056
3•tjek•55m ago•1 comments

Show HN: Nexa-Gauge – LLM eval framework, now with self-hosted model support

https://github.com/harnexa/nexa-gauge
1•Sardhendu•56m ago•0 comments

Ask HN: What happened to ssh-audit.com?

2•Bender•58m ago•0 comments

Show HN: Plan-Graph based code generation with LLMs

https://github.com/agrin96/VibegraphGenerator
1•ag_rin•1h ago•0 comments

Kinetic typography: the what, why, and how

https://www.linearity.io/blog/kinetic-typography/
1•argee•1h ago•0 comments

Symposia AI

https://www.trysymposiaai.com/landing
2•CarlosEdu•1h ago•1 comments

Solving CartPole in 8 Weights

https://cartpole.neocities.org/
4•georgehotz•1h ago•0 comments

Magical Realism: "Northern Exposure" 25 Years Later (2015)

https://www.rogerebert.com/streaming/magical-realism-nothern-exposure-25-years-later
2•walterbell•1h ago•0 comments

Show HN: Wyndup – share a live countdown with your podcast guest

https://wyndup.net
1•ardwino•1h ago•0 comments

Elastic Cloud on Kubernetes, simplified: zone awareness, restarts, and mTLS

https://www.elastic.co/search-labs/blog/elasticsearch-kubernetes-zone-awareness-restarts-mtls
2•eigenBasis•1h ago•0 comments

Jane Street's approach to AI adoption throughout their SDLC [video]

https://www.youtube.com/watch?v=rUYP4C29yCw
3•devdoshi•1h ago•1 comments

Brovan: Binary user-mode emulator for x86_64

https://github.com/AdvDebug/Brovan
2•AdvDebugy•1h ago•0 comments

WikiProject Editor Retention

https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Editor_Retention
1•sshh12•1h ago•1 comments