Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
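
As a rough illustration of the plan, execute, test, self-correct loop described above, here is a minimal Python sketch. It is not the actual Refact.ai code: `run_agent`, `AgentState`, the toy tools, and `scripted_policy` are all hypothetical names made up for this example, and the scripted policy only stands in for the role an LLM orchestrator would play when choosing the next tool.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class AgentState:
    """Everything the loop remembers about one task (hypothetical structure)."""
    task: str
    history: List[str] = field(default_factory=list)
    patch: str = ""
    done: bool = False


# Hypothetical tool signature: reads/updates the state, returns an observation.
Tool = Callable[[AgentState], str]


def run_agent(task: str,
              choose_tool: Callable[[AgentState], str],
              tools: Dict[str, Tool],
              max_steps: int = 10) -> AgentState:
    """One multi-step run: pick a tool, record its result, repeat."""
    state = AgentState(task=task)
    for _ in range(max_steps):
        name = choose_tool(state)
        if name == "finish":
            state.done = True
            break
        observation = tools[name](state)
        state.history.append(f"{name}: {observation}")
    return state


# Toy tools standing in for repository exploration, code modification, testing.
def explore(state: AgentState) -> str:
    return "located suspect function"


def modify(state: AgentState) -> str:
    # Number the patch by how many modifications came before it.
    attempt = sum(h.startswith("modify") for h in state.history) + 1
    state.patch = f"attempt {attempt}"
    return state.patch


def run_tests(state: AgentState) -> str:
    # Pretend the first patch fails and the second one passes.
    return "pass" if state.patch == "attempt 2" else "fail"


def scripted_policy(state: AgentState) -> str:
    """Stand-in for the orchestrator: chooses the next tool from the history."""
    if not state.history:
        return "explore"
    last = state.history[-1]
    if last == "test: pass":
        return "finish"
    if last.startswith("modify"):
        return "test"
    return "modify"  # after exploring, or after a failed test (self-correction)


final = run_agent("fix failing unit test", scripted_policy,
                  {"explore": explore, "modify": modify, "test": run_tests})
print(final.done, final.patch, len(final.history))
```

In this toy run the loop explores, patches, sees a failing test, patches again, and stops once the test passes. The `max_steps` cap is an assumption added here so that a policy that never reaches "finish" still terminates.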

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
