news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/

3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago

Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

37signals' ONCE: platform for installing and managing Docker-based web apps

https://github.com/basecamp/once

1•jschulenklopper•41s ago•1 comments

Show HN: World Library in AI

https://spacefrontiers.org/mcp

1•pasha_sf•1m ago•0 comments

Self-Hosted Website Analytics with SQLite

https://github.com/kaichaosun/swa

1•kaichao•1m ago•0 comments

Tacit Knowledge Videos on Every Subject

https://www.lesswrong.com/posts/SXJGSPeQWbACveJhs/the-best-tacit-knowledge-videos-on-every-subject

1•sebg•2m ago•0 comments

Show HN: Jekyll VitePress – VitePress-Style Docs for Jekyll

https://jekyll-vitepress.dev/

1•earcar•2m ago•0 comments

Gemini CLI Shake-Up: Abuse Controls and Traffic Priorities

https://github.com/google-gemini/gemini-cli/discussions/22970

1•anhldbk•3m ago•0 comments

Show HN: PlanWiki – Open-source platform for product teams and agents to execute

https://github.com/planwiki/planwiki-app

1•KupaMakunura•4m ago•0 comments

The Slow Work of Making Sense of History

https://arbesman.substack.com/p/the-slow-work-of-making-sense-of

1•arbesman•8m ago•0 comments

Vape packaging will be forced to grow up

https://www.designweek.co.uk/how-vape-packaging-will-be-forced-to-grow-up/

1•taubek•9m ago•0 comments

Atoms of Thought

https://github.com/freyzo/AOT-SKILLS

1•leo_agent•10m ago•0 comments

Email to registered companies about the WebFiling security issue

https://www.gov.uk/government/publications/email-to-registered-companies-about-the-webfiling-secu...

1•sarusso•10m ago•1 comments

Remote Pre-Auth Buffer Overflow in GNU Inetutils Telnetd

https://lists.gnu.org/archive/html/bug-inetutils/2026-03/msg00031.html

1•campuscodi•12m ago•0 comments

GLP-1 diabetes drugs could stop anxiety and depression worsening, study finds

https://www.theguardian.com/science/2026/mar/18/glp-1-type-2-diabetes-drugs-semaglutide-anxiety-d...

1•giuliomagnifico•13m ago•0 comments

Elon Musk's X cooperates with €120M EU fine

https://www.politico.eu/article/elon-musk-x-eu-120m-fine-commission/

1•1vuio0pswjnm7•14m ago•0 comments

Oil Hits $119. Gas Prices Jump After Escalation

https://www.bloomberg.com/news/articles/2026-03-19/oil-and-gas-prices-jump-as-strikes-on-gulf-fac...

2•master_crab•14m ago•0 comments

An Argument for Logging Off

https://aaronfrancis.com/2024/an-argument-for-logging-off-9a4de45b

1•cl3misch•14m ago•0 comments

Slop Machines

https://vladinator.net/blog/slop-machines/

1•vladinator1001•16m ago•0 comments

Cindy Cohn on privacy battles old and new

https://lwn.net/SubscriberLink/1061979/cf283eea1bee9a59/

1•smitty1e•17m ago•0 comments

Show HN: Lognorth self-hosted errors and logs on a $5 VPS

1•karloscodes•18m ago•0 comments

Show HN: Reqlog – live HTTP dashboard for Node.js and Go

https://github.com/FirasLatrech/reqlog

1•firaslatrach•22m ago•0 comments

2% of ICML papers desk rejected because the authors used LLM in their reviews

https://blog.icml.cc/2026/03/18/on-violations-of-llm-review-policies/

28•sergdigon•22m ago•8 comments

Built-in VPN coming to Firefox 149

https://blog.mozilla.org/en/firefox/firefox-148-149-new-features/

1•campuscodi•22m ago•0 comments

PeerClaw – Decentralized P2P AI Agent Network in a Single Binary

https://github.com/antonellof/peerclaw

1•peerclaw•24m ago•0 comments

Terror Camp Clear

https://niche-canada.org/2025/12/01/terror-camp-clear/

1•aa_is_op•25m ago•0 comments

Software Engineering – A Roast

https://yagnipedia.com/wiki/software-engineering

1•riclib•27m ago•1 comments

The State of Docs 2026 (GitBook)

https://www.stateofdocs.com/2026

1•armcat•29m ago•0 comments

Nanopositioning Metrology, Gödel, and Bootstraps

https://www.pi-usa.us/en/tech-blog/nanopositioning-metrology-goedel-and-bootstraps

2•nill0•30m ago•0 comments

Can We Make Simpler Software with LLMs?

https://www.karl.berlin/simplicity-by-llm.html

1•karl42•33m ago•0 comments

Claude Cowork Dispatch: Anthropic's Answer to OpenClaw

https://www.latent.space/p/ainews-claude-cowork-dispatch-anthropics

1•thoughtpeddler•34m ago•0 comments

Beta testers wanted for hugpoint.io v2 – fair meeting point finder

https://hugpoint.io/v2

1•prunax•35m ago•0 comments