frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: PeriodicTableOfElements.org

https://periodictableofelements.org/?lang=en
1•nadermx•11m ago•0 comments

Social media is populist and polarising; AI may be the opposite

https://www.ft.com/content/3880176e-d3ac-4311-9052-fdfeaed56a0e
1•malloryerik•12m ago•1 comments

Show HN: Anamnesis – Open-source 4D strategic memory engine for AI agents

https://github.com/gayawellness/anamnesis
1•gayawellness•12m ago•0 comments

Pretext Demos

https://chenglou.me/pretext/
1•vinhnx•14m ago•0 comments

Alzheimer's disease mortality among taxi and ambulance drivers (2024)

https://www.bmj.com/content/387/bmj-2024-082194
2•bookofjoe•17m ago•0 comments

-

https://github.com/d0nk3yhm/pbix-mcp
2•d0nk3yhm•21m ago•0 comments

Translating non-trivial codebases with Claude

https://blog.danieljanus.pl/2026/03/26/claude-nlp/
1•vinhnx•25m ago•0 comments

Catching crumbs from the table by Ted Chiang (2000) [pdf]

https://gwern.net/doc/fiction/science-fiction/2000-chiang.pdf
2•sendes•27m ago•1 comments

The Opt Out Project

https://www.optoutproject.net/
3•billybuckwheat•29m ago•0 comments

BubbleWrap your dev env and agents

https://dpc.pw/posts/bubblewrap-your-dev-env-and-agents/
1•vinhnx•31m ago•0 comments

A simple explanation of the key idea behind TurboQuant

https://old.reddit.com/r/LocalLLaMA/comments/1s62g5v/a_simple_explanation_of_the_key_idea_behind/
1•thunderbong•35m ago•0 comments

IN Event of Moon Disaster [pdf]

https://www.archives.gov/files/presidential-libraries/events/centennials/nixon/images/exhibit/rn1...
1•interweb•38m ago•0 comments

Anthropic's Mythos leak: 3k files in a public CMS, and what the docs revealed

https://medium.com/ai-advances/anthropic-claude-mythos-leak-analysis-b77c1b304eb8
4•Aedelon•43m ago•0 comments

Git City – Your GitHub as a 3D City

https://www.thegitcity.com
2•fcoury•46m ago•0 comments

Seattle opens first light rail across floating bridge

https://www.fox13seattle.com/news/seattle-train-floating-bridge
2•whiskey-one•47m ago•0 comments

Ask HN: How are you keeping AI coding agents from burning money?

2•bhaviav100•48m ago•1 comments

What's Banned on Your Block?

https://www.strongtownschicago.org/whats-banned-on-your-block
1•animal_spirits•50m ago•0 comments

Motorola 88000

https://en.wikipedia.org/wiki/Motorola_88000
3•doener•51m ago•1 comments

We spent 2 hours working in the future

https://metr.org/notes/2026-03-19-org-uplift-game/
2•gmays•56m ago•0 comments

Dashboards Are Already Dead

https://joshsymonds.com/blog/dashboards-are-already-dead/
2•Veraticus•57m ago•1 comments

Liberate Your OpenClaw

https://huggingface.co/blog/liberate-your-openclaw
1•cezarvil•57m ago•0 comments

Milawa on Jitawa, a Verified Theorem Prover

http://lambda-the-ultimate.org/node/4464
1•poppingtonic•1h ago•0 comments

Codex Use Cases

https://developers.openai.com/codex/use-cases
1•AnhTho_FR•1h ago•0 comments

How Japan's Shiitake mushrooms fuel a $740M global Shiitake industry [video]

https://www.youtube.com/watch?v=XuJ5HsV8mlQ
1•teleforce•1h ago•0 comments

OpenClaw Is LangChain 2.0

https://justinflick.com/2026/03/28/openclaw-is-langchain-2.html
2•pamplemeese•1h ago•0 comments

Linux 7.0-Rc6 Bringing a Lot of Audio Quirks / Fixes

https://www.phoronix.com/news/Linux-7.0-rc6-Many-Audio-Fixes
2•Bender•1h ago•0 comments

Saudi Pipeline to Bypass Hormuz Hits 7M Barrel Goal

https://www.bloomberg.com/news/articles/2026-03-28/saudi-pipeline-that-bypasses-hormuz-hits-7-mil...
6•geox•1h ago•0 comments

Ghst – an experimental, full-featured CLI for managing Ghost CMS sites

https://github.com/TryGhost/ghst
1•Curiositry•1h ago•0 comments

AI adoption problem isn't tech debt

https://dheer.co/ai-adoption-operating-model/
1•bushido•1h ago•0 comments

Woman visiting ER for back pain shocked after doctor suggests euthanasia

https://www.dailymail.co.uk/news/article-15687817/woman-euthanasia-pain-doctor-offer.html
4•Bender•1h ago•0 comments