Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
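
The workflow described above (plan, execute, test, self-correct) can be sketched as a small tool-dispatch loop. This is only an illustration under stated assumptions, not Refact.ai's actual code: the `AgentRun` class, the `explore`/`patch`/`test` tool names, and the scripted policy that stands in for the orchestrator model are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Step = Tuple[str, str, str]  # (tool name, argument, result)


@dataclass
class AgentRun:
    """Hypothetical single-run agent: one loop, one final solution."""
    tools: Dict[str, Callable[[str], str]]
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self, task: str, choose: Callable[[str, List[Step]], Tuple[str, str]]) -> str:
        # Ask the orchestrator for the next action, run the tool, record the result.
        for _ in range(self.max_steps):
            tool_name, arg = choose(task, self.history)
            if tool_name == "finish":
                return arg
            result = self.tools[tool_name](arg)
            self.history.append((tool_name, arg, result))
        raise RuntimeError("step budget exhausted without a solution")


# Toy tools (assumptions): the "repository" is a dict, and tests pass only for attempt-2.
repo = {"patch": ""}


def explore(query: str) -> str:
    return f"files relevant to: {query}"


def apply_patch(name: str) -> str:
    repo["patch"] = name
    return "applied"


def run_tests(_: str) -> str:
    return "pass" if repo["patch"] == "attempt-2" else "fail"


def scripted_policy(task: str, history: List[Step]) -> Tuple[str, str]:
    # Stand-in for the orchestrator model: explore, patch, test, retry on failure.
    if not history:
        return ("explore", task)
    last_tool, _, last_result = history[-1]
    patches = [step for step in history if step[0] == "patch"]
    if last_tool == "explore":
        return ("patch", "attempt-1")
    if last_tool == "patch":
        return ("test", "")
    if last_result == "fail":
        return ("patch", f"attempt-{len(patches) + 1}")
    return ("finish", patches[-1][1])


agent = AgentRun(tools={"explore": explore, "patch": apply_patch, "test": run_tests})
solution = agent.run("fix failing date parser", scripted_policy)
print(solution)  # prints: attempt-2
```

In this toy run the first patch fails its tests, so the loop patches again and finishes only once the tests pass, which mirrors the "one multi-step run, single solution" idea in the comment.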

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

All of the String Types

https://lambdalemon.gay/posts/string-types
1•j03b•2m ago•0 comments

British computer scientist denies he is Bitcoin developer Satoshi Nakamoto

https://www.theguardian.com/technology/2026/apr/08/british-computer-scientist-adam-back-denies-he...
1•4ndrewl•3m ago•0 comments

Thousands of consumer routers hacked by Russia's military

https://arstechnica.com/security/2026/04/russias-military-hacks-thousands-of-consumer-routers-to-...
1•choult•3m ago•0 comments

Show HN: I built an open source multi-agent harness in Go

https://github.com/AndreBaltazar8/artificial
1•AndreBaltazar•4m ago•0 comments

Metrics SQL: A SQL-based semantic layer for humans and agents

https://www.rilldata.com/blog/introducing-metrics-sql-a-sql-based-semantic-layer-for-humans-and-a...
1•tanelpoder•6m ago•0 comments

Show HN: Context-Aware Twitch Moderation

https://modcheck.app
2•WalkingFridge•8m ago•0 comments

OpenAI Codex Moves to API Usage-Based Pricing for All Users

https://startupfortune.com/openai-codex-moves-to-api-usage-based-pricing-for-all-users/
2•wheelerwj•12m ago•1 comment

New Anthropic model is too dangerous to release publicly

https://www.nbcnews.com/tech/security/anthropic-project-glasswing-mythos-preview-claude-gets-limi...
2•hackyhacky•13m ago•0 comments

Gemma 4 E4B vs. Gemma Family: Enterprise Benchmark Across 8 Task Suites

https://aiexplorer-blog.vercel.app/post/gemma-4-e4b-enterprise-benchmark
2•mailharishin•13m ago•0 comments

Why Meter Is Not Essential to Poetry

https://oliviamarstall.substack.com/p/why-meter-is-not-essential-to-poetry
3•frogulis•14m ago•0 comments

Why Use Meter in Poetry?

https://robertcharboneau.substack.com/p/why-use-meter-in-poetry
3•frogulis•15m ago•0 comments

Major Polish crypto exchange presumed insolvent

https://www.money.pl/gospodarka/wielki-problem-zondacrypto-klienci-sie-skarza-mamy-niepokojaca-an...
2•nathell•16m ago•0 comments

The Bar Hostess of Ginza

https://cinemasojourns.com/2026/04/08/the-bar-hostess-of-ginza/
2•jjgreen•16m ago•0 comments

Show HN: Dis: Dev environments but without Node.js

https://github.com/candacelabs/dis/tree/main
4•kaashmonee•18m ago•0 comments

Laurie Kirk proved your RAM lies to you. So we built Palantir at home

https://github.com/seppulcro/phantom
2•seppulcro•18m ago•1 comment

Ask HN: We got 100 stars. A weekend project got 12k. What are we missing?

2•lilwing•18m ago•3 comments

Ask HN: A CLI to control what AI code can (and can't) change in your repo

3•sujitjaunjal•22m ago•0 comments

Principles of Mechanical Sympathy

https://martinfowler.com/articles/mechanical-sympathy-principles.html
2•emschwartz•28m ago•0 comments

Taco: Linktaco from Your Terminal

https://linktaco.com/blog/taco-cli-tool-linktaco.html
2•peterjsanchez•28m ago•0 comments

Show HN: Tired of logic in useEffect, I built a class-based React state manager

https://thales.me/posts/why-i-built-snapstate/
1•thalesfp•30m ago•0 comments

Hsihp Hsims

https://fromouterstace.com
1•stacydbradford•30m ago•0 comments

Prisoner's Dilemma – An Extension (from game theory)

https://pd.luthira.com/
2•llamatheollama•32m ago•0 comments

Resurrecting a 1992 MUD with Agentic AI

https://meditations.metavert.io/p/resurrecting-a-1992-mud-with-agentic
3•salt4034•33m ago•0 comments

Once 'Ultra MAGA', Trump Supporters Fume About Iran on Truth Social

https://www.nytimes.com/2026/04/08/us/politics/trump-truth-social-iran.html
5•doener•33m ago•0 comments

Volatco – Powerful Multicompute

https://volatco.github.io/
1•nbaksalyar•38m ago•0 comments

Show HN: Linggen – Open-source AI agent with P2P remote access from your phone

https://linggen.dev/
2•linggen•40m ago•0 comments

How to Connect to Builders in Web3

https://woof.software/
1•Pasha-CEO•41m ago•1 comment

Google engineer rejected by colleges uses AI to sue for racial discrimination

https://abc7news.com/post/google-engineer-rejected-colleges-uses-ai-sue-ucs-other-universities-ra...
1•ubasu•42m ago•0 comments

AI Did It in 12 Minutes. It Took Me 10 Hours to Fix It

https://idiallo.com/blog/it-took-me-10-hours-to-fix-ai-code
3•firefoxd•47m ago•2 comments

Show HN: Is Hormuz open yet?

https://www.ishormuzopenyet.com/
105•anonfunction•50m ago•41 comments