frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Rat King

https://en.wikipedia.org/wiki/Rat_king
1•fittingopposite•3m ago•0 comments

Physical Reality as Hypermedia

https://paper.supernovalabs.co.uk
1•supernovalabs•5m ago•0 comments

Lindley's Paradox

https://en.wikipedia.org/wiki/Lindley%27s_paradox
2•mschnell•14m ago•0 comments

Predicting home electricity usage from historical patterns in Home Assistant

https://blog.cyplo.dev/posts/2026/03/load-prediction-in-home-assistant/
1•swq115•16m ago•0 comments

I made a GPU price tracker

https://gpusniper.com/
1•codingblink•19m ago•1 comments

HopTab–free,open source macOS app switcher and tiler that replaces Cmd+Tab

https://www.royalbhati.com/hoptab
2•robhati•21m ago•0 comments

We built Avancé Communicatie (digital services for Dutch companies)

https://www.avancecommunicatie.nl/
2•bullmeister•25m ago•0 comments

Why do we need apps like cursor?

1•amanhij•26m ago•0 comments

Ask HN: Top repos you'd want offline on a desert island?

2•quijoteuniv•27m ago•2 comments

Computer Networks: A Systems Approach

https://open-cloud.github.io/index.html
1•vismit2000•33m ago•0 comments

Kattis Problem Archive

https://open.kattis.com
1•vismit2000•35m ago•0 comments

Show HN: Helios – 3 Claude agents (Red vs. Blue) hack and patch your codebase

https://gitlab.com/nakaiwilliams20/helios
2•nakaiwilliams•37m ago•0 comments

Synaphe – A type-safe language for hybrid AI and quantum computing

https://github.com/martus-spinther/synaphe-project
2•martus-spinther•45m ago•0 comments

Mindwtr – Open-source, local-first GTD app (Tauri and React Native)

https://github.com/dongdongbh/Mindwtr
1•dongdongbh•47m ago•0 comments

Quantum mechanics simulation Python library for research and learning

https://github.com/iDEA-org/iDEA
1•jw1294•48m ago•1 comments

Proof Theory and Logic Programming

https://www.lix.polytechnique.fr/Labo/Dale.Miller/ptlp/
1•remywang•49m ago•0 comments

Tell HN: Microsoft365 "Convert to Paid" checkout silently default to 25 licenses

3•davidstarkjava•53m ago•1 comments

Show HN: Passport Globe (See where your passport takes you)

https://hariharan.uno/globe
1•hariharan_uno•1h ago•0 comments

Show HN: TMA1 – Local-first observability for LLM agents

https://tma1.ai/
2•killme2008•1h ago•0 comments

Show HN: Yeet – Throw AI tasks at hardware and walk away (Nomad and OpenShell)

https://github.com/wan0net/yeet
1•wan0net•1h ago•0 comments

Phase Transitions and Computation

https://theory.org/complexity/cdpt/html/node5.html
1•downboots•1h ago•0 comments

Show HN: Banish: A declarative framework for rule-based state machines in Rust

https://github.com/LoganFlaherty/banish/releases/tag/v1.3.0
1•LoganFlaherty•1h ago•0 comments

Bitcoin mining difficulty drops 7.8% as miner exodus accelerates amid AI pivot

https://www.theblock.co/post/394579/bitcoin-mining-difficulty-drops-7-8-as-miner-exodus-accelerat...
4•adrianwaj•1h ago•1 comments

Review: Why Evolution Is True

https://ncse.ngo/review-why-evolution-true
2•akbarnama•1h ago•0 comments

We Read What Delve Ships to the Browser

https://security.redeux.ai/research/delve-compliance-posture
1•chasewarren•1h ago•0 comments

Isometric exercise: The most efficient fitness regime?

https://www.bbc.com/future/article/20260319-isometric-exercise-the-most-efficient-fitness-regime
2•akbarnama•1h ago•0 comments

A Rant about Resolutions

https://blog.brixit.nl/rant-about-resolutions/
1•vinhnx•1h ago•0 comments

Is Simple Good?

https://darth.games/posts/is-simple-good/
1•vinhnx•1h ago•0 comments

Delve Accused of Fraud

https://techcrunch.com/2026/03/21/delve-accused-of-misleading-customers-with-fake-compliance/
5•zlu•1h ago•0 comments

Sashiko: AI code review system for the Linux kernel spots bugs humans miss

https://www.theregister.com/2026/03/20/sashiko_code_review_linux/
2•maxloh•1h ago•0 comments