frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

China is running multiple AI races

https://www.highcapacity.org/p/china-is-running-multiple-ai-races
1•KnuthIsGod•1m ago•0 comments

CAIveat Emptor: What You Tell AI Can and Will Be Used Against You

https://natlawreview.com/article/caiveat-emptor-what-you-tell-ai-can-and-will-be-used-against-you
2•petethomas•2m ago•0 comments

Delight is the only thing that's still rare

https://techstackups.com/articles/delight-is-the-only-thing-thats-still-rare/
1•ritzaco•11m ago•0 comments

After AI smashes the information barrier

https://www.lowimpactfruit.com/p/after-ai-smashes-the-information
1•mnky9800n•13m ago•0 comments

Free market research reports covering every tech sector

https://github.com/spinov001-art/ai-market-research-reports
1•aimarketintel•14m ago•0 comments

Show HN: Keynest – a simple offline secrets manager

https://github.com/capydev42/keynest
1•capydev42•14m ago•0 comments

Show HN: SEDManager – GUI Application for Setting Up Self-Encrypting Drives

https://github.com/petiaccja/sed-manager-rs
1•pregnenolone•16m ago•0 comments

Python is faster than Assembly for real [video]

https://www.youtube.com/watch?v=rD1wapiJPDk
1•artisandip7•17m ago•0 comments

M68K Interpreter Refactored with AI

https://gianlucarea.dev/blog/m68k-march26
1•aldino97•19m ago•0 comments

A gaming CEO asked ChatGPT how to avoid paying a $250M bonus

https://fortune.com/2026/03/17/krafton-subnautica-chatgpt-delaware-court-ruling-ceo-reinstated/
1•KnuthIsGod•21m ago•0 comments

STDWIN: a standard window system interface by Guido Van Rossum

https://ir.cwi.nl/pub/5998
1•rbanffy•22m ago•1 comments

FBI started buying Americans' location data again, Kash Patel confirms

https://arstechnica.com/tech-policy/2026/03/fbi-started-buying-americans-location-data-again-kash...
1•rbanffy•24m ago•2 comments

Being John Rawls

https://www.astralcodexten.com/p/being-john-rawls
1•jstanley•26m ago•0 comments

Chromostereopsis

https://en.wikipedia.org/wiki/Chromostereopsis
1•tosh•27m ago•0 comments

How Do LLMs Compute Verbal Confidence (DeepMind)

https://arxiv.org/abs/2603.17839
3•armcat•28m ago•0 comments

Washington Legislature Passes 9.9% Millionaire Tax; Awaits Governor Signature

https://www.imidaily.com/tax/washington-state-legislature-passes-9-9-millionaire-tax-bill-awaits-...
1•NewCzech•31m ago•0 comments

Free open source AI cost tracker – pip install tokenbudget

https://github.com/AIMasterLabs/tokenbudget
1•harshakgowda•31m ago•1 comments

Navia discloses data breach impacting 2.7M people

https://www.bleepingcomputer.com/news/security/navia-discloses-data-breach-impacting-27-million-p...
1•01-_-•32m ago•0 comments

YouTube is asking users if videos "feel like AI slop"

https://www.dexerto.com/youtube/youtube-is-asking-users-if-videos-feel-like-ai-slop-to-flag-low-q...
2•01-_-•34m ago•4 comments

Struggling to describe your AI aversion? Here's a glossary

https://www.theregister.com/2026/03/19/ai_skeptic_labels/
2•rbanffy•35m ago•0 comments

CPython: 36 Years of Source Code

https://blog.python.org/2026/03/cpython-codebase-growth/
2•lululpac•35m ago•0 comments

Atuin v18.13 – better search, a PTY proxy, and AI for your shell

https://blog.atuin.sh/atuin-v18-13/
1•wrxd•36m ago•0 comments

Self-hosted deployment platform, zero runtime dependencies

https://github.com/AmirSoleimani/openberth
1•Amirso•37m ago•0 comments

Cloud service providers ask EU regulator to reinstate VMware partner program

https://arstechnica.com/information-technology/2026/03/cloud-service-providers-ask-eu-regulator-t...
2•joozio•38m ago•0 comments

Roll TV

https://w.merkoba.com/roll/
1•serveitup•42m ago•0 comments

Ask HN: Which dead app or game do you wish someone would rebuild?

3•firef1y1203•46m ago•2 comments

Evaluating Genuine Reasoning in LLMs via Esoteric Programming Languages

https://arxiv.org/abs/2603.09678
1•kerneis•47m ago•1 comments

Study: ChatGPT, Claude, Gemini and Grok are all bad at crediting news outlets

https://www.niemanlab.org/2026/03/chatgpt-claude-gemini-and-grok-are-all-bad-at-crediting-news-ou...
1•giuliomagnifico•49m ago•0 comments

FastSafeStrings (safe, fast string library for C/C++)

https://github.com/clemcl/FastSafeStrings
2•Clemcl•52m ago•2 comments

Project Hail Mary

https://en.wikipedia.org/wiki/Project_Hail_Mary
1•tosh•55m ago•0 comments