Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
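For readers who want a concrete picture of the plan/execute/test/self-correct loop described above, here is a minimal sketch. It is not Refact.ai's actual code: every name in it (AgentState, choose_tool, the three tool functions, run_agent) is hypothetical, the orchestrator is a hard-coded rule rather than an LLM call, and the "tests" are simulated so that the first patch attempt fails and the second succeeds.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class AgentState:
    """Everything the (hypothetical) agent remembers during one run."""
    task: str
    history: List[Tuple[str, str]] = field(default_factory=list)  # (tool, output)
    patch_attempts: int = 0
    tests_pass: bool = False


# Hypothetical tools standing in for repository exploration,
# code modification, and testing.
def explore_repo(state: AgentState) -> str:
    return "located suspect code"


def apply_patch(state: AgentState) -> str:
    state.patch_attempts += 1
    return f"applied patch attempt {state.patch_attempts}"


def run_tests(state: AgentState) -> str:
    # Simulated outcome: the first patch fails, the second one passes.
    state.tests_pass = state.patch_attempts >= 2
    return "tests passed" if state.tests_pass else "tests failed"


TOOLS: Dict[str, Callable[[AgentState], str]] = {
    "explore_repo": explore_repo,
    "apply_patch": apply_patch,
    "run_tests": run_tests,
}


def choose_tool(state: AgentState) -> str:
    """Stand-in for the orchestrator; a real agent would ask a model here."""
    if not state.history:
        return "explore_repo"
    last_tool, _ = state.history[-1]
    if last_tool == "apply_patch":
        return "run_tests"
    # After exploring, or after a failed test run, try (another) patch.
    return "apply_patch"


def run_agent(task: str, max_steps: int = 10) -> AgentState:
    """One multi-step run that stops once the tests pass or the budget ends."""
    state = AgentState(task)
    for _ in range(max_steps):
        tool_name = choose_tool(state)
        output = TOOLS[tool_name](state)
        state.history.append((tool_name, output))
        if state.tests_pass:
            break
    return state


if __name__ == "__main__":
    final = run_agent("fix the failing issue")
    for tool_name, output in final.history:
        print(f"{tool_name}: {output}")
```

The point of the sketch is only the control flow: tools are picked step by step based on what has happened so far, and a failed test run feeds back into another modification attempt within the same run.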

Technical details of our SWE-bench approach are in the blog post: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Show HN: HackerNewsDelta Is a Clone of HN Powered by Exponential (CMS)

https://hackernewsdelta.com/
1•7x•4m ago•0 comments

SafeVPN–Let people remote in to fix your servers, but prevent data exfiltration

https://github.com/chinagloud/SafeVPN
1•chinagloud163•6m ago•0 comments

Kalshi Has Been Temporarily Banned in Nevada

https://www.wired.com/story/nevada-bans-kalshi-prediction-market/
1•HardwareLust•8m ago•1 comments

Show HN: An MCP server that helps coding agents find the full change surface

https://github.com/ftrou/Decodifier3.1
1•ftrou•10m ago•0 comments

What Looks Like Resilience in Iran Is Its Collapse Plan

https://parpanchi.substack.com/p/what-looks-like-resilience-in-iran
2•gambutin•10m ago•0 comments

Gea: A batteries-included, reactive JavaScript UI framework

https://github.com/dashersw/gea
2•thunderbong•10m ago•0 comments

Scroll Press

https://scroll.press/
2•mathgenius•11m ago•0 comments

The Claude Dichotomy

https://jonathannen.com/the-claude-dichotomy/
2•jwilliams•11m ago•0 comments

Bye Bye RTMP

https://daniel.haxx.se/blog/2026/03/21/bye-bye-rtmp/
2•jandeboevrie•13m ago•0 comments

Show HN: GoldenMatch – Entity resolution with LLM scoring, 97% F1, no Spark

https://github.com/benzsevern/goldenmatch
2•benzsevern•16m ago•0 comments

Russian women who don't want children will be sent to psychologist

https://www.thetimes.com/world/russia-ukraine-war/article/russian-women-children-psychologist-zzn...
5•randycupertino•19m ago•1 comments

Show HN: I ran Qwen3.5 35B on my iPhone at 5.6 tok/SEC

https://twitter.com/alexintosh/status/2035386645764006102
3•alexintosh•20m ago•0 comments

How to Attract AI Bots to Your Open Source Project

https://nesbitt.io/2026/03/21/how-to-attract-ai-bots-to-your-open-source-project.html
2•zdw•20m ago•0 comments

My Beef with Substack

https://tasshinfogleman.substack.com/p/my-beef-with-substack
2•tasshin•22m ago•0 comments

SSH Certificates and Git Signing

https://codon.org.uk/~mjg59/blog/p/ssh-certificates-and-git-signing/
3•zdw•22m ago•0 comments

Revert "userdb: add birthDate field to JSON user records

https://github.com/systemd/systemd/pull/41179
5•smartmic•25m ago•1 comments

The World Mood – A real-time, anonymous emotional map of the world

https://theworldmood.com
2•Unical-A•25m ago•0 comments

How Do You Design a Large-Scale AI Trust Experiment?

https://weightedthoughts.substack.com/p/how-do-you-design-a-large-scale-ai
2•starlitlog•27m ago•0 comments

Ironsmith – MTG card (de)compiler and multiplayer rules engine

https://github.com/chiplis/ironsmith
2•nicolas-siplis•29m ago•0 comments

Ask HN: Need IP attorney for DMCA/open-source licensing dispute

2•sansanagar•30m ago•0 comments

Claude Code workspace trust dialog bypass, settings loading order CVE-2026-33068

https://raxe.ai/labs/advisories/RAXE-2026-040
2•raxe•30m ago•0 comments

Gadoosh God Is Going to Jail for a Long Time [video]

https://www.youtube.com/watch?v=o1XflZUexvw
2•SteveClement•32m ago•0 comments

Prevalence of GenAI sexualized image usage by adolescents in the US

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0342824
2•gnabgib•32m ago•0 comments

Plot Ark – Open-source agentic curriculum engine (React/Flask/LightRAG)

https://github.com/Schlaflied/Plot-Ark
3•yuting_•39m ago•1 comments

Taxes: Geopolitics

https://devblogs.microsoft.com/oldnewthing/20051129-00/?p=33173
3•stevefan1999•40m ago•0 comments

The Engineer Who Tried to Put Age Verification into Linux

https://www.sambent.com/the-engineer-who-tried-to-put-age-verification-into-linux-5/
4•stalfosknight•40m ago•0 comments

Kagi Translate's AI answers question "What would horny Margaret Thatcher say?"

https://arstechnica.com/ai/2026/03/kagi-translates-ai-answers-the-question-what-would-horny-marga...
2•gnabgib•43m ago•0 comments

ZX Spectrum Basic controls a lunar lander in Kerbal Space Program [video]

https://www.youtube.com/watch?v=XQTh1Davsj8
2•nopakos•44m ago•0 comments

When the city becomes the weapon: IoT, AI, and the new face of warfare

https://andreafortuna.org/2026/03/21/when-the-city-becomes-the-weapon
2•fbistrash•45m ago•0 comments

Do Not Turn Child Protection into Internet Access Control

https://news.dyne.org/child-protection-is-not-access-control/
35•smartmic•46m ago•8 comments