frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Republican Rep. Obsessed with Hating Muslims Unveils Mamdani Act

https://newrepublic.com/post/209272/republican-rep-chip-roy-mamdani-act-muslims-socialists-immigr...
1•handfuloflight•3m ago•0 comments

OpenMythos: A looped transformer take on how Claude Mythos might work

https://firethering.com/openmythos-open-source-claude-mythos-reconstruction/
1•steveharing1•5m ago•0 comments

China's energy fortress was built to withstand just this type of oil shock

https://www.cnn.com/2026/04/20/china/china-energy-security-global-oil-crisis-iran-intl-hnk
1•breve•12m ago•0 comments

Key Front End Architectural Patterns for Complex Applications

https://sometechblog.com/posts/frontend-architecture-decisions/
1•l5870uoo9y•12m ago•0 comments

PhantomChat – Post-quantum messenger with Monero-style stealth addresses

https://github.com/cengo441337-a11y/phantomchat
2•n0l3x•14m ago•0 comments

Modeling Sparse and Bursty Vulnerability Sightings

https://arxiv.org/abs/2604.16038
1•cedricbonhomme•18m ago•0 comments

The History of SuperTuxKart

https://supertuxkart.net/History_of_SuperTuxKart
1•mdtrooper•20m ago•0 comments

Digital Omnibus reality check: 83.5% of access requests not properly answered

https://noyb.eu/en/digital-omnibus-reality-check-835-access-requests-not-properly-answered
1•latexr•21m ago•0 comments

Show HN: Static Flipbooks of Complex Media

https://flipbook.browserbox.io/
1•keepamovin•23m ago•0 comments

Claude for Equity Research

https://www.asiancenturystocks.com/how-to-use-claude-for-equity-resear/
1•fritz123•23m ago•0 comments

DotLLM – Building an LLM Inference Engine in C#

https://kokosa.dev/blog/2026/dotllm/
2•bjarteaarmolund•27m ago•0 comments

A DIY Watch You Can Actually Wear

https://www.hackster.io/news/a-diy-watch-you-can-actually-wear-8f91c2dac682
1•sarusso•27m ago•0 comments

Grafana 13

https://grafana.com/blog/grafana-13-release-all-the-latest-features/
3•dabinat•29m ago•0 comments

Overview of Kimi K2.6 Model

https://platform.kimi.ai/docs/guide/kimi-k2-6-quickstart
2•igravious•30m ago•0 comments

Ask HN: Hit Supabase's free→$25 pricing cliff. Any middle-tier options?

1•cgozdemm•34m ago•2 comments

OpenDyslexic: A Typeface for Dyslexia

https://opendyslexic.org/
2•molp•37m ago•0 comments

Plotnine: Grammar of Graphics for Python

https://plotnine.org/
1•fanf2•37m ago•0 comments

GPT Image 2 – AI-Powered Image Generation Tool

https://gptimg2ai.net
1•danielmateo773•43m ago•0 comments

Valgrind-3.27.0 Is Available

https://sourceforge.net/p/valgrind/mailman/message/59324626/
1•paulf38•46m ago•0 comments

Crystal Now Has Official Linux ARM64 Builds

https://crystal-lang.org/2026/04/07/official-linux-arm64-builds/
3•TheWiggles•48m ago•0 comments

The AI revolution – spamming 680PRs in 442 GitHub repos in 21 days in April

https://github.com/SAY-5
1•ddorian43•50m ago•1 comments

The first neural interface that transforms your thoughts into text

https://sabi.com/
3•filippofinke•55m ago•0 comments

Indent Is All You Need

https://blog.est.im/2026/stdin-11
2•est•58m ago•0 comments

The arrogant superbanker whose hubris brought Britain to its knees

https://inews.co.uk/opinion/arrogant-superbanker-hubris-brought-britain-knees-4331457
1•robtherobber•59m ago•0 comments

Making the Rails Default Job Queue Fiber-Based

https://paolino.me/solid-queue-doesnt-need-a-thread-per-job/
1•earcar•1h ago•0 comments

The Dirty Little Secret of AI (On a 1979 PDP-11) [video]

https://www.youtube.com/watch?v=OUE3FSIk46g
1•KnuthIsGod•1h ago•0 comments

HappyHorse AI – AI-Powered Equestrian Training

https://www.runhappyhorse.net
1•danielmateo773•1h ago•2 comments

Master of chaos wins $3M math prize for 'blowing up' equations

https://www.scientificamerican.com/article/master-of-chaos-wins-usd3m-math-prize-for-blowing-up-e...
1•signa11•1h ago•0 comments

Why the Original Task Manager Was Under 80K and Insanely Fast [video]

https://www.youtube.com/watch?v=OyN4LGyPwxc
2•KnuthIsGod•1h ago•0 comments

Influencers Are Spinning Nicotine as a 'Natural' Health Hack

https://www.nytimes.com/2026/04/20/well/nicotine-health-maha.html
3•SockThief•1h ago•2 comments