frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Metallic Microlattice

https://en.wikipedia.org/wiki/Metallic_microlattice
1•ZeljkoS•42s ago•0 comments

GNOME GitLab Git traffic caching

https://www.dragonsreach.it/2026/04/17/gnome-gitlab-git-pulls-caching-improvements/
11•JNRowe•3m ago•0 comments

The Quiet Coup: How AI Is Rewriting Power, Wealth, and Human Agency

https://neerajkarimpuzha.wordpress.com/2026/04/18/293/
2•neeraj_r•8m ago•0 comments

Fixing DNS tail latency with a 5-line config and a 50-line function

https://numa.rs/blog/posts/fixing-doh-tail-latency.html
2•fanf2•9m ago•0 comments

Biangbiang Noodles

https://en.wikipedia.org/wiki/Biangbiang_noodles
1•thunderbong•11m ago•0 comments

China humanoid robot half-marathon to showcase technical leaps

https://www.reuters.com/world/asia-pacific/china-humanoid-robot-half-marathon-showcase-technical-...
3•JumpCrisscross•14m ago•0 comments

A brief history of C/C++ programming languages

https://lemire.me/blog/2026/04/09/a-brief-history-of-c-c-programming-languages/
1•signa11•15m ago•0 comments

Cannabis may make you remember things that never happened

https://www.nationalgeographic.com/health/article/how-cannabis-affects-memory-thc-false-recall
2•johntfella•21m ago•0 comments

Anthropic decided to shut down our organization for an alleged violation

https://twitter.com/patomolina/status/2045281665363386504
1•isolli•21m ago•1 comments

Ask HN: How do small startups, solo/lean HR agencies manage hiring pipeline?

1•kathir05•24m ago•0 comments

Show HN: I can't write Python. It works anyway

https://github.com/Wewoc/Garmin_Local_Archive
1•Wewoc•25m ago•0 comments

Laimark – 8B LLM that self-improves. Consumer GPU

https://github.com/seetrex-ai/laimark
2•jesustabares•33m ago•0 comments

Peter Thiel Is Launching an "AI Ministry of Truth" Called Objection

https://old.reddit.com/r/antiai/comments/1sngw6f/peter_thiel_is_launching_an_ai_ministry_of_truth/
4•doener•40m ago•0 comments

Men caught competing in women's category of prestigious South African marathon

https://www.cnn.com/2026/04/17/sport/men-found-womens-category-sa-marathon-intl-scli
1•breve•40m ago•0 comments

Grok TTS and STT APIs

https://x.ai/news/grok-stt-and-tts-apis
2•chopete3•40m ago•1 comments

BibCrit – LLM grounded in ETCBC corpus data for Biblical textual criticism

https://github.com/Jossifresben/BibCrit
1•jossifresben•45m ago•0 comments

Long Covid Diagnostic Out of Stanford

https://join.muno.bio/
2•limalabs•50m ago•0 comments

Forsp: A Forth+Lisp hybrid lambda calculus language (2024)

https://xorvoid.com/forsp.html
1•HeliumHydride•51m ago•0 comments

The Art of the Fictional Pop Song

https://www.newyorker.com/culture/pop-music/the-art-of-the-fictional-pop-song
2•fortran77•52m ago•0 comments

America Lost the Mandate of Heaven

https://geohot.github.io//blog/jekyll/update/2026/04/18/america-mandate-of-heaven.html
3•mefengl•55m ago•1 comments

Claude Opus wrote a Chrome exploit for $2,283

https://www.theregister.com/2026/04/17/claude_opus_wrote_chrome_exploit/
4•Mohansrk•56m ago•0 comments

Purdue University CS240 Class over 50% of students 'caught' using AI on homework

https://old.reddit.com/r/Purdue/comments/1sogfb4/comment/ogsvymy/
1•twaldin•1h ago•2 comments

Unweight: Lossless MLP Weight Compression for LLM Inference

https://research.cloudflare.com/nikulin2026/
2•jgrahamc•1h ago•0 comments

Helpmate-Live, Social and AI Chat with Built-In CRM for WordPress

1•RhapsodyPlugins•1h ago•0 comments

Show HN: A delivery gate that automatically releases files when invoice is paid

1•pixelatedRudy•1h ago•1 comments

GloraMD Face Lift Serum

https://www.facebook.com/GloraMDFaceLiftSerumUS
1•bbangerr•1h ago•0 comments

Show HN: I made a self-employed expense keeper

https://bizlect.com
1•ispaceman•1h ago•0 comments

Garry Tan – On the LOC Controversy

https://twitter.com/garrytan/status/2045404377226285538
1•helloplanets•1h ago•0 comments

48 domains produce 22.5% of ChatGPT's B2B citations

https://growtika.com/blog/chatgpt-citation-economy
2•Growtika•1h ago•0 comments

Soul.md – open file format for AI agent identity

https://github.com/AntonioTF5/soul-spec
1•afonie•1h ago•0 comments