frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Are ClickHouse JOINs Slow? A 2026 PR-by-PR Analysis

https://dataanalyticsguide.substack.com/p/clickhouse-join-performance-2026
1•manveerc•1m ago•0 comments

Sandyaa: Recursive-LLM source code auditor that writes exploitable PoCs

https://github.com/securelayer7/sandyaa
1•sandeep_kamble•1m ago•1 comments

How Not to 'Pilet' a Kickstarter

https://c33tech.com/blog/2026/04/how_not_to_pilet_a_kickstarter/
1•mikeflynn•2m ago•0 comments

Michael O. Rabin has passed away

https://en.wikipedia.org/wiki/Michael_O._Rabin
1•statusreport•3m ago•0 comments

Connect iMessage to your Claude Code assistant

https://github.com/anthropics/claude-plugins-official/tree/main/external_plugins/imessage
1•rob•3m ago•0 comments

New (Twin) Dad Advice

https://hec.works/blog/new-twin-dad/
1•dividedcomet•5m ago•1 comments

Show HN: Turned a viral DevOps debugging tweet into a playable incident SIM

https://youbrokeprod.com/login?redirect=%2Fplay%2Frunaway-process-001
1•cdnsteve•6m ago•0 comments

Anthropic Redesigns Claude Code Desktop

https://twitter.com/claudeai/status/2044131493966909862
1•Nevin1901•7m ago•1 comments

Show HN: Start Using Claude Managed Agents Today – Posse

https://github.com/oguzbilgic/posse
1•obilgic•7m ago•0 comments

I Went to China to See Its Progress on A.I. We Can't Beat It

https://www.nytimes.com/2026/04/13/opinion/china-ai-america-chipmakers.html
1•suvan•8m ago•1 comments

Show HN: Would you score a podcast debate?

1•fcpguru•9m ago•0 comments

California moves forward with its 'Stop Nick Shirley Act'

https://www.deseret.com/politics/2026/04/14/stop-nick-shirley-act-california-fraud/
2•donsupreme•11m ago•0 comments

Agent Skill for Jj Jujutsu VCS

https://github.com/danverbraganza/jujutsu-skill
1•nvader•14m ago•0 comments

Android IRCx

https://github.com/AndroidIRCx/AndroidIRCx
1•sans_souse•15m ago•0 comments

Lisp is Not an Acceptable Lisp (2006)

https://steve-yegge.blogspot.com/2006/04/lisp-is-not-acceptable-lisp.html
2•fyskij•15m ago•0 comments

TN's Charlie Kirk Act bans student walkouts, protects conservative speakers

https://wpln.org/post/tennessees-charlie-kirk-act-bans-student-walkouts-protects-conservative-spe...
1•bediger4000•18m ago•2 comments

The Timeless Way of Building

https://en.wikipedia.org/wiki/The_Timeless_Way_of_Building
1•gradus_ad•18m ago•1 comments

Cozy landing page I liked

https://www.chloeyan.me/
1•Akcium•18m ago•0 comments

Looms taught us to store, share, and "run" logic

https://cyrusradfar.com/thoughts/thread
1•cyrusradfar•18m ago•1 comments

Amazon to acquire Globalstar in $11.6B satellite bet

https://www.bloomberg.com/news/articles/2026-04-14/amazon-to-buy-satellite-operator-globalstar-fo...
1•samaysharma•19m ago•0 comments

How Poor Am I?

https://howpoorami.org
1•gaws•19m ago•0 comments

Tactical Success, Strategic Failure? Washington Walks the Path to Defeat in Iran

https://warontherocks.com/tactical-success-strategic-failure-washington-walks-the-path-to-defeat-...
2•colonCapitalDee•22m ago•0 comments

Show HN: Three backtested quant strategies as Jupyter notebooks

https://mattitude8861.gumroad.com/l/QuantStrategyTemplatesBundle
1•Shmungus•23m ago•0 comments

Ups Seeks to Replace Manual Scans with RFID Tracking Tech

https://www.wsj.com/logistics-report/ups-seeks-to-replace-manual-scans-with-tracking-tech-caf437db
2•bookofjoe•24m ago•1 comments

AI and Videogames

https://www.youtube.com/watch?v=0sNxMeBU_Tg
1•frag•24m ago•0 comments

Mirror – private reflection app with long-term memory

https://mirror-eight-gamma.vercel.app/en
1•eduardonrj•25m ago•1 comments

SpaceX Is Basically a Huge Meme Stock

https://www.theatlantic.com/ideas/2026/04/spacex-ipo-elon-musk/686793/
5•breve•25m ago•0 comments

Rails at the Center of DNSimple

https://podcast.rubyonrails.org/2462975/episodes/18999348-simone-carletti-rails-at-the-center-of-...
3•robbyrussell•27m ago•1 comments

Show HN: Greptile for Security (open source)

https://www.strix.ai/blog/pentesting-every-pull-request
2•bearsyankees•28m ago•0 comments

Domain Knowledge Is the Product

https://automato.substack.com/p/your-domain-knowledge-is-the-product
1•andrewstetsenko•29m ago•0 comments