frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

SEOTrends scans the internet to uncover easy-to-rank SEO opportunities

https://seotrends.pro/
1•kluiii•24s ago•1 comments

OpenWRT Performance Optimizer

https://github.com/Ahmad10611/openwrt-performance-optimizer
1•cf100clunk•2m ago•0 comments

Meta layoffs stress harsh AI reality inside Zuckerberg's company

https://www.cnbc.com/2026/05/18/metas-layoffs-starting-this-week-underscore-zuckerbergs-ai-realit...
2•drob518•2m ago•0 comments

What the AI hype gets wrong about software engineering

https://stackoverflow.blog/2026/05/18/what-the-ai-hype-gets-wrong/
1•mikece•2m ago•0 comments

The Open Agent Leaderboard

https://huggingface.co/blog/ibm-research/open-agent-leaderboard
1•ibobev•2m ago•0 comments

AI-Mediated Communication Can Steer Collective Opinion

https://arxiv.org/abs/2605.16245
1•sbulaev•4m ago•0 comments

First Streaming Fraud Case: A Musician's Alleged $10M Scam

https://www.rollingstone.com/music/music-features/streaming-fraud-fake-streams-mike-smith-1235500...
1•Geekette•7m ago•0 comments

Show HN: ThreeFour – run multi-step procedures one step at a time

https://threefour.app
1•onwardwild•7m ago•1 comments

How to Read Like a Child Again

https://www.theatlantic.com/newsletters/2026/05/childrens-books-adults/687191/
2•paulpauper•8m ago•0 comments

Microsoft testing adjustable taskbar, Start menu in Windows 11

https://www.bleepingcomputer.com/news/microsoft/windows-11-finally-gets-a-resizable-taskbar-and-s...
1•Brajeshwar•9m ago•0 comments

AI Has Broken Containment

https://www.theatlantic.com/technology/2026/05/ai-inflection-point-trump-china/687202/
2•paulpauper•9m ago•0 comments

News.Y Combinator.com/Submit

https://agentmemo.vercel.app
1•pulsoai•9m ago•0 comments

Antislop: Identifying and Eliminating Repetitive Patterns in LLMs

https://iclr.cc/virtual/2026/poster/10008156
2•Der_Einzige•10m ago•0 comments

ImpactArbiter – A PyTorch autograd trap for LLM memory bugs

https://github.com/msunda17/impactarbiter-cli
1•maniksundar•11m ago•0 comments

The US space enterprise is desperately waiting for Starship

https://arstechnica.com/space/2026/05/the-us-space-enterprise-is-desperately-waiting-for-starship...
1•tosh•12m ago•0 comments

A Rust-Python thing I am working on. Apache 2 licence

https://github.com/KevinKenya/nairobi-connector-open-source
5•kevinkenya•12m ago•0 comments

Bachelors Without Bachelor's: Gender Gaps in Education and Declining Marriage

https://www.nber.org/papers/w35179
2•paulpauper•12m ago•0 comments

Skybridge – the MCP Apps framework released v1.0

https://github.com/alpic-ai/skybridge/releases/tag/v1.0.0
3•Eldodi•13m ago•1 comments

Windows 11 brings back much-missed taskbar options

https://arstechnica.com/gadgets/2026/05/five-years-later-windows-11-brings-back-much-missed-taskb...
1•tosh•13m ago•0 comments

Everything You Need to Know About Black Cocoa Powder (2022)

https://saltandbaker.com/black-cocoa-powder-guide/
1•thomassmith65•15m ago•0 comments

Show HN: Eazip – Password-protected ZIPs (AES-256) in the browser, no upload

https://www.eazip.ch/
2•Zmaon•16m ago•0 comments

At Protocol for Agents

https://davidgasquez.com/atproto-agents
1•kalendos•17m ago•0 comments

For 20 years, Stephen Colbert distinguished truth from truthiness

https://www.npr.org/2026/05/18/nx-s1-5815315/stephen-colbert-final-show
4•geox•20m ago•0 comments

Preventing AI agents from executing destructive terminal commands

https://github.com/7Majesty-M/terminal-guardian-mcp
1•majesty-m•21m ago•1 comments

OVCS: Raspberry Pi–powered electric car

https://www.raspberrypi.com/news/ovcs-raspberry-pi-powered-electric-car/
1•Brajeshwar•22m ago•0 comments

Can we combine excellent design and branding simultaneously?

https://antar.me/blog/branding-vs-good-design/
1•redaantar•22m ago•0 comments

Show HN: Citycal – Collaborative Events Calendar

https://citycal.com
1•oliv__•23m ago•0 comments

How India's cooking fuel shortage is driving up California's gas prices

https://www.reuters.com/business/energy/how-indias-cooking-fuel-shortage-is-driving-up-california...
1•tartoran•23m ago•0 comments

Show HN: Kaption – Live OCR subtitle overlay

https://github.com/wojciechowskiapp/Kaption
1•wojciechowskiap•23m ago•0 comments

Clojure Freed Me from the Ceremony

https://carlosblanco.github.io/clojure/functional-programming/2020/10/15/functional-programming-c...
2•zonotope•24m ago•0 comments