frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

The World Can't Keep Up with AI Labs

https://www.greaterwrong.com/posts/fewDbvpKMZLgGuWT2/the-world-can-t-keep-up-with-ai-labs
1•gmays•1m ago•0 comments

Bubbles

https://bubbles.town/
1•sonicrocketman•3m ago•0 comments

Show HN: Archik – architecture diagrams as YAML, with a Claude Code skill

https://github.com/bacharSalleh/archik
1•bacharsalehov•3m ago•0 comments

Can Musk force OpenAI to stay a nonprofit? AI's most anticipated trial starts

https://arstechnica.com/tech-policy/2026/04/musk-and-altman-face-off-in-trial-that-will-determine...
1•chhum•7m ago•1 comments

We retired an AI agent through a formal hearing

https://gist.github.com/david-steel/92fe0d4abb610303a3da0613ad5710d4
1•dsteel•9m ago•0 comments

The quiet power of headphones for people with autism

https://www.rnz.co.nz/life/wellbeing/the-quiet-power-of-headphones-for-people-with-autism
3•billybuckwheat•9m ago•0 comments

Entering the Post-Prompting World

https://blog.southparkcommons.com/p/entering-the-post-prompting-world
1•nadis•13m ago•0 comments

Show HN: A small hook to prevent agents from destructive things

https://gist.github.com/natew/3ff0751f26195e4e6b9927473595f5fe
1•nwienert•13m ago•0 comments

Ask HN: Will fixed applications become a thing of the past with agentic AI?

4•ex-aws-dude•14m ago•0 comments

Matter, Thread and Zigbee: What IoT Developers Need to Know Before Choosing

https://electronicsconsult.com/blog/matter-thread-zigbee-what-iot-developers-need-to-know-before-...
1•OOHehir•14m ago•0 comments

Series Raised $5.1M to Put Warm Intros in iMessage

https://www.siliconsnark.com/series-raised-5-1-million-to-put-warm-intros-in-imessage/
1•SaaSasaurus•14m ago•0 comments

(Morgan) Supersport 400

https://morgan-motor.com/news/introducing-supersport-400/
1•gnabgib•14m ago•0 comments

Zero-config Go heap profiling

https://coroot.com/blog/zero-config-go-heap-profiling/
2•valyala•17m ago•0 comments

An eBay Outage is Underway [reddit Megathread]

https://old.reddit.com/r/Ebay/comments/1sxbik2/ebay_outage_megathread/
1•WarOnPrivacy•17m ago•0 comments

Spotify Launches Fitness Hub with 1,400 Peloton Workouts

https://newsroom.spotify.com/2026-04-27/spotify-fitness-workouts-peloton/
1•7777777phil•18m ago•0 comments

Version Sentinel – Claude Code plugin that blocks outdated dependency installs

https://github.com/KSEGIT/Version-Sentinel
1•dupadupa234•21m ago•0 comments

Message Brokers Are Modern Grids

https://yusufaytas.com/message-brokers-are-modern-grids
1•birdculture•21m ago•0 comments

Longevity Science Is Overhyped. But This Research Could Change Humanity

https://www.nytimes.com/2026/04/27/magazine/cell-rejuventation-biotech-longevity-research-altos-l...
1•wjb3•23m ago•0 comments

Three men are facing 44 charges in Toronto SMS Blaster Arrests

https://www.tps.ca/media-centre/stories/unprecedented-sms-blaster-arrests/
3•gnabgib•24m ago•0 comments

Ask HN: Best way to escalate critical GAS bug ignored by Google

1•civeng•24m ago•0 comments

CinemaCLIP: A hybrid CLIP model for the visual language of cinema

https://www.ozu.ai/cinemaclip/
3•rsomani95•29m ago•0 comments

Rick and Morty Tried to Warn Us About Agentic AI

https://jadarma.github.io/blog/posts/2026/04/rick-and-morty-tried-to-warn-us-about-agentic-ai/
1•tymscar•29m ago•0 comments

Softmax, can you derive the Jacobian? And should you care?

https://idlemachines.co.uk/essays/softmax
1•smaddrellmander•30m ago•0 comments

The origin of human eyes traces back to an ancient "cyclops"

https://www.sciencedaily.com/releases/2026/04/260426012308.htm
1•yusufaytas•30m ago•0 comments

China warns EU over 'Made in Europe' plan, vows countermeasures

https://www.france24.com/en/europe/20260427-china-warns-eu-made-in-europe-plan-countermeasures
3•baal80spam•30m ago•0 comments

Show HN: Klutch MCP: control your credit card programmatically via Claud

https://www.klutchcard.com/landing-pages/klutch-mcp
1•renatost•30m ago•0 comments

Show HN: SQL Protocol – a SQL game that's now an MMO-lite

https://sqlprotocol.com
3•ItaiZeilig•30m ago•0 comments

Show HN: MyFriendlyWallet – a wallet running 54 chains on Cloudflare Workers

https://myfriendlywallet.io
1•sebjornstad•32m ago•0 comments

The $50 Movie Ticket Has Arrived

https://www.wsj.com/business/media/the-50-movie-ticket-has-arrived-42251672
2•bookofjoe•34m ago•2 comments

GitHub Copilot shifts to usage-based pricing June 1 – why that's no surprise

https://www.zdnet.com/article/github-copilot-shifts-to-usage-based-pricing/
1•CrankyBear•35m ago•0 comments