frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

AI-gateway that cuts LLM API costs by 40=70%

1•arnab777•38s ago•0 comments

Code Is Not a Product, Product Is Not a Startup

https://pawelbrodzinski.substack.com/p/code-is-not-a-product-product-is
1•flail•3m ago•0 comments

Microsoft Shareholders Sue over $357B Stock Wipeout

https://www.gadgetreview.com/microsoft-shareholders-sue-over-357-billion-stock-wipeout
1•speckx•5m ago•0 comments

Manna announces 'strategic pause' that grounds drone deliveries in Ireland

https://www.irishtimes.com/business/2026/06/19/manna-announces-strategic-pause-that-grounds-drone...
1•trusche•8m ago•0 comments

How I turned World Cup data into posters

https://zehfernandes.com/posts/how-i-turned-world-cup-data-into-posters
1•zehfernandes•8m ago•1 comments

Open-source AI skills that make Claude/ChatGPT produce real work, eval-scored

https://github.com/mohitagw15856/pm-claude-skills
1•mohitagw•8m ago•0 comments

A Shallow Introduction to the K Programming Language (2002)

https://web.archive.org/web/20070519112242/http://www.kuro5hin.org/story/2002/11/14/22741/791
1•tosh•8m ago•0 comments

Show HN: Intelligrade – EU Based Digital Exams

https://www.intelligrade.de/en
1•Escapado•8m ago•0 comments

Balsamiq Turns 18

https://e.customeriomail.com/deliveries/dgTrwgsDAKuYBqqYBgGe3uUDTbaOe6hHejNN_78=
2•constantinum•10m ago•0 comments

Show HN: StartupsBR – A Map of Brazilian Startups

https://www.startupsbr.com/sao-paulo
1•leonagano•10m ago•0 comments

Amazon Studios Won't Release Movie It Made That Makes Tech Billionaires Look Bad

https://www.showbiz411.com/2026/06/19/jeff-bezoss-amazon-studios-wont-release-movie-it-made-about...
3•harambae•10m ago•0 comments

LeCun blasts Musk's xAI as 'failure,' labs are risking a 'big bubble explosion'

https://www.cnbc.com/2026/06/18/yann-lecun-elon-musk-xai-failure-ai-labs-bubble-risk.html
1•1vuio0pswjnm7•10m ago•0 comments

Show HN: Gn – gets, filters, and labels news on the command-line

https://github.com/jake-gh1/gn
1•Jake83741•11m ago•0 comments

Google workspace threatening to block Firefox access

https://tales.fromprod.com/2026/169/google-workspace-threatening-to-block-firefox.html
3•birdculture•12m ago•0 comments

How four years at a startup changed the way I work

https://diogomr.com/posts/how-four-years-at-a-startup-changed-the-way-i-work/
2•mgdo•13m ago•1 comments

AI credit markets at risk of 'violent' correction investors must stay clear-eyed

https://www.cnbc.com/2026/06/17/ai-credit-markets-at-risk-of-violent-correction-man-group-says.html
1•1vuio0pswjnm7•14m ago•1 comments

Hyundai buys Boston Dynamics, Atlas humanoid to be used at vehicle plant by 2028

https://startupfortune.com/hyundai-takes-full-control-of-boston-dynamics-as-softbank-exits-for-32...
4•ck2•14m ago•0 comments

Proportion-Integral-Derivative Controllers

https://en.wikipedia.org/wiki/PID_controller
2•dhorthy•14m ago•0 comments

Bioregional Resilience Analysis: Mexican Dry and Coniferous Forests

https://naturalsystems.substack.com/p/bioregional-resilience-analysis-mexican
1•jelani_thompson•15m ago•0 comments

World Cup Tracker

https://www.schuetzler.net/blog/world-cup-tracker/
1•phalangion•17m ago•0 comments

Bringing an Open Source Project Back from the Dead

https://go-micro.dev/blog/27
2•asim•18m ago•0 comments

Mary Somerville: The Woman for Whom the Word "Scientist" Was Coined (2016)

https://www.themarginalian.org/2016/12/26/mary-somerville-scientist/
2•downbad_•19m ago•0 comments

I built a CLI poker game that you don't need to install to play

https://filiph.net/text/pokerd.html
2•filiph•20m ago•0 comments

Show HN: LeadLu – local business lead gen with built-in email and SMS outreach

https://www.leadlu.com/
1•brevn•21m ago•0 comments

US getting modern sunscreen formulations

https://www.theverge.com/column/952744/optimizer-sunscreen-bemotrizinol-fda-health
2•ctur•22m ago•1 comments

Show HN: OpenTunnel – Run Remote Commands as Local Agent Tool Calls

https://github.com/akoenig/opentunnel
3•akoenig•23m ago•0 comments

Spotify Killed the Thrill of the Hunt

https://erildrun.bearblog.dev/spotify-killed-the-thrill-of-the-hunt/
2•speckx•23m ago•0 comments

SpaceX Bankers Prepare for Bond Sale of at Least $20B

https://www.bloomberg.com/news/articles/2026-06-18/spacex-bankers-preparing-for-bond-sale-of-at-l...
3•2OEH8eoCRo0•23m ago•1 comments

Thirty Six Hours with Fable

https://tossrock.substack.com/p/36-hours-with-fable
1•Tossrock•24m ago•0 comments

Show HN: Leadmux – A Lead Generation Platform

https://leadmux.com
1•kataqatsi•24m ago•0 comments