frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: TripTip – A minimalist travel planner from wiki data

https://triptip.cat
1•belforn•1m ago•0 comments

Don't Make Gates Optional, Make Them Flexible

https://wakamoleguy.com/p/flexible-gates
1•wakamoleguy•3m ago•0 comments

Trump Threatens 100% Tariffs over Digital Services Tax on U.S. Firms

https://www.cnbc.com/2026/06/26/trump-tariff-trade-tech-tax.html
3•billybuckwheat•4m ago•0 comments

The AI-Run Business Index: measuring execution, not AI adoption

https://www.leapd.ai/resources/state-of-ai-run-businesses-2026
1•Cyrus2050•5m ago•0 comments

Show HN: Even, the terminal-first desktop workspace

https://eventerm.com/
1•todience•5m ago•0 comments

Font-Family Recommendations

https://chrismorgan.info/font-family
1•birdculture•6m ago•0 comments

The Thing We All Obviously Want

https://kmicinski.com/thing-we-all-want
1•matt_d•6m ago•0 comments

Ask HN: Can distributed data centers in individual households provide UBI?

1•SuboptimalEng•9m ago•3 comments

Ask HN: If we could remake Linux in 2026, what would you change?

1•alonsovm44•9m ago•1 comments

Pystd, similar-ish functionality with a fraction of the compile time

https://nibblestew.blogspot.com/2026/06/pystd-standard-library-similar-ish.html
2•ibobev•9m ago•0 comments

The Dottie Number

https://lawrencecpaulson.github.io//2026/06/26/Dottie_Number.html
1•ibobev•15m ago•0 comments

Show HN: Forensic stock analysis from SEC filings, no LLM guessing (free)

https://stockonomy.net/proof
1•SEC_Lense•17m ago•0 comments

Show HN: Deskmate Live – AI Desktop Pet Companions

https://deskmatelive.com/
1•valisvalis•17m ago•0 comments

The Nationwide Backlash Against Cameras Watching Your Car

https://www.wsj.com/us-news/the-nationwide-backlash-against-cameras-watching-your-car-401a656a
4•JumpCrisscross•20m ago•0 comments

SpaceX bonds sell off days after AI and rocket group's $25B debt deal

https://www.ft.com/content/04f98e21-4ce7-43d2-8651-44557e12c31c
2•JumpCrisscross•22m ago•0 comments

President warns of 100% tariff on countries implementing digital services tax

https://www.ft.com/content/5d886d47-c509-44a4-9077-bcd25158b61e
5•JumpCrisscross•23m ago•0 comments

AgentKits – 60 production-ready AI agent blueprints with guardrails

https://www.agent-kits.com
2•stoicstoic•24m ago•0 comments

The National Parks Were Reportedly Told to Stay Silent on Deaths

https://www.outsideonline.com/outdoor-adventure/environment/nps-internal-memo-deaths/?link_source...
5•LostMyLogin•24m ago•0 comments

A C++ implementation of a fast hash map and hash set using hopscotch hashing

https://github.com/Tessil/hopscotch-map
7•gjvc•25m ago•0 comments

Evan's Jujutsu Tutorial

https://evmar.github.io/jjtut/
2•joecobb•26m ago•0 comments

A couple of months ago in Miami, I sat down and dumped my brains

https://ghuntley.com/miami/
1•ghuntley•26m ago•0 comments

After 80 Years, Mathematicians Give Famed 'Erdős Method' an Upgrade

https://www.quantamagazine.org/after-80-years-mathematicians-give-famed-erdos-method-an-upgrade-2...
2•ibobev•28m ago•0 comments

The gap between open weights LLMs and closed source LLMs

https://blog.doubleword.ai/frontier-os-llm
7•kkm•29m ago•1 comments

Primed for Malware: Stop Selling Compromised Android Devices

https://www.eff.org/deeplinks/2026/06/primed-malware-stop-selling-compromised-android-devices
2•hn_acker•29m ago•0 comments

We Can Still Stop California's 3D Printer Surveillance Scheme

https://www.eff.org/deeplinks/2026/06/we-can-still-stop-californias-3d-printer-surveillance-scheme
4•hn_acker•30m ago•0 comments

Build real agentic apps using CUGA

https://huggingface.co/blog/ibm-research/cuga-apps
1•gmays•30m ago•0 comments

32bit Apple app running on M4 natively

https://old.reddit.com/r/MacOS/comments/1ufug75/major_breakthrough_32bit_apple_app_running_on_m4/
2•carlosjobim•30m ago•0 comments

Chatbots vs. Ozone

https://blog.dshr.org/2026/05/chatbots-vs-ozone.html
1•anonymous_user9•31m ago•0 comments

Lawmakers Must Act Now to Prevent Armed Police Drones

https://www.eff.org/deeplinks/2026/06/lawmakers-must-act-now-prevent-armed-police-drones
5•hn_acker•31m ago•0 comments

Show HN: I build an app for you to take a break from AI. RainBreak

https://rainbreak.franzai.com/
1•franze•34m ago•0 comments