frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Llama.cpp flags auto-tuning tool

https://github.com/raketenkater/ggrun
1•raketenkater•1m ago•0 comments

You must be This Tall to Ride

https://www.jacquescorbytuech.com/writing/you-must-be-tall-ride
1•iamacyborg•2m ago•0 comments

Agentlint – A security scanner for MCP server configs

https://github.com/Leporis14/agentlint
1•leporis14•3m ago•0 comments

AI Search Engine Exa Raises $250M Series C

https://exa.ai/blog/announcing-series-c
1•ankitg12•6m ago•0 comments

MDLight: 14MB Markdown reader (Wails, Go and Svelte) – no vaults or plugins

https://github.com/mdlight-dev/mdlight/
1•ZuhayrBarhoumi•7m ago•0 comments

It took two weeks to make Claude's "overnight solution" for flaky tests useful

https://thoughtbot.com/blog/what-it-took-to-use-this-overnight-ai-solution
1•zingar•9m ago•0 comments

Ask HN: Which free video downloader do you trust in 2026?

1•vanshsoni0027•10m ago•0 comments

Cargo Culture

https://www.wheresyoured.at/cargo-culture/
1•trueduke•11m ago•0 comments

Cambridge hospital staff accessed records of boy hurt in crocodile pit

https://www.theguardian.com/uk-news/2026/jun/26/cambridge-hospital-staff-investigated-accessing-r...
1•veltas•12m ago•0 comments

Volkswagen to axe up to 100k jobs in cost-cutting drive

https://www.ft.com/content/d0760eaf-d345-4964-b2ae-f55f6dfd9a4a
2•thm•18m ago•1 comments

ConvertBlink – convert rare image formats in the browser

https://www.convertblink.com/
1•maxzah•19m ago•0 comments

Show HN: I built a small audit layer for LLM-as-judge decisions

https://github.com/MatteoLeonesi/claim-memory-graph-sdk
1•ML0037•23m ago•0 comments

Intelligence per Watt: A Unified Metric for the AI Era

https://www.intelligence-per-watt.ai/
1•ilreb•24m ago•0 comments

Everyone suddenly sells themselves as "AI-native" on LinkedIn

https://write.as/7e9847vzuyxkx
3•garn810•29m ago•1 comments

From Isolated Agents to Agentic Mesh: Orchestrating SDLC with A2A and AP2

https://blog.owulveryck.info/2026/06/25/from-isolated-agents-to-agentic-mesh-orchestrating-sdlc-w...
1•owulveryck•31m ago•0 comments

The Baffling World of Masayoshi Son's Presentations

https://www.bloomberg.com/news/features/2020-06-23/golden-geese-and-unicorns-inside-the-eccentric...
1•phaser•33m ago•1 comments

Research as a Stochastic Decision Process

https://cs.stanford.edu/~jsteinhardt/ResearchasaStochasticDecisionProcess.html
1•sideway•35m ago•0 comments

Self-proclaimed King of Switzerland uses loophole to build his empire for free

https://www.france24.com/en/europe/20260510-how-switzerland-self-proclaimed-king-built-a-land-emp...
4•mrkn1•35m ago•0 comments

Monedula Apache Kafka Simulator

https://monedula.dev/kafka-simulator/
1•mmatloka•36m ago•0 comments

Jonas Lauwiner

https://en.wikipedia.org/wiki/Jonas_Lauwiner
1•GaryBluto•38m ago•0 comments

California State Government Launches AI Job Loss Tracker as Layoff Fears Grow

https://www.bloomberg.com/news/articles/2026-06-25/california-state-government-launches-ai-job-lo...
2•thm•40m ago•0 comments

New EU rules: military age Ukrainian men to lose refugee visas Jun 27 2027

https://www.reuters.com/world/eu-proposes-extending-ukrainian-protection-2028-limit-men-fighting-...
4•spwa4•45m ago•1 comments

The Naibbe cipher: a cipher that produces Voynich Manuscript-like ciphertext

https://www.tandfonline.com/doi/full/10.1080/01611194.2025.2566408#d1e6668
2•wise_blood•51m ago•0 comments

I created a new open-source project

2•danielsyauqi•52m ago•0 comments

AI 2027 Tracker

https://ai2027-tracker.com/
3•merksittich•1h ago•0 comments

Alan Greenspan Has Died

https://businessdesk.co.nz/article/economy/remembering-alan-greenspan
4•vismit2000•1h ago•0 comments

How much? The hidden costs of restaurant dishes

https://www.theguardian.com/food/2026/jun/26/how-much-the-hidden-costs-of-restaurant-dishes
6•helsinkiandrew•1h ago•0 comments

Midwit Cleanse – midwits wipe themselves off the gene pool

https://demiculus.com/midwit-cleanse/
2•demiculus•1h ago•0 comments

Translating Pandas to Polars using LLMs

https://pola.rs/posts/llm-polars-patterns/
8•jeroenjanssens•1h ago•0 comments

Paris to ban drinking alcohol in public as hospitals hit heatwave breaking point

https://www.theguardian.com/world/2026/jun/26/paris-heatwave-drinking-ban-drinking-alcohol-public
3•teleforce•1h ago•2 comments