frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Australia's Solar Sharer: free electricity 3 hours/day starting July 1

https://www.abc.net.au/news/2026-06-29/free-electricity-solar-sharer-scheme/105999242
1•indynz•39s ago•0 comments

I built a one-time-payment alternative to subscription cloud movers

https://www.gtransfer.app
1•localappointmnt•3m ago•0 comments

Towards Understandable Software

https://gracefulliberty.com/articles/towards-understandable-software/
1•birdculture•3m ago•0 comments

Google's Whitepaper on SDLC in the Agentic Era

https://www.kaggle.com/whitepaper-the-new-SDLC-with-vibe-coding
1•shahargl•5m ago•0 comments

Good article about local LLM on MacBook Air

https://www.xda-developers.com/ollama-new-mlx-engine-local-llm-mac-twice-fast/
1•taintech•5m ago•0 comments

Show HN: Smug – single binary, dependency free session manager for tmux

https://github.com/ivaaaan/smug
1•iillexial•6m ago•0 comments

Anthropic embedded spyware in Claude Code – and attempted to hide it from you

https://old.reddit.com/r/ClaudeAI/comments/1ujila1/anthropic_embedded_spyware_in_claude_code_and/
1•theanonymousone•6m ago•0 comments

AMD Stretches Server DRAM with Flash Extended Memory

https://www.nextplatform.com/store/2026/06/29/amd-stretches-server-dram-with-flash-extended-memor...
1•rbanffy•10m ago•0 comments

Celebrating my one-year layoff anniversary

https://iiro.dev/one-year-layoff-anniversary/
1•roughike•11m ago•0 comments

Show HN: 1ShotGen – From rough idea to full build prompt in 1 shot

https://1shotgen.com/
1•zachisparanoid•13m ago•0 comments

Show HN: Not Another AI Platform

https://tryhello.app
1•hayden_k•13m ago•1 comments

Show HN: SlimSnap – mark a screenshot element, get JSON for your coding agent

https://slimsnap.ai/
1•bickov•13m ago•0 comments

Appropriate Technology

https://en.wikipedia.org/wiki/Appropriate_technology
3•Atiscant•16m ago•0 comments

Horsewood (USA and Canada) Scam or Legit Male Performance Supplement?

https://finance.yahoo.com/sectors/healthcare/articles/horsewood-urgent-report-2026-horse-19110038...
2•gapusart•27m ago•0 comments

A Rust trading bot for Polymarket, 800µs decision loop

https://github.com/casatrick/polymarket-arbitrage-bot
3•casatrick•28m ago•0 comments

TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B

https://github.com/ggml-org/llama.cpp/pull/24219
1•trykhlieb•30m ago•0 comments

The Pregnancy and Health Apps Still Leaking Data in 2026

https://arxiv.org/abs/2606.26276
4•ddxv•30m ago•1 comments

Toys from Trash

https://www.arvindguptatoys.com/toys-from-trash.php
1•noufalibrahim•31m ago•0 comments

AI and Mathematics Research – Yikes (N.J. Wildberger)

https://www.youtube.com/watch?v=-QlC4C6mPIw
1•nyc111•32m ago•0 comments

The cost of AI is someone else's time

https://cephalosec.com/blog/the-hidden-cost-of-ai-is-someone-elses-time/
2•ilreb•32m ago•0 comments

Ozymandias on Rails. The Pedestal Inscription

https://baweaver.com/writing/2026/06/28/ozymandias-on-rails-the-pedestal-inscription/
2•Liriel•34m ago•0 comments

Show HN: I built an agent that uses email as a file system

https://www.supafax.com/
1•rohanmahen•38m ago•0 comments

The "I don't know, Claude wrote this" pandemic

https://newsletter.manager.dev/p/the-i-don-t-know-claude-wrote-this-pandemic
1•flail•40m ago•0 comments

Show HN: Ask your AI what changed across your competitors

https://industry-lens.com/mcp
1•IndustryLens•41m ago•0 comments

When Your IDE Becomes a RCE Endpoint

https://medium.com/trendyol-tech/when-your-ide-becomes-a-rce-endpoint-c87b85096b19
1•nofool•41m ago•0 comments

AI-native Formik alternative form library

https://fillament.dev
1•trialerror123•48m ago•0 comments

Getting Started with Rhombus

https://docs.racket-lang.org/rhombus-getting-started/index.html?fam=Rhombus&famroot=rhombus
1•azhenley•49m ago•0 comments

Ask HN: Sub 5 person team – Claude team plan?

1•anoop_kumar•50m ago•1 comments

Hand and Brain and Artificial Intelligence

https://www.theinternationalism.org/2026/02/hand-brain-artificial-intelligence.html
1•abbassix•55m ago•1 comments

Entertainment Software Association: "Minecraft private servers are illegal" [video]

https://www.youtube.com/watch?v=RgmtdeBIZ2s
1•krige•57m ago•0 comments