frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Starling – Managed-first .NET web browser engine, built from primitives

https://starlingbrowser.com
1•bj-rn•2m ago•0 comments

Show HN: NEP – Ethereum JSON-RPC transform that beats ZSTD by 12%

https://github.com/Louw115/nep-ethereum-compression
1•LBWasserman•8m ago•1 comments

The Future of Film May Just Be Old Movies (2024)

https://www.theringer.com/2024/10/23/movies/repertory-revival-cinema-old-movie-screenings-vidiots...
1•cocacola1•11m ago•0 comments

Thinking more about Netscape Time

https://thehistoryoftheweb.com/thinking-more-about-netscape-time/
1•Brajeshwar•14m ago•0 comments

The Stochastically K Shaped Job Market

https://www.williamangel.net/blog/2026/06/05/the-stochastically-k-shaped-engineering-job-market.html
1•datadrivenangel•22m ago•0 comments

Silicon Valley's Secretive, Orgiastic Dark Side (2018)

https://www.vanityfair.com/news/2018/01/brotopia-silicon-valley-secretive-orgiastic-inner-sanctum
2•mgh2•22m ago•0 comments

Getting silly with C, part and((int*)1)[-1]

https://lcamtuf.substack.com/p/getting-silly-with-c-part-and-int1
3•surprisetalk•23m ago•0 comments

Show HN: Backup Your Perplexity Research to Markdown and Obsidian

https://chatgpt2notion.com/products/perplexity-to-obsidian/
1•chatgpt2notion•35m ago•0 comments

Show HN: Zedra – Mobile control plane for AI coding agents

1•tanlethanh•36m ago•1 comments

Why is the HN crowd so anti-AI?

3•Ekami•37m ago•8 comments

Definitive guide for creating skill.md for your tools

https://docsalot.dev/blog/what-is-skill-md
1•fazkan•42m ago•0 comments

Agent-ML-skills – Teach Codex/Claude/Cursor to stop making ML mistakes

https://github.com/param087/agent-ml-skills
1•param087•47m ago•0 comments

Show HN: Apple Contacts MCP – Local AI Access to macOS Contacts

https://github.com/lu-wo/apple-contacts-mcp
1•luwo•47m ago•0 comments

Trump Signals Interest in US Owning Stakes in Top AI Labs

https://www.bloomberg.com/news/articles/2026-06-05/us-exploring-government-partnerships-with-ai-f...
3•grassfedgeek•54m ago•2 comments

A better go file/text sharing service with single binary, inspired by microbin

https://github.com/zaaack/go-bin
1•zaaack•55m ago•0 comments

Show HN: The Deterministic Core Architecture for AI-Augmented Applications

https://brandonbellsystems.com/deterministic-core/
1•Brandon_Bell•1h ago•0 comments

Something is jamming GPS over Europe. Here's what we found (Veritasium) [video]

https://www.youtube.com/watch?v=tz23G_UXCGA
3•kordlessagain•1h ago•1 comments

Show HN: Lite Agent redefines what an AI agent is

https://liteagent.cloud
2•cheikhshift•1h ago•0 comments

Show HN: Declank – Remove AI Watermarks from Images

https://declank.skeptrune.com/
1•skeptrune•1h ago•0 comments

Bernie Sanders: A.I. Is a Public Resource. You Should Own Half of It

https://www.nytimes.com/2026/06/01/opinion/artificial-intelligence-bernie-sanders.html
6•ankitr•1h ago•2 comments

SAT-Physical Thermodynamic Framework: treating constraints as a thermal system

https://github.com/alikamp/SAT_HARDNESS_P-NP
1•kauai1•1h ago•0 comments

Tinker Cookbook

https://github.com/thinking-machines-lab/tinker-cookbook
2•dima1830•1h ago•0 comments

Why sophrosyne, an ancient Greek virtue, matters more than ever in the age of AI

https://theconversation.com/why-sophrosyne-an-ancient-greek-virtue-matters-more-than-ever-in-the-...
3•1659447091•1h ago•0 comments

Rethinking the Value of Generated Tests for LLM Software Engineering Agents

https://arxiv.org/abs/2602.07900
1•zuzululu•1h ago•0 comments

Ask HN: Will your company be doing "LeetCode" interviews a year from now?

2•locusofself•1h ago•5 comments

Show HN: Incremental SfM pipeline that reconstructs 3D point clouds from images

https://github.com/egeozgul/Incremental-3D-Reconstruction-SfM/tree/main
1•egeozgul•1h ago•0 comments

Show HN: ABC Classic 100 Rankings visualised

https://classic100.gotski.workers.dev/
17•gotski•1h ago•11 comments

Some concerns about Ladybird's bylaws

https://tuananh.net/2026/06/06/ladybird-bylaws/
3•tuananh•1h ago•3 comments

Alzheimer's patient gets back speech, bladder control and memory in drug trial

https://nypost.com/2026/06/04/health/alzheimers-patient-recovers-speech-continence-and-memory-wit...
10•virgildotcodes•1h ago•2 comments

Google will pay SpaceX $920M per month for compute capacity

https://twitter.com/JackKuhr/status/2062975800488394777
8•bear_with_me•1h ago•2 comments