Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
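To make the plan / execute / test / self-correct loop described above concrete, here is a minimal sketch of a tool-dispatching agent loop. It is not Refact.ai's implementation: the rule-based choose_tool() stands in for the orchestrator model, the tools are stubs that only update a dict, and every name other than deep_analysis is hypothetical. The test tool is rigged so the first patch fails, which forces one self-correction pass.

```python
from typing import Callable, Dict, List, Optional, Tuple

State = dict
Tool = Callable[[State], str]  # hypothetical tool signature: mutate state, return an observation


def explore(state: State) -> str:
    state["explored"] = True
    return "listed repository files"


def deep_analysis(state: State) -> str:
    # Stub for a reasoning tool; a real one would call a model.
    state["analyzed"] = True
    return "reasoned about the likely root cause"


def modify(state: State) -> str:
    state["attempt"] = state.get("attempt", 0) + 1
    state["patch"] = f"patch-v{state['attempt']}"
    return f"wrote {state['patch']}"


def run_tests(state: State) -> str:
    # Simulated test run: the first patch fails, the second passes.
    state["tests_pass"] = state["attempt"] >= 2
    if not state["tests_pass"]:
        state["patch"] = None  # discard the failing patch so the agent retries
        return "tests failed"
    return "tests passed"


TOOLS: Dict[str, Tool] = {
    "explore": explore,
    "deep_analysis": deep_analysis,
    "modify": modify,
    "test": run_tests,
}


def choose_tool(state: State) -> str:
    """Stand-in for the orchestrator: pick the next tool from the current state."""
    if not state.get("explored"):
        return "explore"
    if not state.get("analyzed"):
        return "deep_analysis"
    if state.get("patch") is None:
        return "modify"
    return "test"


def run_agent(max_steps: int = 10) -> Tuple[Optional[str], List[str]]:
    """Run one multi-step attempt; return the single passing patch (or None) and the tool trace."""
    state: State = {}
    trace: List[str] = []
    for _ in range(max_steps):
        name = choose_tool(state)
        TOOLS[name](state)
        trace.append(name)
        if state.get("tests_pass"):
            return state["patch"], trace
    return None, trace


if __name__ == "__main__":
    patch, trace = run_agent()
    print(patch, trace)
```

The loop returns exactly one patch per run and only after the test tool reports success, which mirrors the "one multi-step run, single solution" framing in the comment; the step budget is an assumption added so the sketch always terminates.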

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

I still enjoy building websites without AI

https://alprado.com/blog/i-still-enjoy-building-websites-without-ai/
1•alprado50•3m ago•0 comments

Everything Is Downstream of Technology

https://twitter.com/deepwhitman/status/2072896235753947361
2•bilater•7m ago•0 comments

Startup Spotlight: Inside Pastmaps' Solo Climb to Six Figures

https://runtimewire.com/article/startup-spotlight-inside-pastmaps-solo-climb-to-six-figures
1•ryanmerket•11m ago•0 comments

Relocating 6M Singapore bees and counting, one nest at a time

https://www.reuters.com/world/asia-pacific/relocating-6-million-singapore-bees-counting-one-nest-...
1•petethomas•17m ago•0 comments

Show HN: WyrmRSS – a self-hosted RSS reader with inline YouTube

https://github.com/kryoseu/WyrmRSS
1•kryoseu•18m ago•0 comments

Trump Jr.'s 'Amazon of guns' could make millions under new proposed firearm rule

https://www.reuters.com/legal/government/trump-jrs-amazon-guns-could-make-millions-under-new-prop...
5•petethomas•19m ago•0 comments

Claude Code Settings That Made Me a Faster Software Architect

https://jsdev.space/claude-code-settings-software-architect/
1•javatuts•19m ago•0 comments

A Deterministic Replacement for LLM-as-Judge in Stateful Agent Evaluation

https://arxiv.org/abs/2606.22737
4•jflynt76•20m ago•0 comments

Raw footage to maximized-retention videos

https://www.autoeditor.app/
1•Quise•24m ago•0 comments

New legal right to speak to a human for finance consumers

https://www.rte.ie/news/2026/0702/1581517-finance-chatbots/
3•austinallegro•33m ago•0 comments

Deepagents

https://github.com/langchain-ai/deepagents
2•kristianpaul•38m ago•1 comments

AI dev platform that keeps project context across the whole codebase lifecycle

https://brunelly.com/
2•RihabAI•43m ago•0 comments

AskHN: Using 'claude -p' for running Mr.Jassy - AWS butler agent

2•anoop_kumar•46m ago•0 comments

Wasmer: Fast, secure, lightweight containers based on WebAssembly

https://wasmer.io/
9•handfuloflight•49m ago•1 comments

BYD Denza Z steer-by-wire

https://carnewschina.com/2026/07/01/byd-denza-z-steer-by-wire-fudi-chassis/
6•Alien1Being•52m ago•0 comments

Google used its Android phone network's accelerometers as mini-seismometers

https://substack.com/@jklundblad/note/c-285567479
2•initramfs•53m ago•0 comments

From Open Source Software to Open Source Strategy

https://p3institute.substack.com/p/from-open-source-software-to-open
3•cletusigwe•55m ago•0 comments

The Free Market Lie: Why Switzerland Has 25 Gbit Internet and America Doesn't

https://stefan.schueller.net/posts/the-free-market-lie/
146•talonx•55m ago•76 comments

How to avoid AI in as many places as possible

https://www.fastcompany.com/91566861/how-to-avoid-ai-in-as-many-places-as-possible
2•1vuio0pswjnm7•1h ago•0 comments

Show HN: Bedtimeforkids let kids learn while entertain

https://bedtimeforkids.vercel.app
2•dutay05•1h ago•0 comments

Ua-tracer: what does a user agent fetch, follow and run

https://uatracer.com/
2•twapi•1h ago•0 comments

Every AI Visibility Tool Is Lying to You

https://canonry.ai/blog/ai-visibility-tools-are-lying
3•arberx•1h ago•0 comments

Google loses fight against record €4.1B EU antitrust fine

https://www.reuters.com/world/eu-top-court-dismisses-google-fight-against-record-41-billion-eu-an...
3•1vuio0pswjnm7•1h ago•0 comments

What Would Mark Twain Think of America at 250?

https://www.theatlantic.com/ideas/2026/07/mark-twain-america-anniversary-critique/687718/
3•paulpauper•1h ago•0 comments

Why Everyone Is Suddenly Talking About 'Universal Basic Capital'

https://www.theatlantic.com/economy/2026/07/universal-basic-capital-ai/687759/
6•paulpauper•1h ago•0 comments

Merlin: A computed tomography vision–language foundation model and dataset

https://www.nature.com/articles/s41586-026-10181-8
2•bryanrasmussen•1h ago•0 comments

Show HN: I built a declarative layout engine for SVG, Canvas, WebGL

https://github.com/carnworkstudios/boxwood
3•bonzai2carn•1h ago•0 comments

Artificial and Fake Eggs: Dance of Death

https://www.researchgate.net/publication/281149909_Artificial_and_Fake_Eggs_Dance_of_Death
2•ms7892•1h ago•0 comments

The Programming Wars: How Microsoft Crushed Borland

https://www.youtube.com/watch?v=AQiULz4Z4TQ
2•cable2600•1h ago•0 comments

14× faster embeddings: how we rebuilt the ONNX path in Manticore

https://manticoresearch.com/blog/onnx-embeddings-speedup/
6•snikolaev•1h ago•0 comments