frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•2mo ago

Comments

kate_at_refact•2mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Context Engineering – can't call it engineering if we can't predict it breaking [video]

https://www.youtube.com/watch?v=vsfbplnJyA8
1•HammadB•1m ago•0 comments

MacPaint Art from the Mid-80s Still Looks Great Today

https://blog.decryption.net.au/posts/macpaint.html
1•decryption•1m ago•0 comments

The Dark Factor of Personality

https://darkfactor.org/
1•thinkingemote•4m ago•0 comments

Idris 2: quantitative type theory in practice

https://arxiv.org/abs/2104.00480
1•fanf2•5m ago•0 comments

The Role of XML in Interoperability

https://blog.documentfoundation.org/blog/2025/07/11/the-role-of-xml-in-interoperability/
2•zaik•7m ago•0 comments

Open-sourcing our clinical triage benchmark for evaluating LLMs

https://github.com/medaks/medask-benchmarks
1•klemenvod•9m ago•2 comments

Surprising not CR*P – Vibing hardware

https://www.youtube.com/watch?v=UQCpDarEoBc
2•iamflimflam1•11m ago•0 comments

Measuring the Impact of LLMs on Experienced Developer Productivity

https://hackaday.com/2025/07/11/measuring-the-impact-of-llms-on-experienced-developer-productivity/
2•iamflimflam1•12m ago•0 comments

The FBI just shut down a Nintendo Switch piracy site

https://www.theverge.com/news/705202/the-fbi-just-shut-down-a-nintendo-switch-piracy-site
1•01-_-•14m ago•0 comments

The Great Lay-Off'ening is already well underway. What'll happen to the economy?

https://old.reddit.com/r/wallstreetbets/comments/1lx93c4/the_great_layoffening_is_already_well_underway/
2•disqard•16m ago•0 comments

New Windows 11 build adds self-healing "quick machine recovery" feature

https://arstechnica.com/gadgets/2025/07/new-windows-11-build-adds-self-healing-quick-machine-recovery-feature/
1•01-_-•16m ago•0 comments

Transformers are the best equivalents of cognitive ability

https://dmf-archive.github.io/docs/posts/form-follows-function-2/
2•NetRunnerSu•23m ago•0 comments

Modern Java – A book teaches how to write modern and effective Java

https://javabook.mccue.dev
1•0x54MUR41•24m ago•0 comments

Show HN: Microsoft official MCP for documentation and more

https://github.com/MicrosoftDocs/mcp
2•ztq121121•26m ago•1 comments

New Date("WTF") – How well do you know JavaScript's Date class?

https://jsdate.wtf
3•OuterVale•33m ago•0 comments

Show HN: MailTion – AI-Powered Email Marketing for Businesses

https://mailtion-waitlist.vercel.app
1•jawwadjamiu•42m ago•0 comments

Ask HN: How to combine work, entrepreneurship and having a life

1•Nadssat•43m ago•0 comments

The Rise and Fall of the Knowledge Worker

https://jacobin.com/2025/07/knowledge-workers-ai-globalization-deindustrialization/
2•chobeat•44m ago•0 comments

Upwork Suspended My Account Without Reason

2•kerimn•48m ago•1 comments

Zuck Races to Build Godlike AI, Women and People of Color Aren't Invited

https://gizmodo.com/as-zuck-races-to-build-godlike-ai-women-and-people-of-color-arent-invited-2000628303
6•Bluestein•49m ago•0 comments

Why U.S. Geothermal May Advance, Despite Political Headwinds

https://e360.yale.edu/features/united-states-geothermal-republican-spending-bill
1•jbotz•49m ago•0 comments

Superpowers are real–these people are living proof

https://www.nationalgeographic.com/science/article/superpowers-real-human-abilities-genetics
5•Bluestein•49m ago•0 comments

Show HN: Sage-AI – AI-Powered Writing Assistant

https://sage-ai-waitlist.vercel.app
1•jawwadjamiu•50m ago•0 comments

Ask HN: What are some non-standard ways to reduce the size of executable files?

1•FerkiHN•53m ago•1 comments

Show HN: A simple old school news website

https://news.unixpods.dev
1•dd_xplore•57m ago•0 comments

Better Software Conference (Casey Muratori on OOP)

https://www.twitch.tv/bettersoftwareconference
2•creikey•1h ago•0 comments

Show HN: Free YouTube Tag Generator to Improve Video SEO

1•rahulbstomar•1h ago•0 comments

Building a Distributed Cache for S3

https://clickhouse.com/blog/building-a-distributed-cache-for-s3
1•subset•1h ago•0 comments

Bad Actors Are Grooming LLMs to Produce Falsehoods

https://americansunlight.substack.com/cp/168074209
44•nsoonhui•1h ago•23 comments

The Open Source AI Definition 1.0

https://opensource.org/ai
2•doener•1h ago•0 comments