Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
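
As a rough illustration only (this is not Refact.ai's actual code), the plan / execute / test / self-correct loop described above could be sketched as follows. Everything here is hypothetical: the `Workspace` type, the three toy tools, and `choose_tool`, which is a hard-coded stand-in for the place where a real agent would ask its orchestrator model which tool to call next.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Workspace:
    """Toy stand-in for a repository: a single source string."""
    source: str


def explore(ws: Workspace) -> str:
    # Hypothetical "repository exploration" tool: just returns the source.
    return ws.source


def modify(ws: Workspace) -> str:
    # Hypothetical "code modification" tool: applies one fixed patch.
    ws.source = ws.source.replace("a - b", "a + b")
    return "patched"


def run_tests(ws: Workspace) -> str:
    # Hypothetical "testing" tool: executes the source and checks one case.
    scope: dict = {}
    exec(ws.source, scope)
    return "pass" if scope["add"](2, 3) == 5 else "fail"


TOOLS: Dict[str, Callable[[Workspace], str]] = {
    "explore": explore,
    "modify": modify,
    "run_tests": run_tests,
}


def choose_tool(history: List[Tuple[str, str]]) -> str:
    """Stub orchestrator (assumption): a real agent would query an LLM here."""
    if not history:
        return "explore"
    last_tool, last_result = history[-1]
    if last_tool == "run_tests" and last_result == "fail":
        return "modify"  # self-correct after a failing test run
    return "run_tests"  # after exploring or patching, verify


def run_agent(ws: Workspace, max_steps: int = 10) -> Tuple[bool, List[Tuple[str, str]]]:
    """Single multi-step run: stop as soon as the tests pass."""
    history: List[Tuple[str, str]] = []
    for _ in range(max_steps):
        tool = choose_tool(history)
        result = TOOLS[tool](ws)
        history.append((tool, result))
        if tool == "run_tests" and result == "pass":
            return True, history
    return False, history


if __name__ == "__main__":
    ws = Workspace(source="def add(a, b):\n    return a - b\n")
    ok, history = run_agent(ws)
    print(ok, [tool for tool, _ in history])
```

The point of the sketch is the control flow: tools are picked step by step from the results so far rather than from a fixed script, and the run ends with one candidate solution once the tests pass.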

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
