frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•12mo ago

Comments

kate_at_refact•12mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

ChatGPT Wrestles with Its Most Chilling Conversation: How Do I Plan an Attack?

https://www.wsj.com/us-news/chatgpt-mass-shooting-openai-78a436d1
1•Brajeshwar•1m ago•0 comments

OpenAI Codex system includes explicit directive to "never talk about goblins"

https://arstechnica.com/ai/2026/04/openai-codex-system-prompt-includes-explicit-directive-to-neve...
2•randycupertino•3m ago•1 comments

Checkpoint/Restore in Userspace (CRIU)

https://criu.org/Main_Page
1•htfy96•3m ago•0 comments

OpenJDK 25 Security Update Released

https://tux.re/forum/viewtopic.php?t=222
1•sys3000•4m ago•0 comments

Enabling Monoglot Programming

https://www.humprog.org/~stephen/blog/research/enabling-monoglot-programming.html
1•htfy96•12m ago•0 comments

The Shape of a Guitar Pick

https://www.johndcook.com/blog/2026/05/03/guitar-pick/
1•malshe•16m ago•0 comments

Quantum teleportation between two quantum dots demonstrated over 270M

https://www.sciencedaily.com/releases/2026/04/260429102030.htm
1•thegdsks•18m ago•0 comments

Ask HN: Does Claude Code succeed after being asked "should we give up?" for you?

2•webwielder2•22m ago•0 comments

Show HN VibeAI FoldSpace by HugonomySystems

https://hugonomy.com/
1•GlyphWeaver_a•22m ago•0 comments

Scientists discover 27 potential new planets that orbit two stars

https://www.theguardian.com/science/2026/may/04/scientists-discover-27-potential-new-planets
2•nhatcher•23m ago•0 comments

Make some art with your phone sensors

https://tautme.github.io/phone-sensors/sensor-etch.html
1•adm4•29m ago•0 comments

Why Almost Everyone Loses on Prediction Markets

https://www.wsj.com/finance/investing/polymarket-kalshi-betting-profits-prediction-markets-eb23ac11
2•tysone•29m ago•0 comments

Don't fly if you can help it

https://michaelbluejay.com/airfare/dontfly.html
1•cxr•37m ago•2 comments

The Rise of Emotional Surveillance

https://www.theatlantic.com/culture/2026/05/worker-surveillance-emotion-ai/687029/
1•eloisius•42m ago•0 comments

Show HN: ReflowPDF – wrote a layout engine because every PDF library failed

https://reflowpdf.com
1•exsol•45m ago•0 comments

Think pop music is basic? Even classical and jazz are getting less complex

https://connectsci.au/news/news-parent/9259/Think-pop-music-is-basic-Even-classical-and-jazz
1•gmays•45m ago•0 comments

Shoppers falsely identified by facial recognition system

https://www.theguardian.com/technology/2026/may/03/guilty-until-proven-innocent-shoppers-falsely-...
2•kayfox•45m ago•0 comments

GameStop Proposes to Acquire eBay at $125.00 per Share

https://investor.gamestop.com/news-releases/news-details/2026/GameStop-Proposes-to-Acquire-eBay-a...
4•tech234a•46m ago•2 comments

PicoServer: A glue library embedding web server for .NET, no IIS, no Kestrel

https://www.nuget.org/packages/PicoServer
1•myhackernew•46m ago•0 comments

PostgreSQL databases are boring on purpose

https://stormatics.tech/blogs/the-best-postgresql-databases-are-boring-on-purpose
2•pgdatabase•1h ago•0 comments

Unauthorized macOS port claiming Don Ho as an author?

https://github.com/notepad-plus-plus/notepad-plus-plus/issues/17982
5•jethronethro•1h ago•0 comments

Bluchan.org – Anonymous Free Speech Forum

https://www.bluchan.org
1•jjhbhjbj•1h ago•1 comments

Diatom

https://en.wikipedia.org/wiki/Diatom
1•lucaslazarus•1h ago•0 comments

Trump: U.S. Navy will "guide" ships out of Strait of Hormuz from Monday

https://www.axios.com/2026/05/03/trump-us-navy-iran-ships-strait-hormuz
2•cosmicgadget•1h ago•1 comments

UIGen – Why runtime rendering is better than codegen (low code)

https://uigen-docs.vercel.app/blog/runtime-rendering-vs-code-generation
1•ombedzi•1h ago•0 comments

Tesla is facing up to $14.5B in lawsuits and it's only getting worse

https://electrek.co/2026/04/16/tesla-facing-up-to-14-billion-lawsuits-deep-dive/
6•1vuio0pswjnm7•1h ago•0 comments

Stohr

https://stohr.io
1•wesscope•1h ago•1 comments

Amazon takes $45M hit, abandons planned West Auckland data centre

https://www.rnz.co.nz/news/business/594164/amazon-takes-45m-hit-abandons-planned-west-auckland-da...
4•billybuckwheat•1h ago•0 comments

The new grads are not okay

https://blog.evan.hu/p/the-new-grads-are-not-okay
1•evanhu_•1h ago•2 comments

Request for Testers: Personal Planning App

https://www.threads.com/@alaaalatif/post/DX5VxSbmSgE
1•JustCallMeAl•1h ago•1 comments