frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Chronicle

https://developers.openai.com/codex/memories/chronicle
2•gmays•5m ago•0 comments

Show HN: Preflight – Test your MCP server before submitting to Claude/OpenAI

https://m8ven.ai/preflight
2•mengjiang•8m ago•2 comments

Nothing Matters

https://martinrue.com/nothing-matters/
1•afisxisto•10m ago•0 comments

What's new in JavaScript (and what's coming next)

https://neciudan.dev/whats-new-in-javascript
1•thunderbong•18m ago•0 comments

Flipbook – self hosted static viewers for media, documents and browser replays

https://flipbook.browserbox.io/
1•keepamovin•19m ago•0 comments

ElastAlert is dead, long live Clickdetect

https://clickdetect.souzo.me/blog/2026/04/19/elastalert-is-dead-long-live-clickdetect/
1•souzo•20m ago•0 comments

For $700 a Month, Sleeping Pods Make SF More Affordable

https://www.kqed.org/news/12080289/700-a-month-sleeping-pods-make-sf-more-affordable-but-at-what-...
2•harambae•20m ago•0 comments

Computerising Hyerogliphic Scripts [video]

https://www.youtube.com/watch?v=Vhx-hRyh6BM
1•downboots•21m ago•0 comments

Linkages to Trisect an Angle

http://www.takayaiwamoto.com/Greek_Math/Trisect/Linkage/Linkage_Tri.html
1•downboots•22m ago•0 comments

Pepperlot

https://pepperlot.com
1•alexrusulot•24m ago•0 comments

When oil prices spike, where does the money go?

https://theconversation.com/when-oil-prices-spike-where-does-the-money-go-280763
2•thelastgallon•24m ago•0 comments

Pressure, Temperature, and Phase Changes Within Supercritical CO2 Pipelines

https://www.mdpi.com/2227-9717/14/7/1039
2•PaulHoule•25m ago•0 comments

Windows 9x Subsystem for Linux

https://codeberg.org/hails/wsl9x
1•pabs3•26m ago•1 comments

Arch Linux Now Has a Bit-for-Bit Reproducible Docker Image

https://antiz.fr/blog/archlinux-now-has-a-reproducible-docker-image/
2•maxloh•27m ago•0 comments

A Generation Lost in the Bazaar – Quality happens when someone is responsible (2012)

https://queue.acm.org/detail.cfm?id=2349257
1•pabs3•28m ago•0 comments

Photographing Rocket Chute Deployment at 10 Km

https://hackaday.com/2026/04/22/photographing-rocket-chute-deployment-at-10-km/
2•y1n0•31m ago•0 comments

Test-foundry – QEMU-based Windows VM testing for kernel drivers and UEFI apps

https://github.com/jc-lab/test-foundry
2•joseph2024•31m ago•1 comments

Habitual coffee intake modifies host physiology and cognition

https://www.nature.com/articles/s41467-026-71264-8
2•gogobio•31m ago•1 comments

FlashDrive: Flash Vision-Language-Action Inference for Autonomous Driving

https://z-lab.ai/projects/flashdrive/
1•gmays•33m ago•0 comments

Microsoft looked at buying Cursor before SpaceX deal

https://www.cnbc.com/2026/04/22/microsoft-looked-at-buying-cursor-before-spacex-deal-sources-say....
1•mfiguiere•35m ago•0 comments

XAIDR – first runtime benchmark for agent-to-agent attack detection

https://github.com/anirudhraokotaru/xaidr-benchmark
2•delphisec•35m ago•0 comments

Let's Simulate the Org Charts Meme with Agents and See Who Wins

https://kunchenguid.substack.com/p/org-bench-lets-simulate-the-org-charts
2•bpierre•35m ago•0 comments

Fatty acid could restore failing vision

https://www.sciencedaily.com/releases/2026/04/260422091043.htm
2•y1n0•39m ago•0 comments

Job Is to Give a Shit

4•danfunk•41m ago•1 comments

Orthogravity [Desktop Webgame]

https://app-b5dj4l0ji2gx.appmedo.com/
1•mrKola•41m ago•0 comments

TeraFab facilities will use Intel's 14A process

https://www.tomshardware.com/tech-industry/semiconductors/elon-musk-says-terafab-will-use-intels-...
2•y1n0•42m ago•0 comments

Bruce Davidson – His landmark Subway series and his path to Magnum

https://www.youtube.com/watch?v=8KmDB4VHpzQ
1•fallinditch•43m ago•0 comments

ICE Got My Data – EFFector 38.8

https://www.eff.org/deeplinks/2026/04/how-ice-got-my-data-effector-388
4•omer_k•47m ago•1 comments

Vibe Genomics

https://vibe-genomics.replit.app/
1•jedixit•48m ago•0 comments

Database Turing Award Winner Mike Stonebraker [video]

https://www.youtube.com/watch?v=YPObBOwIrHk
3•guiambros•49m ago•0 comments