frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

It Is Time to Ban Sale of Precise Geolocation

https://news.risky.biz/srsly-risky-biz-it-is-time-to-ban-sale-of-precise-geolocation/
1•lschueller•49s ago•0 comments

Kintify Cloud Built something to make cloud debugging less painful

1•kintify•1m ago•0 comments

Book Translator: Two-pass local translation with self-reflection via Ollama

https://github.com/KazKozDev/book-translator
1•kazkozdev•5m ago•0 comments

Show HN: GrahamBell – If blockchain mining looked like this, would you mine?

1•HurairahShamsi•5m ago•0 comments

Mythos and National Power

https://www.chinatalk.media/p/mythos-and-national-power
1•0xkato•6m ago•0 comments

IPv6 GitHub Proxy

https://gh-v6.com
2•Alifatisk•11m ago•0 comments

Radiologists' Diagnostic Accuracy in Detecting ChatGPT-Generated Radiographs

https://pubs.rsna.org/doi/10.1148/radiol.252094
1•doener•13m ago•0 comments

About Data Lifetime (2013)

https://web.archive.org/web/20130425015441/http://arnulf.us/sevendipity/archives/59-About-Data-Li...
1•severo_bo•17m ago•1 comments

Avoiding Malloc for Small Strings in C with Variable Length Arrays (VLAs)

https://medium.com/@yair.lenga/avoiding-malloc-for-small-strings-in-c-with-variable-length-arrays...
2•yairlenga•17m ago•1 comments

10th Anniversary of jQuery (2016)

https://johnresig.com/blog/10th-anniversary-of-jquery/
1•downbad_•19m ago•1 comments

Shares in shoe brand Allbirds rise 580% after it pivots from footwear to AI

https://www.bbc.com/news/articles/c98mrepzgj7o
2•tcp_handshaker•21m ago•0 comments

Eurosky: Portal to the Atmosphere

https://portal.eurosky.tech/
1•doener•21m ago•0 comments

Intel Intrinsics Guide

https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html
1•tosh•22m ago•0 comments

The becquerel as an SI unit for request rate

https://entropicthoughts.com/si-units-for-request-rate
2•fanf2•22m ago•0 comments

Breathing Exercises to Relieve Stress

https://www.bhf.org.uk/informationsupport/heart-matters-magazine/wellbeing/breathing-exercises
1•thunderbong•25m ago•0 comments

Forti FIDE – local instrument for rhetorical awareness (open source, GPL v3)

https://fortifide.org
1•fluxussomnii•30m ago•0 comments

S3mini: Tiny and fast S3 client, new version wrapping fast Bun.S3

https://github.com/good-lly/s3mini/releases/tag/v0.9.4
1•Peter_J•32m ago•0 comments

Show HN: I built Emailbottle – AI email assistant, no inbox access

https://emailbottle.com
3•devshaded•33m ago•0 comments

Nobody Got Fired for Uber's $8M Ledger Mistake?

https://news.alvaroduran.com/p/nobody-got-fired-for-ubers-8-million
1•ohduran•34m ago•0 comments

Thin Harness, Fat Skills

https://twitter.com/garrytan/status/2042925773300908103
1•Anon84•34m ago•0 comments

KeePassχ – A KeePassXC Fork

https://codeberg.org/keepasschi
1•birdculture•35m ago•0 comments

The sonic anatomy of a double-tap strike

https://earshotngo.substack.com/p/the-sonic-anatomy-of-a-double-tap
1•moxifly7•35m ago•0 comments

Any Color You Like: NIST Scientists Create 'Any Wavelength' Lasers

https://www.nist.gov/news-events/news/2026/04/any-color-you-nist-scientists-create-any-wavelength...
1•geox•36m ago•0 comments

Ask HN: Simple tooling for local LLM code critique without IDE integration?

1•gspr•39m ago•0 comments

JetBrains goes all-in on agents with Central

https://leaddev.com/ai/jetbrains-goes-all-in-on-agents-with-central
2•chhum•39m ago•1 comments

Solid-state EV batteries are coming sooner than expected after another breakthro

https://electrek.co/2026/04/15/solid-state-ev-batteries-coming-sooner-than-expected/
1•xbmcuser•41m ago•0 comments

Servy – Any App as a Windows Service

https://servy-win.github.io/
1•mjtk•45m ago•0 comments

Claude Mythos and the EU Cyber Resilience Act

https://til.andrew-quinn.me/posts/claude-mythos-and-the-eu-cyber-resiilience-act/
1•hiAndrewQuinn•48m ago•0 comments

Can a General LLM Diagnose a Dicom Slice?

https://avkcode.github.io/blog/codex-dicom-benchmark.html
1•KyleVlaros•50m ago•1 comments

Synth-dataset-kit: Generate and audit synthetic datasets from seed data

https://github.com/KazKozDev/synth-dataset-kit
1•kazkozdev•52m ago•0 comments