
Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
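To make the plan, execute, test, self-correct cycle concrete, here is a toy sketch of a single multi-step run. It is a generic illustration only, not Refact.ai's actual code: the tool names, the `choose_tool` rule (a stand-in for the orchestrating model), and the assumption that the second patch attempt passes are all hypothetical.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tools: each reads and updates a shared state dict and
# returns a short text result. None of these names come from Refact.ai.
Tool = Callable[[dict], str]


def explore_repo(state: dict) -> str:
    state["explored"] = True
    return "found failing module"


def modify_code(state: dict) -> str:
    state["attempts"] += 1
    # Assumption for the demo: the patch is only correct on the second try.
    state["patched"] = state["attempts"] >= 2
    return "patch applied"


def run_tests(state: dict) -> str:
    return "pass" if state.get("patched", False) else "fail"


TOOLS: Dict[str, Tool] = {
    "explore_repo": explore_repo,
    "modify_code": modify_code,
    "run_tests": run_tests,
}


def choose_tool(state: dict) -> str:
    """Stand-in for the orchestrator: pick the next tool from the state."""
    if not state.get("explored"):
        return "explore_repo"
    if state.get("last_tool") == "modify_code":
        return "run_tests"
    # Either nothing has been patched yet or the last test run failed.
    return "modify_code"


def run_agent(max_steps: int = 10) -> Tuple[bool, List[str]]:
    """One run: keep choosing tools until the tests pass or steps run out."""
    state: dict = {"attempts": 0}
    trace: List[str] = []
    for _ in range(max_steps):
        name = choose_tool(state)
        result = TOOLS[name](state)
        state["last_tool"] = name
        trace.append(f"{name}: {result}")
        if name == "run_tests" and result == "pass":
            return True, trace
    return False, trace


if __name__ == "__main__":
    solved, steps = run_agent()
    print(solved)
    for step in steps:
        print(step)
```

In this sketch the self-correction is simply the loop returning to `modify_code` after a failed test run. A real agent would decide the next step with a model rather than a fixed rule.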

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
