frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: Homebutler – AI manages your homelab without getting shell access

https://github.com/Higangssh/homebutler
1•swq115•1m ago•0 comments

Starfish Space Raises Over $100M Series B

https://www.starfishspace.com/press-release/starfish-space-raises-over-100-million-series-b/
1•doppp•1m ago•0 comments

Show HN: Back2vibing – instantly jump back to your agent's tmux pane / terminal

https://back2vibing.builtby.win/
1•wjellyz•1m ago•1 comments

The asteroid belt contains solar system remnants

https://earthsky.org/space/what-is-the-asteroid-belt/
1•fosterrinehart•1m ago•0 comments

A four-month FDA delay forced a small biotech company to close its doors

https://www.statnews.com/2026/04/06/fda-delay-cited-in-closure-kezar-life-sciences-biotech-startup/
1•brandonb•3m ago•0 comments

Lemonade 10.1 Released for Improvements for Local LLMs on AMD GPUs and NPUs

https://www.phoronix.com/news/Lemonade-10.1-Released
1•breve•3m ago•0 comments

Artemis II in Eclipse

https://images.nasa.gov/details/art002e009301
1•reconnecting•5m ago•1 comments

Where AI Will and Won't Replace Us

https://deadneurons.substack.com/p/where-ai-will-and-wont-replace-us
1•nr378•6m ago•0 comments

What Paddle doesn't tell you about implementing metered billing

https://phare.io/blog/what-paddle-doesnt-tell-you-about-implementing-metered-billing/
1•nicbvs•7m ago•0 comments

"Absolute AI Maximalist" Adam Jacob on Building Software That Builds Software

https://redmonk.com/videos/adam-jacob-ai-maximalist/
1•mooreds•7m ago•0 comments

Show HN: Train ML models for 80% less by always picking the cheapest Spot region

https://spotroute.co/
1•hmontazeri•7m ago•0 comments

Generative Art over the Years

https://blog.veitheller.de/Generative_art_over_the_years.html
1•evakhoury•8m ago•0 comments

Did Test Automation Engineers Just Get Pluto-Ed?

https://testpappy.wordpress.com/2026/04/07/did-test-automation-engineers-just-get-pluto-ed/
1•mooreds•8m ago•0 comments

Educators Share Tips for Saving Time and Boosting Creativity with AI

https://www.aft.org/ae/spring2026/leonard_siebenmark_venagro
1•mooreds•8m ago•0 comments

Power Density at 50 KW/Rack: What It Costs and What It Breaks

https://syaala.com/blog
1•jaynamburi•9m ago•0 comments

How a blind man made it possible for others with low vision to build Lego sets

https://apnews.com/article/lego-bricks-for-blind-audio-braille-instructions-5a2a27de4354a0b144317...
1•speckx•9m ago•0 comments

New codex rate card made OAuth OpenClaw usage impossible

https://help.openai.com/en/articles/20001106-codex-rate-card#codex-rate-card-token-based-pricing
1•pama•9m ago•1 comments

AI Fixes the Bullshit Asymmetry

https://www.konstantinschubert.com/2026/03/31/ai-the-bullshit-defense.html
2•manx•9m ago•1 comments

Save tokens on Opus 4.6 thinking

https://github.com/juyterman1000/entroly
1•ashuabhi•10m ago•0 comments

Puru: A thread pool for JavaScript with Go-style concurrency primitives

https://github.com/dmop/puru
1•thunderbong•12m ago•0 comments

Wireless Festival cancelled after government stops Kanye West entering UK

https://www.bbc.co.uk/news/live/c77e60v0my1t
4•manarth•13m ago•0 comments

Show HN: A tool to turn random screenshots into structured tutorials

https://github.com/naimurhasan/PastePath
1•naimurhasanrwd•13m ago•0 comments

The upper middle class is now the largest income group in the U.S.

https://www.cbsnews.com/news/upper-middle-class-income-us-what-it-takes/
1•danielam•13m ago•2 comments

We Should Revisit Literate Programming in the Agent Era

https://silly.business/blog/we-should-revisit-literate-programming-in-the-agent-era/#footnote-3
1•evakhoury•14m ago•0 comments

Show HN: DeskTalk – talk to your desktop to build and modify local apps

https://www.desktalk.ai/
1•okcdz•14m ago•0 comments

Give an LLM an API and It'll Thrive. Give It a Touchscreen and It Struggles

https://blog.allada.com/give-an-llm-an-api-and-itll-thrive-give-it-a-touchscreen-and-it-struggles/
1•allada•14m ago•0 comments

Artemis II, Apollo 8, and Apollo 13

https://www.johndcook.com/blog/2026/04/02/artemis-apollo/
1•ibobev•15m ago•0 comments

Hyperbolic Version of Napier's Mnemonic

https://www.johndcook.com/blog/2026/04/02/hyperbolic-napier-mnemonic/
1•ibobev•15m ago•0 comments

Earthset and a solar eclipse: NASA releases first images from Moon fly-by

https://www.bbc.com/news/articles/cyv183v02j3o
1•meetpateltech•15m ago•0 comments

The Golden Path to Chaos: Adiabatic Twists

https://galileo-unbound.blog/2026/04/07/the-golden-path-to-chaos-adiabatic-twists/
1•ibobev•16m ago•0 comments