Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•6mo ago

Comments

kate_at_refact•6mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
