frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

A secure cloud vault and usage-tracking service for all your LLM providers

https://blog.mozilla.ai/introducing-any-llm-managed-platform-a-secure-cloud-vault-and-usage-track...
1•mzlaai•34s ago•0 comments

Games That Pushed the Limits of the Sony PlayStation

https://racketboy.com/retro/games-that-pushed-the-limits-of-the-sony-playstation-ps1
1•tosh•49s ago•0 comments

Mental Models (2024)

https://allan.reyes.sh/models/
1•mooreds•1m ago•0 comments

Snapchat and FaceTime Banned in Russia

https://www.themoscowtimes.com/2025/12/04/roskomnadzor-confirms-its-blocking-snapchat-and-facetim...
1•defly•2m ago•0 comments

Why Are 38 Percent of Stanford Students Saying They're Disabled?

https://reason.com/2025/12/04/why-are-38-percent-of-stanford-students-saying-theyre-disabled/
1•delichon•3m ago•0 comments

Xzone Malloc

https://github.com/apple-oss-distributions/libmalloc/blob/af3c5dc3a540eeec030930b35b1349f4de40020...
1•gok•3m ago•0 comments

Vibecession: More Than You Wanted to Know

https://www.astralcodexten.com/p/vibecession-much-more-than-you-wanted
1•feross•4m ago•0 comments

Why PyTorch is an amazing place to work and Why I'm Joining Thinking Machines

https://www.thonking.ai/p/why-pytorch-is-an-amazing-place-to
1•vrnvu•4m ago•0 comments

How AI Is a Blessing and a Curse

https://substack.productmind.co/p/the-curse-of-ai-riches
1•okosisi•5m ago•0 comments

Super-Flat ASTs

https://jhwlr.io/super-flat-ast/
1•mmphosis•6m ago•0 comments

Russia blocks Snapchat, RIA reports

https://www.reuters.com/technology/russia-blocks-snapchat-ria-reports-2025-12-04/
1•schmuckonwheels•7m ago•0 comments

Show HN: LLM Debugging Traces

https://github.com/tomarrell/jtree
1•slicedbrandy•7m ago•0 comments

Show HN: ConvertDrop – Privacy-focused file converter in the browser

https://www.convertdrop.com/
1•BadgerSlayer•7m ago•0 comments

Generative AI is a Parasitic Cancer [video]

https://www.youtube.com/watch?v=-opBifFfsMY
1•ceving•7m ago•0 comments

Russia blocks Roblox and FaceTime amid growing rebuke of foreign tech platforms

https://www.cbc.ca/news/entertainment/russia-roblox-facetime-9.7002881
1•schmuckonwheels•8m ago•0 comments

Texas's Water Wars

https://www.newyorker.com/news/letter-from-the-southwest/texas-water-wars
1•PaulHoule•10m ago•0 comments

Han – A plugin marketplace for Claude Code built on Bushido principles

https://han.guru
1•jwaldrip•12m ago•1 comments

Ampcode / a Claude Code Alternative

https://ampcode.com/
1•krystofee•12m ago•0 comments

Show HN: After being laid off from a corporate job I built my first AI Startup

https://www.novichat.ai
2•antonio07c•13m ago•0 comments

Dan Wang on What China and America Can Learn from Each Other (Ep. 263)

https://conversationswithtyler.com/episodes/dan-wang/
1•paulpauper•15m ago•0 comments

Innovations in Health Care

https://marginalrevolution.com/marginalrevolution/2025/12/innovations-in-health-care.html
1•paulpauper•15m ago•0 comments

'Protestant Magic' Today

https://www.thefitzwilliam.com/p/protestant-magic-today
1•paulpauper•15m ago•0 comments

Travelers wear pajamas to airports in protest of government request

https://www.washingtonpost.com/travel/2025/12/04/airport-pajamas-duffy-video/
3•bookofjoe•17m ago•1 comments

Show HN: AQUA – model agnostic lightweight command line agent coordinator

https://vignesh07.github.io/aqua/2025/12/02/introducing-aqua.html
1•eigen-vector•20m ago•0 comments

Russia blocks Apple's FaceTime in mounting push against foreign tech platforms

https://www.reuters.com/business/retail-consumer/russia-imposes-restrictions-apples-facetime-app-...
2•coloneltcb•20m ago•0 comments

Spotlight on the Artist/Producer

https://billboard-bangladesh.odoo.com/
1•billboardbd•21m ago•0 comments

A small group of women changed the UK law on deepfake porn

https://www.theguardian.com/society/ng-interactive/2025/dec/04/i-dont-take-no-for-an-answer-how-a...
3•robaato•21m ago•0 comments

A Look at the PowerVR Graphics Architecture: Tile-Based Deferred Rendering

https://blog.imaginationtech.com/a-look-at-the-powervr-graphics-architecture-tile-based-rendering/
1•jakogut•22m ago•0 comments

Meta poaches Apple design exec Alan Dye to lead new Reality Labs studio

https://techcrunch.com/2025/12/03/meta-poaches-apple-design-exec-alan-dye-to-lead-new-creative-st...
1•biglyburrito•23m ago•0 comments

PyTogether: Collaborative lightweight real-time Python IDE for teachers/learners

https://github.com/SJRiz/pytogether
2•indigodaddy•24m ago•0 comments