frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•6mo ago

Comments

kate_at_refact•6mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Estimating AI productivity gains from Claude conversations

https://www.anthropic.com/research/estimating-productivity-gains
1•kerim-ca•2m ago•0 comments

Show HN: ConfluenceMeter Beta, live panel for crypto confluence

https://www.confluencemeter.com/mvp
1•Paugallego•4m ago•1 comments

Show HN: Root-dir: a command-line community for devs, builders and creators

https://www.root-dir.com
1•madsmadsdk•8m ago•0 comments

Formal Specification for Authorization: Clarity Before Implementation

https://blog.gchinis.com/posts/2025/11/formal-specification-for-authorization/
1•gchinis•8m ago•0 comments

Hamas attack victims sue Binance for allowing payments to militant group

https://www.reuters.com/legal/government/hamas-attack-victims-sue-binance-allegedly-allowing-paym...
2•barredo•9m ago•0 comments

Alphaproof paper (IMO 2024 Silver) is finally published in Nature [pdf]

https://www.nature.com/articles/s41586-025-09833-y_reference.pdf
1•zuzatm•10m ago•0 comments

Show HN: MenuPhotoAI – AI food photography that keeps dishes real

https://www.menuphotoai.com
1•redp314•12m ago•0 comments

Canva is considering porting Affinity to Linux

https://techcentral.co.za/affinity-for-linux-canvas-next-big-move-could-reshape-the-desktop-softw...
2•methuselah_in•13m ago•0 comments

Dutch public broadcaster NOS quits X over disinformation

https://www.reuters.com/business/media-telecom/dutch-public-broadcaster-nos-quits-x-over-disinfor...
2•giuliomagnifico•14m ago•0 comments

Skyscrapers engulfed in flames after fire spreads on bamboo scaffolding

https://metro.co.uk/2025/11/26/three-skyscrapers-engulfed-flames-fire-spreads-bamboo-scaffolding-...
1•perihelions•16m ago•0 comments

Coffee

https://chrispymm.co.uk/coffee
1•worez•22m ago•0 comments

Invisible Details of Interaction Design

https://rauno.me/craft/interaction-design
1•bfirsh•22m ago•0 comments

Learnings from 1 year of agents: PostHog AI

https://posthog.com/blog/8-learnings-from-1-year-of-agents-posthog-ai
1•czue•30m ago•0 comments

Show HN: An app that turns doomscrolling into learning

https://apps.apple.com/app/id6754678719
1•HamadAlmheiri•31m ago•0 comments

Show HN: NxtPitch – AI that instantly generates pitch proposals

https://nxtpitch.com
1•anmolkushwah19•34m ago•0 comments

Get us off Microsoft! Lawmakers press EU Parliament to change in-house IT

https://www.politico.eu/article/get-us-off-microsoft-eu-lawmakers-press-parliament-to-change-in-h...
3•robtherobber•36m ago•0 comments

Dell (Dell) Q3 2026 Earnings Call Transcript

https://www.theglobeandmail.com/investing/markets/stocks/DELL/pressreleases/36316186/dell-dell-q3...
1•doener•37m ago•1 comments

I don't care how well your "AI" works

https://fokus.cool/2025/11/25/i-dont-care-how-well-your-ai-works.html
3•todsacerdoti•38m ago•0 comments

Dynamic Skillset Reference Architecture

https://chatbotkit.com/examples/dynamic-skillset-reference-architecture
1•_pdp_•38m ago•1 comments

Cosmic Paradox Reveals the Awful Consequence of an Observer-Free Universe

https://www.quantamagazine.org/cosmic-paradox-reveals-the-awful-consequence-of-an-observer-free-u...
1•ibobev•40m ago•0 comments

A Cell So Minimal That It Challenges Definitions of Life

https://www.quantamagazine.org/a-cell-so-minimal-that-it-challenges-definitions-of-life-20251124/
2•ibobev•40m ago•0 comments

New York City's Next Super Storm

https://www.nytimes.com/video/nyregion/100000010524474/new-york-citys-next-super-storm.html
1•fleahunter•40m ago•0 comments

Particle Physicists Detect 'Magic' at the Large Hadron Collider

https://www.quantamagazine.org/particle-physicists-detect-magic-at-the-large-hadron-collider-2025...
1•ibobev•40m ago•0 comments

U.S. Nuclear Arms Chief Warns Against Leaks of Secret Information

https://www.nytimes.com/2025/11/26/science/brandon-williams-nuclear-weapons-nnsa.html
1•quapster•41m ago•0 comments

Underappreciated Books to Learn Object Oriented Design and UML?

2•shivajikobardan•54m ago•0 comments

Council reaches position on Chat Control

https://www.consilium.europa.eu/en/press/press-releases/2025/11/26/child-sexual-abuse-council-rea...
4•tonoto•54m ago•2 comments

Replace internal links in the new Gato AI Translations (WordPress)

https://gatoplugins.com/blog/replace-internal-link-urls-with-v15-2-of-gato-ai-translations-for-po...
1•leoloso•55m ago•0 comments

Show HN: VibeJar – Mood Tracking and Journal

https://apps.apple.com/in/app/vibejar-mood-tracker-journal/id6755709551
1•mohninad•1h ago•0 comments

LLMs: The gift that keeps on giving

https://mltrenches.substack.com/p/llms-the-gift-that-keeps-on-giving
2•druub•1h ago•0 comments

The Monocab Project

https://www.monocab-owl.de/english-language/
1•robin_reala•1h ago•0 comments