frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Virtualbox.org "402 Payment Required"

https://www.virtualbox.org/
1•SpikedCola•45s ago•1 comments

Growth Is Value Flow, Not Vanity Metrics

https://www.nibzard.com/growth-value-flow
1•nkko•1m ago•0 comments

Disney accuses Google of 'massive' copyright infringement

https://www.theverge.com/news/842573/disney-google-copyright-infringement-cease-and-desist
2•mikexstudios•4m ago•0 comments

Would You Pay to Have Your Resume Read?

https://brodzinski.com/2025/12/pay-for-resume-read.html
1•flail•4m ago•1 comments

Benchmark that evaluates LLMs using 759 NYT Connections puzzles

https://github.com/lechmazur/nyt-connections
1•ShrugLife•5m ago•0 comments

Repurposing OpenTelemetry as a local flight recorder for AI debugging

https://syn-cause.com/blog/repurpose-otel-for-coding-agents
3•morethananai•5m ago•0 comments

Hidden Setting Controls What Happens When You Tap a Call in Phone App (iPhone)

https://tidbits.com/2025/12/07/hidden-setting-controls-what-happens-when-you-tap-a-call-in-the-ph...
1•DavideNL•6m ago•0 comments

Confirm You're Not a Robot

https://www.internetarchive.eu/?mailpoet_page
1•FrancoSivieri•7m ago•0 comments

Cool Linux apps to try this weekend

https://www.howtogeek.com/linux-apps-to-try-this-weekend-december-12/
1•losgehts•8m ago•0 comments

Our "enterprise" experience with Stripe after $1B+ processed (be careful)

4•Boulderchaim•9m ago•0 comments

Codex is Open Sourcing AI models

https://huggingface.co/blog/hf-skills-training-codex
1•ibobev•9m ago•0 comments

New in Llama.cpp: Model Management

https://huggingface.co/blog/ggml-org/model-management-in-llamacpp
1•ibobev•10m ago•0 comments

Rabata offering $100k in free S3-compatible storage credits to Gen-AI startups

https://rabata.io/grant-application
1•ivankuznetsov11•10m ago•1 comments

AI They Collapse from Avoidance

https://www.intelligent-people.org/2025/12/12/systems-dont-collapse-from-attack-they-collapse-fro...
1•micvicfaust9•11m ago•0 comments

Dawn of Quantum Simulators

https://www.science.org/doi/10.1126/science.adt1732
1•tzury•11m ago•0 comments

Impact statements submitted by victims of Do Kwon, 2022 Terra/Luna meltdown

https://www.mollywhite.net/micro/entry/202512111643
1•speckx•12m ago•0 comments

WhatsApp chats where family offices vet deals, plan meetups, sell dinosaur bones

https://www.cnbc.com/2025/12/11/whatsapp-family-office.html
1•rmesters•12m ago•1 comments

Care About Partial Differential Equations (PDEs)

https://huggingface.co/blog/hugging-science/pde
1•ibobev•13m ago•0 comments

From Coldfusion/Flash Developer to AI Founder: 30 Years Later

https://www.adgena.com/blog/30-year-overnight-success
1•giannidalertaph•14m ago•0 comments

Researchers: An important wetland in Ghana is under siege

https://theconversation.com/an-important-wetland-in-ghana-is-under-siege-researchers-investigate-...
1•PaulHoule•16m ago•0 comments

Is A.I. Actually A Bubble?

https://www.newyorker.com/culture/open-questions/is-ai-actually-a-bubble
1•pseudolus•16m ago•1 comments

Push vs. Pull in Web-Based Network Management (1998) [pdf]

https://infoscience.epfl.ch/server/api/core/bitstreams/a0d3fa32-35c5-4db8-89a8-47f0d3744335/content
1•fodmap•18m ago•0 comments

BpfJailer: eBPF Mandatory Access Control [pdf]

https://lpc.events/event/19/contributions/2159/attachments/1833/3929/BpfJailer%20LPC%202025.pdf
2•voxadam•19m ago•0 comments

Columbia Sportswear offers Flat Earthers the keys to the company

https://www.creativereview.co.uk/columbia-flat-earthers-campaign/
1•geox•20m ago•0 comments

IBM Company Songs

https://www.digibarn.com/collections/songs/ibm-songs/index.html
2•miki123211•20m ago•1 comments

Checkers Arcade

https://blog.fogus.me/games/checkers-arcade.html
1•fogus•21m ago•0 comments

Notes on Internet Addiction

https://twitter.com/benroy/status/1999215143729742002
1•gmays•24m ago•0 comments

Vibe-Coding a Startup MVP

https://senkorasic.com/articles/mvp-vibe-code
3•senko•24m ago•0 comments

Proton referral program: What it is and how it works

https://proton.me/support/referral-program
1•teekert•25m ago•0 comments

Show HN: Argly – Turn your iPhone/iPad into a remote for your Mac [over WiFi]

https://apps.apple.com/us/app/argly/id6755750961
1•tn_•26m ago•2 comments