frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Exomind

https://bodybybtl.com/solutions/exomind/
1•andsoitis•26s ago•0 comments

Engineering a Better Java Build Tool Experience [video]

https://www.youtube.com/watch?v=-DTYm1qhQ6U
1•lihaoyi•4m ago•0 comments

Unacceptable

1•hdjidb•5m ago•0 comments

America has the tools to heal division: we need to relearn them

https://bigthink.com/the-well/how-america-can-heal-division/
1•PaulHoule•8m ago•0 comments

The Hater's Guide to Nvidia

https://www.wheresyoured.at/the-haters-guide-to-nvidia/
1•PessimalDecimal•8m ago•0 comments

New AI slop signal: code blocks with weird indentation

https://xeiaso.net/notes/2025/slop-signal-indentation/
3•ajdude•9m ago•0 comments

From Silicon Valley to Hollywood, why California's job market is taking a hit

https://www.latimes.com/business/story/2025-11-26/from-silicon-valley-to-hollywood-california-job...
1•cgoodmac•10m ago•0 comments

What are small language models and how do they differ from large ones?

https://theconversation.com/what-are-small-language-models-and-how-do-they-differ-from-large-ones...
3•billybuckwheat•12m ago•0 comments

Ask HN: Recruiters, does contractor vs. FTE matter?

1•salt-thrower•14m ago•0 comments

Scribblenauts for Software

https://build.ms/2025/12/1/scribblenauts-for-software/
1•mergesort•14m ago•0 comments

Three Levels of Running LLMs from Laptop to Cluster-Scale Distributed Inference

https://www.bentoml.com/blog/running-local-llms-with-ollama-3-levels-from-local-to-distributed-in...
1•bbzjk7•17m ago•0 comments

Hytale System Requirements: Can Your PC Run Hytale?

https://hytaletop100.com/blog/hytale-system-requirements-can-your-pc-run-hytale
2•doobie12•19m ago•0 comments

Z-Image: An Efficient Image Generation Foundation Model [pdf]

https://arxiv.org/abs/2511.22699
1•SerCe•20m ago•0 comments

Decreasing Certificate Lifetimes to 45 Days

https://letsencrypt.org/2025/12/02/from-90-to-45.html
2•abraham•21m ago•0 comments

What Will Enter the Public Domain in 2026?

https://publicdomainreview.org/features/entering-the-public-domain/2026/
10•herbertl•23m ago•0 comments

Show HN: Dotgh – CLI to manage AI-assistant config templates

https://github.com/openjny/dotgh
1•openjny•26m ago•0 comments

Airwallex Faces China Backdoor Allegations from Prominent VC

https://www.forbes.com/sites/boazsobrado/2025/12/01/airwallex-faces-china-backdoor-allegations-fr...
1•DustinEchoes•31m ago•0 comments

Water Cycle Diagram – Interactive 4 Stages Animation – Free Learning Tool

https://senvnv.com/
1•Luki1234•33m ago•0 comments

Leaf blowers are the latest thing dividing Americans

https://www.economist.com/united-states/2025/12/01/leaf-blowers-are-the-latest-thing-dividing-ame...
1•petethomas•33m ago•0 comments

Europe's Green Energy Rush Slashed Emissions – and Crippled the Economy

https://www.wsj.com/business/energy-oil/europes-green-energy-rush-slashed-emissionsand-crippled-t...
3•monero-xmr•33m ago•1 comments

Samsung Debuts Its First Trifold Months Ahead of Folding iPhone

https://www.bloomberg.com/news/articles/2025-12-02/samsung-debuts-2-450-galaxy-z-trifold-months-a...
3•ashishgupta2209•38m ago•0 comments

Things I Learned in 2025

https://medium.com/@tomwhitwell/52-things-i-learned-in-2025-edeca7e3fdd8
1•Brajeshwar•42m ago•0 comments

Coherent Multi-Agent Trajectory Forecasting in Team Sports with CausalTraj

https://causaltraj.github.io
1•wezteoh•44m ago•1 comments

SurrealDB – A scalable, distributed, document-graph db, for the realtime web

https://github.com/surrealdb/surrealdb
1•modinfo•49m ago•0 comments

Ontology-Based Meta-System Architecture (Experimental)

https://ontomesh.org/OntoMesh-Architecture.html
1•nettalk83•50m ago•1 comments

"Airwallex, a Chinese backdoor into American data from AI labs and defense"

https://twitter.com/rabois/status/1995532262998417834
4•krrishd•51m ago•0 comments

How to Sound Like an Expert in Any AI Bubble Debate

https://www.derekthompson.org/?sort=community
2•gamechangr•52m ago•1 comments

Pete Hegseth Needs to Go–Now

https://www.theatlantic.com/ideas/2025/12/pete-hegseth-pentagon-department-defense/685098/
5•JumpCrisscross•52m ago•2 comments

Egui: An easy-to-use GUI in pure Rust

https://github.com/emilk/egui
2•modinfo•52m ago•0 comments

Returning to Linux

https://zackbartel.com/blog/2025/02/return-to-linux/
1•zackb•55m ago•0 comments