Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
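
To make the plan / execute / test / self-correct loop described above concrete, here is a minimal sketch of how such a single multi-step run might be structured. It is not Refact.ai's actual implementation: `AgentState`, `choose_tool`, `run_agent`, and all tool bodies are hypothetical stubs, and the orchestrator decision that a real agent would delegate to an LLM is replaced by a fixed rule so the example runs offline. Only the `deep_analysis` name is borrowed from the post, and its body here is a placeholder.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class AgentState:
    """Everything the loop knows about the task so far."""
    task: str
    patch: str = ""
    tests_passed: bool = False
    history: List[str] = field(default_factory=list)


# Hypothetical tools: each receives the state and returns a short observation.
def explore_repo(state: AgentState) -> str:
    return f"located code relevant to: {state.task}"


def deep_analysis(state: AgentState) -> str:
    # Placeholder for a call to a separate reasoning model.
    return "root cause hypothesis recorded"


def modify_code(state: AgentState) -> str:
    state.patch = f"patch for: {state.task}"
    return "patch written"


def run_tests(state: AgentState) -> str:
    # Stub: tests pass as soon as any patch exists.
    state.tests_passed = bool(state.patch)
    return "tests passed" if state.tests_passed else "tests failed"


TOOLS: Dict[str, Callable[[AgentState], str]] = {
    "explore_repo": explore_repo,
    "deep_analysis": deep_analysis,
    "modify_code": modify_code,
    "run_tests": run_tests,
}


def choose_tool(state: AgentState) -> str:
    """Stub orchestrator policy (assumption: a real agent asks an LLM here)."""
    if state.history and state.history[-1].endswith("tests failed"):
        return "modify_code"  # self-correct after a failing test run
    used = {entry.split(":", 1)[0] for entry in state.history}
    if "explore_repo" not in used:
        return "explore_repo"
    if "deep_analysis" not in used:
        return "deep_analysis"
    if not state.patch:
        return "modify_code"
    return "run_tests"


def run_agent(task: str, max_steps: int = 10) -> AgentState:
    """One multi-step run that stops at the first passing solution."""
    state = AgentState(task=task)
    for _ in range(max_steps):
        name = choose_tool(state)
        observation = TOOLS[name](state)
        state.history.append(f"{name}: {observation}")
        if state.tests_passed:
            break
    return state


if __name__ == "__main__":
    for line in run_agent("fix failing date parser").history:
        print(line)
```

The point of the sketch is the shape of the loop: tools are picked step by step from the current state rather than from a fixed script, and the run ends with a single candidate solution once the tests pass or the step budget runs out.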

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

With Three Weeks Left, DJI Issues Last Plea for US to Start Its Mandated Audit

https://petapixel.com/2025/12/05/with-three-weeks-left-dji-issues-last-plea-for-us-to-start-its-m...
1•PaulHoule•1m ago•0 comments

Why Enterprises Need Evidential Control of AI Mediated Decisions

https://zenodo.org/records/17906869
1•businessmate•2m ago•1 comments

OpenAI opens internal merch store to the public

https://supply.openai.com
1•joos3•2m ago•0 comments

The Bay Area's longest-running morning news anchor is off the air. Here's why

https://www.sfgate.com/sf-culture/article/bay-area-morning-news-anchor-off-air-21237781.php
1•iancmceachern•5m ago•1 comments

Reverse-Eng the RK3588 NPU: Hacking Memory Limits to Run Vision Transformers

https://amohan.dev/blog/2025/shard-optimizing-vision-transformers-edge-npu/
1•homarp•5m ago•1 comments

A one file HTML concept to track project resources management

https://project-nine-xi-25.vercel.app/
1•DinakarS•5m ago•1 comments

Freeing a Xiaomi Humidifier from the Cloud

https://0l.de/blog/2025/11/xiaomi-humidifier/
2•stv0g•13m ago•0 comments

Nvidia builds location verification tech that could help fight chip smuggling

https://www.reuters.com/business/nvidia-builds-location-verification-tech-that-could-help-fight-c...
1•croes•17m ago•0 comments

Egg Decorating in Slavic Culture

https://en.wikipedia.org/wiki/Egg_decorating_in_Slavic_culture
1•petethomas•20m ago•0 comments

Build with Gemini Deep Research

https://blog.google/technology/developers/deep-research-agent-gemini-api/
1•GeorgeWoff25•26m ago•0 comments

TiXL – open-source motion graphics

https://tixl.app/
1•LordNibbler•28m ago•0 comments

How Newspapers Talked About Bitcoin in the Early 2010s

https://paleofuture.com/blog/2025/1/13/how-newspapers-talked-about-bitcoin-in-the-early-2010s
1•mooreds•37m ago•0 comments

Open AI, Microsoft face lawsuit over ChatGPT's alleged role in murder-suicide

https://apnews.com/article/ai-chatgpt-wrongful-death-lawsuit-greenwich-97fd7da31c0fa08f3d3ea9efd6...
2•airstrike•37m ago•1 comments

Making art is good for your health

https://text.npr.org/792439555
1•mooreds•38m ago•0 comments

The Abilene Paradox

https://cassidoo.co/post/abilene-paradox/
1•mooreds•38m ago•0 comments

Killer Whales and Dolphins May Team Up to Hunt Salmon

https://www.scientificamerican.com/article/killer-whales-and-dolphins-may-team-up-to-hunt-salmon/
3•1659447091•43m ago•0 comments

Show HN: Ocean Wave simulation: one-shotted by Gemini 3

1•freakynit•47m ago•0 comments

Humans were making fire 400k years ago, earlier than thought

https://apnews.com/article/britain-archaeology-fire-neanderthals-evolution-suffolk-3698b87f707ac4...
6•gmays•53m ago•0 comments

C2pm-Color to Pixel Map

1•Yukesh_J•55m ago•0 comments

The snail farm don: The most brazen tax avoidance scheme of all time

https://www.theguardian.com/news/ng-interactive/2025/dec/04/the-long-read-snail-farm-tax-avoidanc...
1•gmays•1h ago•0 comments

Measuring the Adoption of TLS ECH and Its Forebear in the Wild (2022)

https://web.archive.org/web/20250701041935/https://link.springer.com/chapter/10.1007/978-3-031-25...
1•1vuio0pswjnm7•1h ago•0 comments

SpaceX Plans to Go Public. Why?

https://arstechnica.com/space/2025/12/after-years-of-resisting-it-spacex-now-plans-to-go-public-why/
3•wintercarver•1h ago•0 comments

Why Private Equity Buyouts Are Taking the Wheel of Indian Businesses

https://taghash.io/blog/why-private-equity-buyouts-are-taking-the-wheel-of-indian-businesses/
1•koolhead17•1h ago•0 comments

The Word, the Name, the Fire (Book)

https://wordnamefire.com/
1•nuevita70•1h ago•2 comments

How I Streamlined My Development Workflow – A Game Changer for Productivity

1•quchao•1h ago•1 comments

When Money Buys Thinking: A New Day in the Life of Developers

https://tamnd.notion.site/2c74b9c1b50d8049b160f073cf773187
1•tamnd•1h ago•0 comments

Show HN: Record iOS Screen with Gestures Included

https://demoscope.app/#ios
1•admtal•1h ago•0 comments

Show HN: MiddleDrag: Three-finger trackpad gestures for middle-click on macOS

https://github.com/NullPointerDepressiveDisorder/MiddleDrag
1•NullPointerDD•1h ago•1 comments

America's Debanking Witch Hunt Finds No Evil

https://www.bloomberg.com/opinion/articles/2025-12-11/debanking-witch-hunt-finds-no-evildoing-in-...
1•petethomas•1h ago•0 comments

Show HN: I got my site down to 237kb by ditching Google Analytics

https://deadstack.net/
2•dreadsword•1h ago•0 comments