frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

I feel like an artisan shoe maker in the age of Nike

https://modelcontextexperience.com/blog/i-feel-like-an-artisan-shoe-maker-in-the-age-of-nike
1•petervandijck•6m ago•0 comments

Age of Invention: Tudor Trade War

https://www.ageofinvention.xyz/p/age-of-invention-tudor-trade-war
1•Khaine•8m ago•0 comments

Show HN: flash.nvim, but for tmux…sort of

https://github.com/Kristijan/flash-copy.tmux
2•KristijanM13•10m ago•0 comments

Newly discovered coffee compounds beat diabetes drug in lab tests

https://www.sciencedaily.com/releases/2026/01/260110211224.htm
1•ashishgupta2209•11m ago•0 comments

AI, Japanese chimpanzee who counted and painted dies at 49

https://www.bbc.com/news/articles/cj9r3zl2ywyo
1•reconnecting•12m ago•0 comments

Humans Have Accidentally Created a Barrier Around the Earth

https://www.iflscience.com/humans-have-accidentally-created-a-barrier-around-the-earth-81973
1•akg130522•12m ago•0 comments

NotebookLM Watermark Remover – Remove Watermark from PDF

https://geminiwatermarkremover.net/
1•AI_kid1412•13m ago•0 comments

Video Message from Federal Reserve Chair Jerome H. Powell

https://twitter.com/federalreserve/status/2010510130970849338
1•baxtr•14m ago•0 comments

The Simpler Things in Life [video]

https://www.youtube.com/watch?v=els71JSBIaY
1•genderdoog•15m ago•0 comments

Linux Market Share Remains Above 3% for 3 Months in a Row – January 2026 Report

https://itsfoss.com/linux-market-share/
2•mindracer•16m ago•0 comments

One Thousand Words

https://drewmayo.com/1000-words/index.html
1•todsacerdoti•20m ago•0 comments

iFixit The Worst Devices of CES 2026 [video]

https://www.youtube.com/watch?v=cxZgILm95BU
2•levanten•20m ago•0 comments

Anthropic brings Claude to healthcare with HIPAA-ready Enterprise tools

https://www.bleepingcomputer.com/news/artificial-intelligence/anthropic-brings-claude-to-healthca...
1•fleahunter•21m ago•0 comments

Select text and search with your preferred engine or AI, all in one click

https://chromewebstore.google.com/detail/onering-select-and-search/fjpigicmicdmlmhmkilknomjkkipgafk
1•nanxiaobei•29m ago•0 comments

Jerome Powell's being threatened [video]

https://www.youtube.com/watch?v=RFTGjDR72i4
1•chii•29m ago•0 comments

Why is it so hard to do the thing I claim to want?

https://seekingtrust.substack.com/p/why-is-it-so-hard-to-do-what-i-claim
1•FinnLobsien•32m ago•0 comments

A field guide to sandboxes for AI

https://www.luiscardoso.dev/blog/sandboxes-for-ai
1•saikatsg•40m ago•0 comments

Scope: Hierarchical planner beats LLMs, 55x faster, 1/160k size

https://skyfall.ai/blog/scope-hierarchical-planner-55x-faster-than-llms
1•GeorgeOldfield•42m ago•0 comments

Revit AI Render: Faster AI Rendering for Architects

https://vocus.cc/article/6964af54fd897800012db1b1
1•architech_willy•46m ago•0 comments

You Need to Yearn More

https://twitter.com/justalexoki/status/2010380526402900028
1•keepamovin•46m ago•0 comments

Show HN: Self-hosted micro-learning platform with Full featured (Django/SolidJS)

https://github.com/cobel1024/minima
1•pigon1002•46m ago•1 comments

What Accenture's acquisition of Faculty means for AI enablement services

https://www.aienablementinsider.com/p/what-accenture-s-acquisition-of-faculty-ai-means-for-ai-ena...
1•dylancollins•47m ago•0 comments

Ask HN: What business processes still waste time every week?

1•lzr_mihnea•51m ago•0 comments

Show HN: AIIM – platform to build AI agents with psychological depth

https://ai-im.tech
1•juliavvrn•51m ago•0 comments

AI industry insiders launch site to poison the data that feeds them

https://www.theregister.com/2026/01/11/industry_insiders_seek_to_poison/
3•50kIters•59m ago•0 comments

AGI Next Frontier Summit in Beijing (260110)

https://haebom.dev/archive?tl=en&post=d367nxm38w8xv2j98pv1
2•haebom•1h ago•0 comments

Writing a Program in Par [video]

https://www.youtube.com/watch?v=nU7Lt6k3lNQ
2•razodactyl•1h ago•0 comments

Show HN: GAM7 Companion – macOS app that automates Google Workspace admin

https://github.com/halcarrell/gamgui-releases
1•stormer72•1h ago•0 comments

Show HN: I built a keyword tool that finds terms traditional tools miss

https://brightkeyword.com/
1•nyku•1h ago•1 comments

UpgradeLink – An Open-Source All-in-One Cross-Platform App Upgrade System

1•toolsetlink•1h ago•0 comments