Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
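For readers who want a concrete picture of the plan, execute, test, self-correct cycle described above, here is a minimal sketch in Python. It is not Refact.ai's implementation: `run_agent`, `choose_tool`, `run_tests`, and the toy tools are all hypothetical names made up for illustration, and the "orchestrator" is a plain function standing in for a model call.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical tool shape: takes the current patch, returns (new_patch, observation).
Tool = Callable[[str], Tuple[str, str]]


@dataclass
class AgentRun:
    patch: str = ""
    log: List[str] = field(default_factory=list)


def run_agent(
    choose_tool: Callable[[AgentRun], str],
    tools: Dict[str, Tool],
    run_tests: Callable[[str], bool],
    max_steps: int = 10,
) -> AgentRun:
    """One multi-step run that stops at the first patch passing the tests."""
    run = AgentRun()
    for _ in range(max_steps):
        # The orchestrator (assumed: a model call) picks the next tool from history.
        name = choose_tool(run)
        run.patch, observation = tools[name](run.patch)
        run.log.append(f"{name}: {observation}")
        # Test after every step; a failure simply feeds the next iteration.
        if run_tests(run.patch):
            run.log.append("tests passed")
            return run
    run.log.append("step budget exhausted")
    return run


def demo() -> AgentRun:
    # Toy stand-ins for repository exploration and code modification.
    tools: Dict[str, Tool] = {
        "explore": lambda patch: (patch, "found bug in add()"),
        "edit": lambda patch: ("return a + b", "rewrote add()"),
    }

    def choose_tool(run: AgentRun) -> str:
        # Explore first, then edit; a real orchestrator would decide dynamically.
        return "explore" if not run.log else "edit"

    def run_tests(patch: str) -> bool:
        return patch == "return a + b"

    return run_agent(choose_tool, tools, run_tests)


if __name__ == "__main__":
    for line in demo().log:
        print(line)
```

The point of the sketch is only the control flow: a single run, tools chosen step by step, and a test gate that decides when the one final solution is returned.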

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
