frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

How do you forecast with tiny datasets (2–15M ARR)

1•Gransberry•1m ago•0 comments

Gemini: I can't help with that. Try asking something else about this video

https://www.youtube.com/watch?v=g-QyFIu8Zbc
1•bicepjai•6m ago•1 comments

Neo-Royalism, the Trump Administration, and the Emerging International System

https://www.cambridge.org/core/journals/international-organization/article/further-back-to-the-fu...
3•bikenaga•11m ago•0 comments

Australia's social media ban, one month on

https://www.bbc.com/news/articles/c0mpmgn3jv2o
3•dabinat•14m ago•1 comments

System: Control your Mac from anywhere with AI

https://github.com/ygwyg/system
1•latchkey•14m ago•0 comments

EU calls for input: How to strengthen EU Open Source

https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=intcom:Ares%282026%2969111
2•Flundstrom2•16m ago•0 comments

The quietest home – an architect built it for himself out of medical need

https://nypost.com/2026/01/09/real-estate/inside-the-quietest-home-in-the-world/
1•Stratoscope•16m ago•0 comments

Timeline of supercomputers that carried the Cray name

https://cray-history.net/
2•stmw•16m ago•0 comments

Show HN: I vibecoded an ARM64 operating system that boots on real hardware

https://github.com/kaansenol5/VibeOS
3•kaansenol5•16m ago•0 comments

The places we make memories help us inscribe them

https://news.columbia.edu/news/places-we-make-memories-help-us-inscribe-them
1•hhs•18m ago•0 comments

Show HN: Constellations – On-the-fly D3 collaboration graphs of history via LLMs

https://github.com/johndimm/Constellations
1•johndimm•20m ago•1 comments

Amazon Has Big Hopes for Wearable AI – Starting with This $50 Gadget

https://www.bloomberg.com/news/articles/2026-01-09/amazon-has-big-hopes-for-wearable-ai-starting-...
1•geox•22m ago•0 comments

UK electric car charger rollout slows amid worries over EV switch

https://www.theguardian.com/environment/2025/dec/25/uk-electric-car-charger-ev-switch-sales
2•PaulHoule•22m ago•0 comments

Show HN: Senior Developer Playbook

https://thomastartiere.com/a-senior-developer-playbook
2•tartieret•23m ago•0 comments

Fly's Sprites.dev addresses dev environment sandboxes and API sandboxes together

https://simonwillison.net/2026/Jan/9/sprites-dev/
2•simonw•23m ago•1 comments

NT town of Katherine named Australia's best drop, nine years after PFAS detected

https://www.abc.net.au/news/2026-01-10/katherine-pfas-australia-best-drinking-water/106184842
1•defrost•24m ago•0 comments

Rust Crate for iMessage Database Operation

https://github.com/ReagentX/imessage-exporter
1•RyanZhuuuu•24m ago•0 comments

Washington National Opera Is Leaving the Kennedy Center

https://www.nytimes.com/2026/01/09/arts/music/washington-national-opera-kennedy-center.html
10•mikhael•25m ago•0 comments

Superposition

https://github.com/SuperP2026/RealStableSuperposition
1•SuperpositionCA•29m ago•0 comments

Senior Django Developers?

https://docs.google.com/forms/d/e/1FAIpQLSf_4wdfjMyIwqHm_3g0kP1KqtTZtusFrSv7J7c_JT-vqQdtGg/viewform
1•hoveratskycf•30m ago•0 comments

Transform a Commodore 1541 into a KIM-1

http://retro.hansotten.nl/transform-a-commodore-1541-into-a-kim-1/
3•reaperducer•31m ago•0 comments

First All-Solid-State Battery in Production Vehicles

https://www.donutlab.com/battery/
2•extesy•32m ago•1 comments

Small-time crypto investors are facing violent attacks

https://www.bloomberg.com/features/2026-crypto-thieves-kidnappers/
1•hhs•33m ago•0 comments

The Order in Chaos: 4M Double Pendulums [video]

https://www.youtube.com/watch?v=8jVogdTJESw
1•bromuro•33m ago•0 comments

X changed its Iran flag emoji to the historical lion and sun symbol

https://twitter.com/pubity/status/2009641460795416923
2•mahdihabibi•35m ago•1 comments

Online source of Lego instructions, catalogues and ideas books from years past

https://oldinstructions.com/
2•nailer•39m ago•0 comments

Show HN: I built a tool to create LLM Tier Lists based on real tasks

https://promt.oshn-ai.com/community/004471c6-b508-4ae8-a7cd-20ce6ab4ad65
1•iliailinskii•39m ago•0 comments

Are Tesla Gigafactory Berlin's days numbered?

https://electrek.co/2026/01/08/are-tesla-gigafactory-berlins-days-numbered/
7•pintxo•40m ago•3 comments

EktuPy

https://kushaldas.in/posts/introducing-ektupy.html
2•pauloxnet•45m ago•0 comments

The "self-help" books genre holds up an unflattering mirror to society

https://www.economist.com/culture/2025/12/30/what-self-help-books-tell-us-about-ourselves
1•hhs•47m ago•0 comments