frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Introduction to Computational Thinking by Grant Sanderson (3b1B) and MIT

https://ocw.mit.edu/courses/18-s191-introduction-to-computational-thinking-fall-2020/
1•kerim-ca•1m ago•0 comments

Liquid Bingo: Apple Product release bingo

https://substack.com/@joshuaherman/note/c-204382436
1•zitterbewegung•2m ago•0 comments

Critical Systems Thinking

https://bcghendersoninstitute.com/critical-systems-thinking-with-michael-c-jackson/
1•andsoitis•3m ago•0 comments

NASA's Artemis 2 moon rocket is on the launch pad: What's next?

https://www.space.com/space-exploration/human-spaceflight/nasas-artemis-2-moon-rocket-is-on-the-l...
1•mpweiher•4m ago•0 comments

Did Justin Sun buy his way out of an SEC lawsuit for $75M?

https://thebitgazette.com/did-justin-sun-buy-his-way-out-of-an-sec-lawsuit-for-75-million/
2•campusninja•8m ago•0 comments

Why 'market cap' doesn't mean what you think it means, and why it matters

https://thebitgazette.com/why-market-cap-doesnt-mean-what-you-think-it-means-and-why-it-matters/
2•campusninja•11m ago•0 comments

Dear America

https://dennisforbes.ca/blog/2026/01/dear_america/
2•llm_nerd•12m ago•0 comments

Endfield DB: An open-source production calculator for Arknights: Endfield

https://endfielddb.com/
1•causalzap•13m ago•1 comments

The Physics of Learning (and Why Almost No One Uses It)

https://twitter.com/justinskycak/status/2014496697481605246
2•JustinSkycak•13m ago•0 comments

Show HN: AI Advisory Board

https://stratis.one/
1•gewing•17m ago•0 comments

Exposing Game Servers over Tailscale

https://chameth.com/exposing-game-servers-over-tailscale/
1•PaulHoule•19m ago•0 comments

Coi – WebAssembly for the Modern Web

https://io-eric.github.io/coi/
1•todsacerdoti•20m ago•0 comments

3D-Printed Mathematical Lampshades

https://hessammehr.github.io/blog/posts/2025-12-24-maths-to-lampshade.html
2•hessammehr•20m ago•0 comments

Human Progress Data

https://humanprogress.org/datasets/
1•hubraumhugo•20m ago•0 comments

Show HN: MetaPurge – Strip metadata and timestamps from images/PDFs

https://github.com/XORD-AI/MetaPurge
1•Prof_Sigmund•22m ago•1 comments

Computers Can't Surprise

https://aeon.co/essays/sure-ai-can-do-writing-but-memoir-not-so-much
1•Brajeshwar•26m ago•0 comments

TR-49 is interactive fiction for fans of deep research rabbit holes

https://arstechnica.com/gaming/2026/01/tr-49-is-interactive-fiction-for-fans-of-deep-research-rab...
1•Brajeshwar•26m ago•0 comments

In 1932, Australia Started an 'Emu War'–and Lost

https://www.atlasobscura.com/articles/the-great-emu-war-australia
1•Brajeshwar•26m ago•0 comments

An open-source Git extension for tracking AI code

https://usegitai.com/
1•gempir•28m ago•0 comments

Can Time Be Computed? Part II

https://softwarefrontier.substack.com/p/can-time-be-computed-part-ii
2•CortexFlow•28m ago•1 comments

Show HN: SICore – Lightweight Java framework for beginners and AI codegen

https://github.com/sugaiketadao/sicore
1•sugaiketadao•35m ago•0 comments

Casmos: Optimizing for LLM Citations Instead of Rankings

https://yyyokel.com/claude-ai-search-monetization-operating-system-2026-playbook/
1•wompapumpum•36m ago•1 comments

US Army Poorly Prepared for Arctic: Finnish Forced Surrender During Exercise

https://militarnyi.com/en/news/us-army-poorly-prepared-for-arctic-operations-finnish-troops-force...
10•saubeidl•38m ago•1 comments

A brain glitch may explain why some people hear voices

https://www.sciencedaily.com/releases/2026/01/260122074033.htm
1•t-3•39m ago•0 comments

Building an open source anycast CDN (2021)

https://blog.apnic.net/2021/04/07/building-an-open-source-anycast-cdn/
1•Gooblebrai•39m ago•0 comments

Show HN: Doom running in OpenSCAD at 10-20 FPS

https://www.mikeayles.com/#openscad-doom
2•mikeayles•39m ago•0 comments

I need help finding VPNs for my Iranian friend

1•pickeledLobe•40m ago•1 comments

Latest ChatGPT model uses Elon Musk's Grokipedia as source, tests reveal

https://www.theguardian.com/technology/2026/jan/24/latest-chatgpt-model-uses-elon-musks-grokipedi...
5•nickcotter•41m ago•1 comments

Show HN: Built an AI powered image editor for IntelliJ

https://plugins.jetbrains.com/plugin/29778-imageedit-pro
1•erikpau•44m ago•0 comments

Evolving Instruction Following Beyond IFEval and "Avoid the Letter C"

https://surgehq.ai/blog/advancedif-and-the-evolution-of-instruction-following-benchmarks
1•gk1•46m ago•0 comments