Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
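
As a rough illustration of the loop described above (plan, act with tools, test, self-correct, deliver one patch), here is a minimal Python sketch. It is not Refact.ai's code. Every tool name except deep_analysis, the scripted orchestrator, and the pass/fail logic are hypothetical stand-ins for what would be model calls and real repository tools.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Tool = Callable[[str], str]


@dataclass
class AgentRun:
    """State of one multi-step run on a single task."""
    task: str
    history: List[Tuple[str, str]] = field(default_factory=list)  # (tool name, result)
    patch: str = ""


def make_tools() -> Dict[str, Tool]:
    """Hypothetical tool suite; a real agent would touch a repository here."""
    state = {"edits": 0}

    def explore(path: str) -> str:
        return f"listed files under {path}"

    def deep_analysis(question: str) -> str:
        # Stand-in for a call to a separate reasoning model.
        return f"analysis of: {question}"

    def edit(description: str) -> str:
        state["edits"] += 1
        return f"edit #{state['edits']}: {description}"

    def test(_: str) -> str:
        # Assumption for the demo: the first edit is wrong, the second passes.
        return "pass" if state["edits"] >= 2 else "fail"

    return {"explore": explore, "deep_analysis": deep_analysis,
            "edit": edit, "test": test}


def scripted_orchestrator(run: AgentRun) -> Tuple[str, str]:
    """Stand-in for the orchestrator model: picks the next action from history."""
    if not run.history:
        return ("explore", "src/")
    last_tool, last_result = run.history[-1]
    if last_tool == "explore":
        return ("deep_analysis", run.task)
    if last_tool == "deep_analysis":
        return ("edit", "first attempt")
    if last_tool == "edit":
        return ("test", "")
    if last_tool == "test" and last_result == "fail":
        return ("edit", "revised attempt")  # self-correct after a failing test
    return ("finish", "final patch")


def run_agent(task: str,
              choose_action: Callable[[AgentRun], Tuple[str, str]],
              tools: Dict[str, Tool],
              max_steps: int = 10) -> AgentRun:
    """Run one loop until the orchestrator finishes or the step budget runs out."""
    run = AgentRun(task=task)
    for _ in range(max_steps):
        name, arg = choose_action(run)
        if name == "finish":
            run.patch = arg  # exactly one solution is delivered per task
            return run
        run.history.append((name, tools[name](arg)))
    return run


if __name__ == "__main__":
    result = run_agent("fix failing test", scripted_orchestrator, make_tools())
    for tool_name, output in result.history:
        print(tool_name, "->", output)
    print("delivered:", result.patch)
```

In this sketch the order of tool calls is decided step by step from the run history rather than fixed in advance, which is the property the description above emphasizes.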

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
