frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Ask HN: How do you keep track of developments in the AI space?

1•abrbhat•1m ago•0 comments

What's in a Button?

https://belkadan.com/blog/2025/11/Whats-in-a-Button/
1•PaulHoule•2m ago•0 comments

When A.I. Took My Job, I Bought a Chain Saw

https://www.nytimes.com/2025/12/28/opinion/artificial-intelligence-jobs.html
1•gmays•3m ago•0 comments

Preview of 'The Joy of Cryptography'

https://garbledcircus.com/kemdem/real-rand
1•altro•3m ago•0 comments

Trying to be the new GitHub, let me know what you think

https://app.principal-ade.com
1•fernandoramlugo•4m ago•0 comments

The Wave Function of the Universe and Inflation

https://arxiv.org/abs/2510.04775
1•northlondoner•7m ago•1 comments

Show HN: Isit2026yet.com – A single-serving site for the New Year

https://isit2026yet.com/
1•eamongordon•8m ago•1 comments

New Year Zone

https://newyear.zone
2•aaaronson•11m ago•0 comments

Shipping at Inference-Speed

https://steipete.me/posts/2025/shipping-at-inference-speed
2•xngbuilds•13m ago•0 comments

Writing for Developers

https://codecrafters.io/blog/writing-for-developers
1•0x54MUR41•14m ago•0 comments

Kiwix: Free educational content, offline browser apps, and local hotspot device

https://kiwix.org/en/
2•adityaathalye•20m ago•1 comments

The Economist – Archive 1945 – NotebookLM

https://notebooklm.google.com/notebook/34510332-d39c-499e-882d-e48393d612cd
3•instagraham•25m ago•0 comments

ChatGPT involvement in mentally-ill person's murder and suicide

https://en.wikipedia.org/wiki/Murder_of_Suzanne_Adams
3•d_silin•26m ago•0 comments

Show HN: Sessy – Open-source email observability for AWS SES

https://github.com/marckohlbrugge/sessy
1•marckohlbrugge•28m ago•0 comments

Fork Yeah: We're keeping ingress-Nginx alive

https://www.chainguard.dev/unchained/keeping-ingress-nginx-alive
2•gpi•30m ago•0 comments

Crazy Jam Jar: Match-3 Blast for Nonstop Fun

https://ibb22.com/casino/bbgame-13370/
1•gamedemoplayer•30m ago•1 comments

A Big, Long Day: The Fastest Known Time on the Everest Base Camp Trail

https://strivetrips.org/blog/ebc-writeup/
2•mcoliver•32m ago•0 comments

A new era of Stack Overflow

https://stackoverflow.blog/2025/12/30/a-new-era-of-stack-overflow/
1•gudzpoz•32m ago•0 comments

Sirius DB

https://www.sirius-db.com/
1•manoji•34m ago•0 comments

Conduit (Rust Matrix Server) v0.10.11 another critical vulnerability

https://conduit.rs/changelog/#v0-10-11-2025-12-30
2•acheong08•42m ago•0 comments

Apps Let You Bet on Deportations and Famine. Mainstream Media Is Eating It Up

https://theintercept.com/2025/12/29/polymarket-kalshi-betting-prediction-cnn-news-media/
2•thm•48m ago•0 comments

Show HN: S3Broker – CF Worker library to protect your S3 storage from ransomware

https://github.com/tsunrise/s3broker
1•tsunrise•49m ago•0 comments

Show HN: Perfetto2LLM - A tool to pass system traces to an LLM

https://perfetto-to-llm.vercel.app/
2•ak2242•50m ago•0 comments

Nexels

https://lessvrong.com/cs/nexels/
1•ibobev•51m ago•0 comments

Show HN: Supertictactoe.gg – A real-time PvP implementation of Ultimate TTT

https://supertictactoe.gg
1•dheesh•52m ago•0 comments

Direct3D 12: The Behavior of ClearUnorderedAccessViewUint/Float

https://asawicki.info/news_1795_secrets_of_direct3d_12_the_behavior_of_clearunorderedaccessviewui...
1•ibobev•52m ago•0 comments

Microsoft's Nadella overhauls leadership as he plots AI strategy beyond OpenAI

https://www.ft.com/content/255dbecc-5c57-4928-824f-b3f2d764f635
4•JamesAdir•53m ago•1 comments

OpenUSD Core Spec 1.0 is Here

https://aousd.org/blog/foundations-of-open-3d-development-introducing-aousd-core-specification-1-0/
1•ibobev•54m ago•0 comments

RunST does not prevent resources from escaping

https://welltypedwit.ch/posts/runst-does-not-prevent-resources-from-escaping.html
1•todsacerdoti•56m ago•0 comments

ByteDance to pour US$14B into Nvidia chips in 2026

https://www.scmp.com/tech/big-tech/article/3338191/bytedance-pour-us14-billion-nvidia-chips-2026-...
2•mfiguiere•57m ago•0 comments