frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Same SQL, Different Results: A Subtle Oracle vs. PostgreSQL Migration Bug

https://databaserookies.wordpress.com/2026/01/30/same-sql-different-results-a-subtle-oracle-vs-po...
1•tanelpoder•36s ago•0 comments

Non sucking, easy tool to convert any website to LLM ready data, Mojo

https://github.com/malvads/mojo
1•malvads•1m ago•1 comments

AIWriter unleashes your writing creativity

https://aiwriter.fun
1•zhouhua•3m ago•0 comments

Anthropic integrates interactive MCP apps into Claude

https://www.testingcatalog.com/anthropic-integrates-interactive-mcp-apps-into-claude/
1•gmays•5m ago•0 comments

DBaaS Performance Benchmarks – The results will shock you!

https://github.com/iamalnewkirk/dbaas-benchmark/blob/master/REPORT.md
1•iamalnewkirk•7m ago•0 comments

Opclaw.io – $10/Mo VPS with OpenClaw Preinstalled (4 VCPU, 8GB RAM, 150GB SSD)

https://opclaw.io/
1•eugeneevstafev•8m ago•1 comments

Ask HN: What are company or side project ideas that you have but will never do?

1•valgor•8m ago•0 comments

Show HN: Voiden – an offline, Git-native API tool built around Markdown

https://github.com/VoidenHQ/voiden
1•dhruv3006•9m ago•1 comments

French tech giant Capgemini to sell US subsidiary working for ICE

https://www.bbc.com/news/articles/cd9e4xw8vqqo
2•tartoran•9m ago•0 comments

Study finds cell memory can be more like a dimmer dial than an on/off switch

https://news.mit.edu/2025/study-finds-cell-memory-can-be-more-like-dimmer-dial-0909
1•stmw•12m ago•0 comments

Michael Beck, 65, Dies; First to Report Symptoms of 'Havana Syndrome'

https://www.nytimes.com/2026/01/29/us/michael-beck-dead.html
1•georgecmu•13m ago•0 comments

Commodore 64 JIT compilation into MSIL

https://old.reddit.com/r/dotnet/comments/1qsl99h/commodore_64_jit_compilation_into_msil/
3•KallDrexx•14m ago•0 comments

Show HN: Another social/job market for AI agents (this one paid bill)

https://ugig.net/
1•cranberryturkey•14m ago•0 comments

How Apple Hooks Fifty Thousand Methods [video]

https://www.youtube.com/watch?v=SuQGQ1vh9k0
1•zdw•16m ago•0 comments

Firebase: PostgreSQL

https://firebase.google.com/products/data-connect
2•tosh•17m ago•0 comments

'It Wasn't Working': Canada Province Ends Drug Decriminalization

https://www.barrons.com/news/it-wasn-t-working-canada-province-ends-drug-decriminalization-9047f3...
2•Teever•18m ago•0 comments

Israel to ban Médecins Sans Frontières (MSF) from working in Gaza

https://www.bbc.com/news/articles/cvg1ymmkpkro
14•tartoran•21m ago•0 comments

OpenClaw in Practice: A Small Team's Field Notes

https://www.subeasy.ai
1•terryops•25m ago•1 comments

Tesla TeraFab

https://grokipedia.com/page/Tesla_TeraFab
1•andsoitis•27m ago•1 comments

Show HN: CLI for prompt-driven API endpoint testing instead Postman, JMeter etc.

https://github.com/onurkanbakirci/prompmeter
1•onurkanbkrc•27m ago•1 comments

Show HN: A private FIRE calculator suite that runs in the browser

https://firenum.com/
2•Mikulas_Tomanka•30m ago•0 comments

Blue Origin to Pause New Shepard Flights for No Less Than Two Years

https://www.blueorigin.com/news/new-shepard-to-pause-flights
2•bookofjoe•33m ago•1 comments

Falsifiable Topological Signatures of "Faithful Reasoning" in LLMs

https://osf.io/2r6v8
1•Keeper123•33m ago•1 comments

Show HN: Credibly – Automate testimonial collection and analysis using OCR/AI

https://getcredibly.org
2•Shiv_Thomet•33m ago•0 comments

What Are Novels For?

https://www.economist.com/culture/2026/01/29/what-are-novels-for-george-saunders-has-answers
1•andsoitis•34m ago•0 comments

Falsifiable Topological Signatures of "Faithful Reasoning" in LLMs

1•Keeper123•34m ago•0 comments

Giant Virus Discovered in Japanese Pond May Hint at Multicellular Life's Origins

https://www.sciencealert.com/giant-virus-discovered-in-japanese-pond-may-hint-at-multicellular-li...
1•amichail•35m ago•0 comments

Hacking Health in 2026 with Wearables – What's on Your Wrist?

1•accofrisk•37m ago•0 comments

Show HN: A game where you invest in startups from history

https://store.steampowered.com/app/4249890/ZeroOne_Terminal/
1•vire00•38m ago•0 comments

The Route of History: Geology and Empire Collide on the Danube

https://worldhistory.substack.com/p/the-route-of-history
2•crescit_eundo•38m ago•0 comments