frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

The Guardian view on Europe's payments problem: sovereignty starts at the till

https://www.theguardian.com/commentisfree/2026/jan/25/the-guardian-view-on-europes-payments-probl...
1•mcc1ane•36s ago•0 comments

Show HN: I used my book generator to generate a catalog of books it can generate

https://www.ebook-forge.com/Omni
1•lywald•50s ago•1 comments

Show HN: Forward My Inbox – IMAP‑to‑Gmail Forwarding After Gmail Kills POP3

https://forwardmyinbox.com/
1•moshetanzer•3m ago•0 comments

Hypergrowth Isn't Always Easy

https://tailscale.com/blog/hypergrowth-isnt-always-easy
1•tosh•5m ago•0 comments

New study disrupts the narrative that ChatGPT's launch triggered a job decline

https://the-decoder.com/new-study-disrupts-the-narrative-that-chatgpts-launch-triggered-a-job-dec...
1•Vaslo•5m ago•0 comments

Life Map

https://lifemap.mattrighetti.com/
1•akktor•6m ago•0 comments

Agent Index: Building a "Tiobe Index" for AI Coding Agents (January Survey)

https://agentic-coding-survey.pages.dev/
1•7777777phil•7m ago•1 comments

The behavioral cost of personalized pricing

https://digitalseams.com/blog/the-behavioral-cost-of-personalized-pricing
3•bobbiechen•8m ago•0 comments

Steinway Spirio: The most famous concert pianos got a major tech upgrade (2024)

https://www.technologyreview.com/2024/02/28/1088268/steinway-spirio-concert-pianos-performance-up...
1•xeonmc•10m ago•0 comments

The Most Extreme CSS Reset Ever Created: 10k Lines of Failure

https://dustin.boston/css-reset/
2•clairvoyant_cod•10m ago•0 comments

Incidental Complexity

https://blog.kasperhermansen.com/posts/incidental-complexity/
1•kjuulh•14m ago•0 comments

Spanish track was fractured before high-speed train disaster, report finds

https://www.bbc.com/news/articles/c1m77dmxlvlo
2•Rygian•14m ago•0 comments

The AI Revolution in Coding: Why I'm Ignoring the Prophets of Doom

https://codingismycraft.blog/index.php/2026/01/23/the-ai-revolution-in-coding-why-im-ignoring-the...
11•mmphosis•17m ago•1 comments

A Metabolic Workspace

https://www.joanwestenberg.com/a-metabolic-workspace/
1•andsoitis•22m ago•0 comments

First, Make Me Care

https://gwern.net/blog/2026/make-me-care
2•andsoitis•23m ago•0 comments

National poll: Less than half of parents say swearing is never OK for kids

https://www.michiganmedicine.org/health-lab/less-half-parents-say-swearing-never-ok-kids
3•PaulHoule•26m ago•0 comments

Frozen Insight in a Moving World

https://jdu.github.io/2026-01-25-frozen-insights-in-a-moving-world.html
1•todsacerdoti•26m ago•0 comments

Strategies and lessons from partitioning a 17TB table in PostgreSQL

https://www.tines.com/blog/futureproofing-tines-partitioning-a-17tb-table-in-postgresql/
1•shayonj•27m ago•0 comments

List of Engineering Blunders

https://en.wikipedia.org/wiki/List_of_engineering_blunders
4•erhuve•27m ago•1 comments

The API Authorization Hierarchy of Needs: Why You Aren't Ready for AI Agents

https://auth0.com/blog/api-authorization-hierarchy-needs/
1•aaguiarz•29m ago•0 comments

Show HN: HyprKCS – A fast, native GTK4/Adwaita keybind manager for Hyprland

https://github.com/kosa12/hyprKCS
1•kosa12•29m ago•0 comments

Show HN: Decompile and deminify Bun using an LLM

https://www.npmjs.com/package/@shepherdjerred/bun-decompile
1•shepherdjerred•30m ago•0 comments

Show HN: Fdir – find and organize anything on your system

https://github.com/VG-dev1/fdir
1•Orbyss_Studio•31m ago•0 comments

Forza's Game Studio Rejects No-AI Clause, French VA Localization Canceled

https://twitter.com/MathieuTouquet/status/2015425148237533311
2•WhereIsTheTruth•31m ago•0 comments

Show HN: Uv-pack – Pack a uv environment for later portable (offline) install

https://github.com/davnn/uv-pack
2•davnn•31m ago•0 comments

Animals Build a Sense of Direction

https://www.quantamagazine.org/how-animals-build-a-sense-of-direction-20260121/
1•tzury•31m ago•0 comments

ACM Conference on Reproducibility and Replicability

https://acmrep.github.io
2•jruohonen•32m ago•1 comments

Generative AI is not trained on "data"

https://deniz.aksimsek.tr/2026/training-data/
1•speckx•32m ago•0 comments

PkgFed: ActivityPub for Package Releases

https://nesbitt.io/2026/01/25/pkgfed-activitypub-for-package-releases.html
2•8organicbits•32m ago•0 comments

Why Building AI Agents Is Mostly a Waste of Time

https://medium.com/data-science-collective/why-building-ai-agents-is-mostly-a-waste-of-time-55600...
4•onurkanbkrc•36m ago•1 comments