frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: Qlog – grep for logs, but 100x faster

https://github.com/Cosm00/qlog
1•cosm00•1m ago•0 comments

Building a New Flash

https://bill.newgrounds.com/news/post/1607118
1•TechPlasma•2m ago•0 comments

Im in needs and going to sale my resonantgenesis.xyz

https://resonantgenesis.xyz/
1•nemesh38•3m ago•1 comments

Was Windows 1.0's lack of overlapping windows a legal or a technical matter?

https://retrocomputing.stackexchange.com/questions/32511/was-windows-1-0s-lack-of-overlapping-win...
1•SeenNotHeard•3m ago•0 comments

Show HN: MixPal, a tiny social network for sharing mixtapes by request

https://mixpal.net
1•lowercasename•3m ago•0 comments

Show HN: Ring Widget – iOS/macOS Widgets for Your Oura Ring

https://www.ringwidget.app/
1•drankou•4m ago•0 comments

Show HN: NexQuake – Q1 Browser Multiplayer (Docker, WASM, Go)

https://kitty1.quake.nexus/
1•brsm•5m ago•0 comments

Shopify Development Agency – Design and Development Experts

https://ecommerce.folio3.com/shopify-development/shopify-agency/
1•mikeconner•7m ago•0 comments

Japanese man arrested for staining a temple in 2015

https://www3.nhk.or.jp/nhkworld/en/news/20260304_21/
2•shlip•11m ago•0 comments

So long, and thanks for all the logs

https://jerodsanto.net/2026/03/so-long-changelog/
1•surprisetalk•11m ago•0 comments

Show HN: AI Town – Your Claude conversation history as a living pixel city

https://aitown-seven.vercel.app
1•alexcloudstar•14m ago•0 comments

Autonomous Weapon Systems and International Humanitarian Law

https://www.icrc.org/en/article/autonomous-weapon-systems-and-international-humanitarian-law-sele...
1•johnbarron•15m ago•0 comments

Linux Mint: Monthly News – February 2026

https://blog.linuxmint.com/?p=5010
1•theschmed•16m ago•1 comments

Why Understanding AI Internals Won't Explain Agent Failures

https://www.vichoiglesias.com/writing/why-understanding-ai-internals-wont-explain-agent-failures
1•vichoiglesias•16m ago•0 comments

Real-time deepfake detection API and X integration demo

2•kmiyachi•17m ago•0 comments

The Power Brokers Behind the $250B Influencer Economy

https://www.wsj.com/lifestyle/careers/uta-influencer-managers-ali-berman-raina-penchansky-alix-ea...
2•thm•18m ago•0 comments

CPU scam: Chuwi CoreBook X uses AMD Ryzen 5 5500U instead of 7430U

https://www.notebookcheck.net/CPU-scam-Chuwi-CoreBook-X-uses-AMD-Ryzen-5-5500U-instead-of-7430U.1...
2•ndsipa_pomu•19m ago•0 comments

10% of Firefox crashes are caused by bitflips

https://mas.to/@gabrielesvelto/116171750653898304
2•marvinborner•20m ago•0 comments

Ask HN: How do you find contracting/freelance roles without recruiters nowadays?

1•Gooblebrai•21m ago•1 comments

Pentagon Eyes New 'Robot Ship' Concept for Low-Profile, All-Domain Logistics

https://nextgendefense.com/pentagon-robot-ship-concept/
1•asdefghyk•21m ago•1 comments

ChatRoutes is open source now

https://github.com/afzal-xyz/chatroutes-opensource
1•mednosis•22m ago•0 comments

Agent's context is a junk drawer

https://www.augmentcode.com/blog/your-agents-context-is-a-junk-drawer
1•knes•23m ago•0 comments

Show HN: OpenTimelineEngine – Shared local memory for Claude Code and codex

https://github.com/JOELJOSEPHCHALAKUDY/open-timeline-engine
1•joeljoseph_•23m ago•0 comments

I'm building a $15/mo status page would you pay for it?

https://www.indiehackers.com/post/im-building-a-15-mo-status-page-would-you-actually-pay-for-it-6...
1•Powellfgn•24m ago•1 comments

The Purpose of Keyboard Bumps – Its Not What You Think

https://www.youtube.com/watch?v=FfkxxSOforw
1•aloneguid•24m ago•0 comments

Enterprise UI Module Federation

https://stevekinney.com/courses/enterprise-ui/module-federation
1•nadis•25m ago•0 comments

Show HN: We want to kill SaaS glue code with one shared infrastructure model

https://wacht.dev/
1•snipextt•25m ago•0 comments

Show HN: Tyop: A macOS menu bar app that fixes typos on demand

https://github.com/liamg/tyop
1•liamg•26m ago•0 comments

Show HN: safe-docx lets coding agents edit Word docs without breaking formatting

https://github.com/UseJunior/safe-docx
1•sobiajulu•26m ago•2 comments

Show HN: I built a language app that generates songs from your vocab list

https://www.lingotify.app/
1•gursu8•27m ago•0 comments