Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
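
As a rough illustration only (not Refact.ai's actual code), the plan / execute / test / self-correct loop described above could be sketched like this. Everything here is an assumption made for the example: the `Step` and `RunState` types, the `run_agent` function, the stub tools, and the scripted policy that stands in for the LLM orchestrator. Only the tool name `deep_analysis` comes from the post, and its body here is a placeholder.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Step:
    tool: str  # name of the tool to call
    arg: str   # free-form argument for that tool


@dataclass
class RunState:
    history: List[str] = field(default_factory=list)  # log of "tool: result"
    patch: str = ""                                   # current candidate fix
    tests_pass: bool = False


# --- Hypothetical stub tools; a real agent would touch a repository. ---
def explore(arg: str, state: RunState) -> str:
    return "found src/app.py"


def deep_analysis(arg: str, state: RunState) -> str:
    return "suspect off-by-one in loop bound"


def edit(arg: str, state: RunState) -> str:
    state.patch = arg
    return "patch applied"


def run_tests(arg: str, state: RunState) -> str:
    # Toy check: the "tests" pass only once the patch mentions "fixed".
    state.tests_pass = "fixed" in state.patch
    return "pass" if state.tests_pass else "fail"


TOOLS: Dict[str, Callable[[str, RunState], str]] = {
    "explore": explore,
    "deep_analysis": deep_analysis,
    "edit": edit,
    "run_tests": run_tests,
}


def scripted_policy(task: str, state: RunState) -> Optional[Step]:
    """Stand-in for the orchestrator model: picks the next tool call."""
    if state.tests_pass:
        return None  # done: deliver the single solution
    if len(state.history) == 0:
        return Step("explore", task)
    if len(state.history) == 1:
        return Step("deep_analysis", task)
    last = state.history[-1]
    if last.startswith("run_tests"):
        return Step("edit", "fixed loop bound")  # self-correct after a failure
    if last.startswith("edit"):
        return Step("run_tests", "")
    return Step("edit", "first attempt")


def run_agent(task: str, policy, tools, max_steps: int = 10) -> RunState:
    """One multi-step run that ends with at most one candidate patch."""
    state = RunState()
    for _ in range(max_steps):
        step = policy(task, state)
        if step is None:
            break
        result = tools[step.tool](step.arg, state)
        state.history.append(f"{step.tool}: {result}")
    return state


final = run_agent("fix failing test", scripted_policy, TOOLS)
for line in final.history:
    print(line)
```

In this toy run the first edit fails the tests, so the policy issues a second edit and re-runs them. The point is only that tool choice depends on the state so far rather than on a fixed script.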

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Anthropic Unveils Amazon Inspired Marketplace

https://www.bloomberg.com/news/articles/2026-03-06/anthropic-unveils-amazon-inspired-marketplace-...
1•dthread3•9m ago•0 comments

Show HN: Glad-IA-Tor – Tired of Vibecoded Products? Come and Roast Them for Free

https://glad-ia-tor.com/
1•GiornoJojo•9m ago•1 comments

Ontology (Information Science)

https://en.wikipedia.org/wiki/Ontology_(information_science)
1•downboots•10m ago•0 comments

Show HN: Wireframable – Generate wireframes from any website URL

https://wireframable.com/
1•rosiepuppy•11m ago•0 comments

Google Always-On Memory Agent

https://github.com/GoogleCloudPlatform/generative-ai/tree/main/gemini/agents/always-on-memory-agent
1•sowbug•12m ago•1 comments

Tractography

https://en.wikipedia.org/wiki/Tractography
1•downboots•15m ago•0 comments

Show HN: SurvivalIndex – which developer tools do AI agents choose?

https://survivalindex.org/
1•scalefirst•15m ago•1 comments

FounderScope – Integrated business model validation platform

https://workspace.founderscope.app/
1•zekiunal•17m ago•1 comments

The 2026 Global Intelligence Crisis - postings for devs are rising, up 11% YoY

https://www.citadelsecurities.com/news-and-insights/2026-global-intelligence-crisis/
1•alhazrod•19m ago•1 comments

Show HN: DiggaByte Labs – pick your stack, download production-ready SaaS code

https://diggabyte.com/
1•GraysoftDev•20m ago•0 comments

Love, Premonition and a Robot Partner

https://twitter.com/expatlitj/status/2029554217958916277
1•shikano•20m ago•0 comments

The State of Consumer AI

https://apoorv03.com/p/the-state-of-consumer-ai-part-1-usage
1•gmays•23m ago•0 comments

Show HN: I accidentally caught an AI agent trying to poison my prod config

https://github.com/liuhaotian2024-prog/k9-solo-hook
1•zippolyon•24m ago•0 comments

AI and the Illegal War

https://buttondown.com/creativegood/archive/ai-and-the-illegal-war/
3•interpol_p•25m ago•0 comments

An ugly year for the Louvre: where does the biggest museum go from here?

https://www.theguardian.com/world/ng-interactive/2026/mar/01/an-ugly-year-for-the-louvre-where-do...
1•PaulHoule•25m ago•0 comments

Show HN: Citepo-CLI, a lightweight CLI for creating blogs, build for AI agent

https://github.com/LinklyAI/citepo-cli
1•blueeon•26m ago•0 comments

Big Sleep Tracker: Google Project Zero + Google DeepMind find security bugs

https://issuetracker.google.com/savedsearches/7155917
2•guessmyname•28m ago•0 comments

Suggestion Regarding References to the Prophet Muhammad (Peace Be Upon Him)

1•naseerwafa•29m ago•0 comments

Show HN: Career AutoPilot – AI guidance for navigating your career

https://www.careerautopilot.ai
2•bvikasgupta•29m ago•0 comments

Can a wealthy family change the course of a deadly brain disease?

https://www.science.org/content/article/can-wealthy-family-change-course-deadly-brain-disease
6•Snoozus•33m ago•0 comments

Show HN: Contd makes interactive CLIs usable for agents in an async way

https://github.com/werifu/contd
1•wefchen•33m ago•0 comments

Hitting the High Notes (2005)

https://www.joelonsoftware.com/2005/07/25/hitting-the-high-notes/
1•benatkin•39m ago•0 comments

Show HN: What zero-intervention E2E test generation looks like

https://www.youtube.com/watch?v=G6mtaC15ocw
1•nadeem1•40m ago•0 comments

Neolab and Emerging AI Lab Tracker

https://cleverhack.com/neolab-and-emerging-ai-lab-tracker
2•jxmorris12•42m ago•0 comments

"Clinejection" Turned an AI Bot into a Supply Chain Attack

https://snyk.io/blog/cline-supply-chain-attack-prompt-injection-github-actions/
1•vismit2000•45m ago•0 comments

Show HN: Managed S3 exports for billing data (no AWS setup required)

https://flexprice.io/
3•manishfp•47m ago•0 comments

Coruna: The Mysterious Journey of a Powerful iOS Exploit Kit

https://cloud.google.com/blog/topics/threat-intelligence/coruna-powerful-ios-exploit-kit
1•mitchbob•50m ago•0 comments

Vibe Security Radar – Tracking the security cost of vibe coding

https://vibe-radar-ten.vercel.app
1•guessmyname•53m ago•0 comments

Spark Runner: Easily Automate Front End Tests

https://github.com/simonarthur/spark-runner/
1•chromaton•57m ago•1 comments

I built this privacy-focused analytics tool

1•webanalyzerapp•57m ago•0 comments