Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
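
The workflow described above (an orchestrator that picks tools as needed and iterates until one solution passes its tests) can be sketched as a generic tool-dispatch loop. This is only a minimal illustration under stated assumptions, not Refact.ai's implementation: `AgentRun`, the tool names `search`, `edit`, and `test`, and the scripted `policy` that stands in for the orchestrator model are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a tool maps a text argument to a text result,
# and a step records (tool_name, argument, output).
Tool = Callable[[str], str]
Step = Tuple[str, str, str]


@dataclass
class AgentRun:
    tools: Dict[str, Tool]
    # Stand-in for the orchestrator model: given the history, it returns
    # (tool_name, argument), or ("finish", final_patch) to stop.
    policy: Callable[[List[Step]], Tuple[str, str]]
    max_steps: int = 20
    history: List[Step] = field(default_factory=list)

    def run(self) -> str:
        for _ in range(self.max_steps):
            name, arg = self.policy(self.history)
            if name == "finish":
                return arg
            tool = self.tools.get(name)
            output = tool(arg) if tool else f"unknown tool: {name}"
            self.history.append((name, arg, output))
        raise RuntimeError("step budget exhausted without a final answer")


def demo() -> Tuple[str, int]:
    # Toy "repository": a single function body that starts out buggy.
    state = {"body": "return a - b"}
    candidates = ["return a * b", "return a + b"]  # first attempt is wrong

    def search(query: str) -> str:
        return f"found '{query}' with body: {state['body']}"

    def edit(new_body: str) -> str:
        state["body"] = new_body
        return "edited"

    def test(_: str) -> str:
        return "pass" if state["body"] == "return a + b" else "fail"

    def policy(history: List[Step]) -> Tuple[str, str]:
        if not history:
            return ("search", "add")
        name, _arg, output = history[-1]
        if name == "test" and output == "pass":
            return ("finish", state["body"])
        if name == "edit":
            return ("test", "")
        # After a search or a failing test, try the next candidate edit.
        attempts = sum(1 for step in history if step[0] == "edit")
        return ("edit", candidates[attempts])

    agent = AgentRun(tools={"search": search, "edit": edit, "test": test},
                     policy=policy)
    patch = agent.run()
    return patch, len(agent.history)
```

In this toy run the first edit fails its test, the policy self-corrects with a second edit, and the loop ends with a single final patch. A real agent would replace the scripted `policy` with model calls and the toy tools with actual repository operations.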

Technical details of our SWE-bench approach are in the blog post: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
