frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Myrient will shut down on 31 March 2026. Download any content you find important

https://myrient.erista.me
1•chaifeng•2m ago•0 comments

Neural-Temporal Compression – A State-Persistence Framework

https://github.com/andresuarus10-byte/memory-engine
1•KaelyrAT13•4m ago•1 comments

Show HN: A Calculator for Garden Horizons

https://gardenhorizons.app/
1•hugh1st•5m ago•0 comments

Doing a Video Call over a Database

https://www.youtube.com/watch?v=zwIc9fFcYVw
1•Jacques2Marais•7m ago•0 comments

Superagers' Secret Ingredient May Be the Growth of New Brain Cells

https://www.sciencealert.com/superagers-secret-ingredient-may-be-the-growth-of-new-brain-cells
1•jnord•8m ago•0 comments

Fooling Go's X.509 Certificate Verification

https://danielmangum.com/posts/fooling-go-x509-certificate-verification/
1•hasheddan•10m ago•0 comments

'To be free, we have to be feared,' Macron says in keynote nuclear speech

https://www.france24.com/en/france/20260302-macron-unveils-france-nuclear-strategy-eu-counter-rus...
1•vrganj•10m ago•0 comments

I built a pint-sized Macintosh

https://www.jeffgeerling.com/blog/2026/pint-sized-macintosh-pico-micro-mac/
2•ingve•16m ago•0 comments

Ask HN: How to get traction for Open-Source Projects

1•human_hack3r•17m ago•0 comments

Show HN: Proofbox – Defensive Publishing Platform to preserve freedom to operate

https://www.proofbox.co/en
2•gartheuncle•18m ago•0 comments

Building AI agent for our own company

https://blog.leanmcp.com/blog/llms-getting-leanmcp-wrong
1•dheerajmp•19m ago•0 comments

Show HN: I built a social media distribution tool that helps you find users

https://signal-grow.vercel.app
1•dog52841•22m ago•1 comments

Performance Analysis and Tuning on Modern CPUs 2nd edition [pdf]

https://github.com/dendibakh/perf-book/releases/download/2.0_release/PerformanceAnalysisAndTuning...
3•medbar•22m ago•0 comments

Show HN: Neural Siege – A Multi-Agent RL Combat Simulation

https://github.com/ayushdnb/Neural-Siege
2•luthor190397•26m ago•0 comments

4. How to Keep Using Nano Banana Pro After Gemini Replaces It with Nano Banana 2

2•zaaaaooo•27m ago•0 comments

Show HN: C-Suite Skills – a full exec team as skills

https://github.com/pollow/c-suite-skills
2•pollow•32m ago•0 comments

Veo 3 AI

https://veo-3-ai.org/
2•Evan233•36m ago•1 comments

Show HN: GitHub Repo Agent – an agent that explores and reasons on GitHub repos

https://github.com/gauravvij/GithubRepoAgent
3•gauravvij137•39m ago•0 comments

I Put a Full JVM Inside a Browser Tab

https://bmarti44.substack.com/p/i-put-a-full-jvm-inside-a-browser
3•todsacerdoti•41m ago•0 comments

Full speech pipeline in native Swift/MLX – ASR, TTS, speech-to-speech, on-device

https://github.com/ivan-digital/qwen3-asr-swift
2•ipotapov•41m ago•1 comments

People in northeast BC say rest of province should embrace year-round time zone

https://www.cbc.ca/news/canada/british-columbia/time-change-british-columbia-9.7112139
2•divbzero•42m ago•0 comments

California to require age verification for all OS including Linux

https://www.tomshardware.com/software/operating-systems/california-introduces-age-verification-law
4•hambes•43m ago•1 comments

Rare Not Random – Using Token Efficiency for Secrets Scanning

https://lookingatcomputer.substack.com/p/rare-not-random
2•boyter•45m ago•0 comments

Strict Monospace Font for LLM-CLI users using Chinese Japanese Korean, CodexMono

https://www.npmjs.com/package/@monolex/codexmono
4•monokist•48m ago•1 comments

Working on Things That Suck

https://mayberay.bearblog.dev/working-on-things-that-suck/
2•mugamuga•51m ago•0 comments

Ask HN: How Do Emergency Alerts on Phone Work?

2•rishikeshs•55m ago•2 comments

US President struggles to explain why he launched another Middle Eastern war

https://www.ft.com/content/fd31c6ad-39f0-4fae-851c-fadf44f006eb
10•Jimmc414•1h ago•3 comments

Apple Does Value (Week)

https://om.co/2026/03/02/apple-does-value-week/
1•tosh•1h ago•1 comments

The Pointless War Between The Pentagon and Anthropic

https://www.wsj.com/opinion/the-pointless-war-between-the-pentagon-and-anthropic-9284fd37
5•jrosenblatt•1h ago•2 comments

Show HN: wo; a better CD for repo management

https://github.com/anishalle/wo
1•itsagamer124•1h ago•0 comments