Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•12mo ago

Comments

kate_at_refact•12mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
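
To make the plan / execute / test / self-correct cycle concrete, here is a minimal Python sketch of that kind of loop. It is not Refact.ai's implementation: every name in it (AgentRun, run_agent, the explore / edit / run_tests / analyze tools, and the choose policy that stands in for the orchestrator model) is a hypothetical placeholder, and the tools are toy stubs rather than real repository operations.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types and names for illustration only; none come from Refact.ai.
Tool = Callable[[str], str]


@dataclass
class AgentRun:
    task: str
    tools: Dict[str, Tool]
    max_steps: int = 10
    log: List[Tuple[str, str]] = field(default_factory=list)

    def step(self, name: str, arg: str) -> str:
        """Call one tool and record its name and result."""
        result = self.tools[name](arg)
        self.log.append((name, result))
        return result


def run_agent(run: AgentRun, choose: Callable[[AgentRun], Tuple[str, str]]) -> str:
    """Pick a tool each step until the tests pass or the step budget runs out.

    `choose` stands in for the orchestrator model: it reads the log so far
    and returns the next (tool name, argument) pair.
    """
    for _ in range(run.max_steps):
        name, arg = choose(run)
        result = run.step(name, arg)
        if name == "run_tests" and result == "pass":
            return "solved"
    return "gave_up"


def demo() -> Tuple[str, List[str]]:
    state = {"attempts": 0}

    def explore(arg: str) -> str:
        return "found bug in " + arg

    def edit(arg: str) -> str:
        state["attempts"] += 1
        return "patched"

    def run_tests(arg: str) -> str:
        # Toy behavior: the first patch fails, the second one passes.
        return "pass" if state["attempts"] >= 2 else "fail"

    def analyze(arg: str) -> str:
        return "try a different fix"

    tools = {"explore": explore, "edit": edit,
             "run_tests": run_tests, "analyze": analyze}

    def choose(run: AgentRun) -> Tuple[str, str]:
        if not run.log:
            return ("explore", run.task)
        last_name, last_result = run.log[-1]
        if last_name in ("explore", "analyze"):
            return ("edit", last_result)
        if last_name == "edit":
            return ("run_tests", "")
        return ("analyze", last_result)  # tests failed: reason, then retry

    run = AgentRun(task="example-task", tools=tools)
    status = run_agent(run, choose)
    return status, [name for name, _ in run.log]
```

Calling demo() runs one multi-step pass in which a failed test triggers an analysis step and a second edit before the run ends. In a real agent the choose function would be a model call and the tools would touch an actual repository.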

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
