frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•12mo ago

Comments

kate_at_refact•12mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

DeepSeek: Thinking with Visual Primitives [pdf]

https://huggingface.co/datasets/NodeLinker/deepseek-ai-Thinking-with-Visual-Primitives-deleted-re...
1•krackers•1m ago•0 comments

A Prescription for Fixing the Prevailing Wage System [pdf]

https://ifp.org/wp-content/uploads/IFP_Prevailing_Wage_Experience_Benchmarking.pdf
1•rustoo•2m ago•0 comments

Neural surrogate experiments for physics simulation, automated with Opus and Cod

https://blog.1001ud.me/technical/experimental/physics-ml/neural-surrogates
1•lekan_digital•3m ago•0 comments

Benchmarking a Bug Scanner

https://blog.detail.dev/posts/bug-scanner/
1•drob•4m ago•0 comments

3D Tic Tac Toe

https://apps.apple.com/at/app/3d-ttt/id6763501981
1•franze•5m ago•1 comments

Silicon Valley Is Bracing for a Permanent Underclass

https://www.nytimes.com/2026/04/30/opinion/ai-labor-work-force-silicon-valley.html
1•reducesuffering•5m ago•0 comments

We Built AI Agents That Think, Shop, and Choose Like Real Consumers

https://sediman.com/research/ai-agents-consumer-research
1•JasonHEIN•5m ago•0 comments

You've Got (Too Much) Mail: Behind the Scenes of the 3/25/26 Voice Outage

https://discord.com/blog/behind-the-scenes-of-the-3-25-26-voice-outage
1•cyndunlop•5m ago•0 comments

Riviera – A Reverb plugin for DAWs that you can demo in the browser

https://riviera-demo.surge.sh/
1•stagas•7m ago•0 comments

U.S. Senators Vote to Ban Themselves from Trading on Prediction Markets

https://www.wsj.com/politics/policy/senators-vote-to-ban-themselves-from-trading-on-prediction-ma...
6•kamaraju•8m ago•0 comments

The debt crisis could cost average US household $18k/year

https://fortune.com/2026/04/30/national-debt-today-crisis-solutions-tax-brookings/
2•littlexsparkee•9m ago•1 comments

With AI, Your Internet History Is Attributable to You Personally

https://thecarrierwave.substack.com/p/with-ai-your-entire-internet-history
1•23j423j423hj•9m ago•0 comments

Belgium seeks nationalization of nuclear power plants

https://www.dw.com/en/belgium-seeks-nationalization-of-nuclear-power-plants/a-76993170
1•rustoo•10m ago•0 comments

LinkedIn scans for 6,278 extensions and encrypts the results into every request

https://404privacy.com/blog/linkedin-is-scanning-your-browser-extensions-this-is-how-they-use-the...
1•un-nf•10m ago•1 comments

How I built an Autonomous AI Agent team that runs 24/7

https://twitter.com/Saboo_Shubham_/status/2022014147450614038
1•rmason•11m ago•0 comments

Chernobyl, 40 Years Later

https://nautil.us/chernobyl-40-years-later-1280322
1•Brajeshwar•11m ago•0 comments

The HIPAA Violations Hiding in Your Team's Browser History

https://threegates.ai/blog/hipaa-violations-hiding-in-browser-history/
2•rndkeithw•11m ago•0 comments

Remix 3 Beta Preview

https://remix.run/blog/remix-3-beta-preview
2•pspeter3•15m ago•0 comments

Agentic Coding Is Burning Me Out – Sid's Blog

https://0xsid.com/blog/agentic-coding-fatigue
3•evo_9•15m ago•0 comments

CIA Ran MK-Ultra Experiments on Prisoners of War in Custody, Declassified Docs

https://theintercept.com/2026/04/26/mk-ultra-korean-war-prisoner-experiments/
3•pseudolus•16m ago•0 comments

YouTube has demonITised us. Here's what's next.[video; comments]

https://inv.nadeko.net/watch?__goaway_challenge=js-refresh&__goaway_id=ca717db49b8189092f97f7680d...
1•rolph•16m ago•1 comments

French prosecutors link 15-year-old to mega-breach at state's document agency

https://www.theregister.com/2026/04/30/french_gov_mega_breach_suspect/
1•Cider9986•17m ago•0 comments

Show HN: MCP Servers Can Fix the Biggest Problem with AI Coding Assistants

https://medium.com/@xcf.seetan/how-mcp-servers-can-fix-the-biggest-problem-with-ai-coding-assista...
1•xcf_seetan•18m ago•0 comments

A scar-tissue lessons library for Claude Code (compounds across projects)

https://github.com/tenxengineer/claude-code-enhance
1•tenxsengineer•19m ago•0 comments

Chinese Courts Rule Companies Cannot Fire Workers Simply to Replace Them with AI

https://www.caixinglobal.com/2026-04-30/chinese-courts-rule-companies-cannot-fire-workers-simply-...
3•vrganj•20m ago•0 comments

It's well past time we unionize

https://code-cwa.org/
3•systemerror•21m ago•2 comments

Show HN: Byok-relay – self-hosted proxy for BYOK LLM apps without CORS issues

https://github.com/avikalpg/byok-relay
1•avikalp•22m ago•0 comments

Discomfort with modern technology shapes Gen Z's desire to live in the past

https://www.nbcnews.com/politics/politics-news/discomfort-modern-technology-gen-z-desire-live-pas...
2•senorqa•23m ago•0 comments

My Kindle Turned 10

https://mikolajbiernat.com/blog/my-kindle-turned-10
1•surplus2889•23m ago•0 comments

Software engineer driven to insanity from 2026 Job Market

https://www.youtube.com/watch?v=TYuSEeuUhPo
2•tcp_handshaker•24m ago•0 comments