Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
