Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
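To make the plan / execute / test / self-correct loop concrete, here is a minimal sketch of one multi-step run. It is not Refact.ai's actual implementation: `run_agent`, `toy_propose`, and `toy_tests` are hypothetical names, and the two toy functions stand in for the orchestrator model and the testing tools.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class AgentRun:
    """Record of a single multi-step run on one task."""
    task: str
    patch: str = ""
    solved: bool = False
    history: List[str] = field(default_factory=list)


def run_agent(
    task: str,
    propose_patch: Callable[[str, str], str],
    run_tests: Callable[[str], Tuple[bool, str]],
    max_steps: int = 5,
) -> AgentRun:
    """Propose a patch, test it, and feed failures back until tests pass."""
    run = AgentRun(task=task)
    feedback = ""  # empty on the first attempt
    for step in range(1, max_steps + 1):
        run.patch = propose_patch(task, feedback)
        passed, feedback = run_tests(run.patch)
        run.history.append(f"step {step}: {'pass' if passed else 'fail'}")
        if passed:
            run.solved = True
            break
    return run


def toy_propose(task: str, feedback: str) -> str:
    # Hypothetical stand-in for the model: wrong first, corrected after feedback.
    if feedback:
        return "def add(a, b):\n    return a + b\n"
    return "def add(a, b):\n    return a - b\n"


def toy_tests(patch: str) -> Tuple[bool, str]:
    # Hypothetical stand-in for the test tool: run the patch and check one case.
    namespace: dict = {}
    exec(patch, namespace)
    ok = namespace["add"](2, 3) == 5
    return ok, ("" if ok else "add(2, 3) should be 5")


result = run_agent("fix add()", toy_propose, toy_tests)
print(result.solved, result.history)
```

The loop yields a single final patch per task, with intermediate failures used only as feedback for the next step.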

Technical details on our SWE-bench approach are in the blog post: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Cisco got hacked through a security scanner

https://vaultproof.dev/blog/cisco-trivy-hack
1•Rial_Labs•1m ago•0 comments

DOE explains spacetime

https://www.energy.gov/science/doe-explainsspacetime
1•hhs•2m ago•0 comments

GCC Translation Validation Part 6: Uninitialized Memory

https://kristerw.github.io/2026/04/10/uninitialized-memory/
1•matt_d•3m ago•0 comments

Show HN: The Universe in One Chart

https://saminloes.com/one-chart/
1•atleastoptimal•4m ago•0 comments

How RL Reward Hacking Made Claude Mythos a Zero-Day Hunter

https://uberdavid.substack.com/p/from-code-completion-to-zero-day
1•uberdavid•4m ago•0 comments

Peer Review Does Private, Elite Gatekeeping

https://criticalfallibilism.com/peer-review-does-private-elite-gatekeeping/
2•paulpauper•6m ago•1 comments

Meta Banks on AI to Clear the Smoke of Social-Media Lawsuits

https://www.wsj.com/tech/meta-banks-on-ai-to-clear-the-smoke-of-social-media-lawsuits-902263dc
2•1vuio0pswjnm7•6m ago•0 comments

Show HN: A free study guide for the AWS DVA-C02, built from my own exam notes

https://tofl.github.io
2•tomflitt•8m ago•0 comments

Women are getting most of the new jobs. What's going on with men?

https://www.npr.org/2026/04/10/nx-s1-5773327/women-men-jobs-health-care-manufacturing
2•harambae•9m ago•0 comments

SWE-Bench Verified Leaderboard March 2026 – Independent vs. Self-Reported Scores

https://www.marc0.dev/en/leaderboard
3•chenglin97•11m ago•0 comments

TruffleHog now finds all Deleted and Private Commits on GitHub (2024)

https://trufflesecurity.com/blog/trufflehog-now-finds-all-deleted-and-private-commits-on-github
1•password4321•12m ago•0 comments

Live Halftime Timer for NBA, NFL

https://thehalftimer.com/
1•LeviBL•12m ago•0 comments

A large-scale look at the exposome

https://hms.harvard.edu/news/large-scale-look-exposome
2•hhs•12m ago•0 comments

Artemis II Flight Day 10: Re-Entry Live Updates

https://www.nasa.gov/blogs/missions/2026/04/10/artemis-ii-flight-day-10-re-entry-live-updates/
2•layer8•12m ago•0 comments

Beware, fellow plutocrats, the pitchforks are coming [video]

https://www.youtube.com/watch?v=q2gO4DKVpa8
2•jyounker•16m ago•1 comments

Under the hood of MDN's new front end

https://developer.mozilla.org/en-US/blog/mdn-front-end-deep-dive/
2•0xedb•16m ago•0 comments

Honda's EV Reversal Just Killed Sony's Electric Car: TDS

https://www.thedrive.com/news/hondas-ev-reversal-just-killed-sonys-electric-car-tds
1•PaulHoule•18m ago•0 comments

Poll: Majority of voters say risks of AI outweigh the benefits

https://www.nbcnews.com/politics/politics-news/poll-majority-voters-say-risks-ai-outweigh-benefit...
3•cdrnsf•18m ago•0 comments

A public Agent Sandbox with Hermes inside

https://sandbox-sba1ad15f841c32f2f.treadstone-ai.dev/
4•earayu•21m ago•2 comments

A New Case Exposed the Clever Workaround the FBI Uses to Read Secure Messages

https://www.inc.com/chloe-aiello/a-new-case-exposed-the-clever-workaround-the-fbi-uses-to-read-se...
1•daft_pink•22m ago•0 comments

Blockchain.com bug causing wrong data to be displayed

https://www.blockchain.com/explorer/addresses/btc/1PWo3JeB9jrGwfHDNpdGK54CRas7fsVzXU
1•867-5309•23m ago•2 comments

A Communist Apple II and Fourteen Years of Not Knowing What You're Testing

https://llama.gs/blog/index.php/2026/04/10/friday-archaeology-a-communist-apple-ii-and-fourteen-y...
1•major4x•24m ago•0 comments

Password Manager Angst

https://www.tbray.org/ongoing/When/202x/2026/04/09/Password-Manager-Angst
1•timbray•24m ago•0 comments

Sigbovik 2026

https://sigbovik.org/2026/
1•blmayer•25m ago•1 comments

NASA's Artemis II Crew Comes Home [video]

https://www.youtube.com/watch?v=nfhDuOHMp0A
1•meetpateltech•27m ago•0 comments

AI and Cybersecurity: A Glass Half-Empty/Half-Full of Nitroglycerin

https://www.techdirt.com/2026/04/10/ai-and-cybersecurity-a-glass-half-empty-half-full-proposition...
2•hn_acker•29m ago•1 comments

Show HN: A 1KB zero-dependency relative time formatter for UI systems

https://appents.com/tech/human-time
1•hedayet•29m ago•0 comments

In-Place Test-Time Training

https://arxiv.org/abs/2604.06169
1•dgfl•29m ago•1 comments

How Many Lives Do Amber Alerts Save?

https://www.mcgill.ca/oss/article/critical-thinking-technology-history/how-many-lives-do-amber-al...
2•jprs•30m ago•0 comments

The Ancient Coding Language That 95% of ATMs Use [video]

https://www.youtube.com/watch?v=P8oc_UXgD2A
3•kerim-ca•32m ago•0 comments