Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
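
To make the plan / execute / test / self-correct loop described above more concrete, here is a minimal sketch in Python. It is not Refact.ai's code: the AgentRun class, the explore/edit/test tool names, and the scripted policy are all hypothetical stand-ins, and no model is called. The policy function plays the role an orchestrator model would play, picking the next tool based on the latest observation.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

Tool = Callable[[str], str]  # hypothetical tool signature: text in, text out


@dataclass
class AgentRun:
    """One multi-step run over a single task (illustrative only)."""
    task: str
    tools: Dict[str, Tool]
    max_steps: int = 10
    log: List[Tuple[str, str]] = field(default_factory=list)

    def run(self, choose_step) -> Optional[str]:
        observation = self.task
        for _ in range(self.max_steps):
            tool_name, arg = choose_step(observation, self.log)
            if tool_name == "done":
                return arg  # the single final answer for this task
            observation = self.tools[tool_name](arg)
            self.log.append((tool_name, observation))
        return None  # step budget exhausted without a solution


def make_tools() -> Dict[str, Tool]:
    """Fake tools; the test tool fails once so the loop has to self-correct."""
    state = {"test_calls": 0}

    def explore(query: str) -> str:
        return f"suspect function located for: {query}"

    def edit(patch: str) -> str:
        return f"applied {patch}"

    def test(_: str) -> str:
        state["test_calls"] += 1
        return "tests passed" if state["test_calls"] >= 2 else "tests failed"

    return {"explore": explore, "edit": edit, "test": test}


def scripted_policy(observation: str, log: List[Tuple[str, str]]) -> Tuple[str, str]:
    """Stand-in for the orchestrator model: chooses the next tool dynamically."""
    if not log:
        return ("explore", observation)
    last_tool, _ = log[-1]
    if last_tool == "explore":
        return ("edit", "patch-1")
    if last_tool == "edit":
        return ("test", "")
    if observation == "tests passed":
        return ("done", "solution accepted")
    attempts = sum(1 for name, _ in log if name == "edit")
    return ("edit", f"patch-{attempts + 1}")  # self-correct after a failed test


agent = AgentRun(task="fix failing parser test", tools=make_tools())
result = agent.run(scripted_policy)
print(result, [name for name, _ in agent.log])
```

In this toy run the first patch fails its test, so the policy issues a second patch before finishing; a real agent would replace the scripted policy with model calls and the fake tools with repository operations.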

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Norwegian state newspaper investigates AI psychosis

https://www.nrk.no/direkte/xl/ki-psykose_-vi-undersokte-hvordan-chatbotter-svarer-1.17770908
1•hallvard•55s ago•0 comments

Little about software engineering has changed over the past three months

https://twitter.com/Grady_Booch/status/2026736492488568955
1•tosh•2m ago•0 comments

Axelera: Cutting-edge hardware and software platform for accelerating inference

https://axelera.ai
1•amelius•5m ago•0 comments

WTF Happened in 2025?

https://wtfhappened2025.com/
2•myk-e•5m ago•1 comments

OpenClaw broadcasts its screen while I'm at the gym

https://www.instagram.com/reel/DVN2zfSEXJj/
1•nomilk•8m ago•1 comments

Asura: Looped Language Models done better

https://neel04.github.io/my-website/projects/asura/
1•n7g•8m ago•1 comments

Facebook is now the EU law maker

https://thegoodlobby.eu/watchdogs-call-to-drop-ex-meta-lobbyist-as-digital-omnibus-rapporteur/
1•zoobab•9m ago•0 comments

Show HN: A Confidence Calibration Game

https://www.calibrategame.com/
1•SleepyJack•9m ago•0 comments

Memory Price Trends

https://fr.pcpartpicker.com/trends/price/memory/
1•tin7in•10m ago•0 comments

Sometimes Your Device Is Alive but Is Dead

https://dunkels.com/adam/sometimes-your-device-is-alive-but-is-actually-dead/
1•adunk•13m ago•0 comments

Why This Ransomware Attack Failed

https://kb-it.net/why_this_ransomware_attack_failed/
1•better-it•14m ago•0 comments

Technical Excellence Is Not Enough

https://raccoon.land/posts/technical-excellence-is-not-enough/
2•bo0tzz•14m ago•0 comments

Heinzel – AI-Powered Linux Server Administration with Claude Code

https://github.com/wintermeyer/heinzel
1•wintermeyer•18m ago•0 comments

Reducing the size of Go binaries by up to 77%

https://www.datadoghq.com/blog/engineering/agent-go-binaries/
1•birdculture•21m ago•0 comments

What a negative AI economic scenario could look like

https://deadneurons.substack.com/p/what-a-negative-ai-economic-scenario
1•nr378•22m ago•0 comments

How Russia is intercepting communications from European satellites

https://theconversation.com/how-russia-is-intercepting-communications-from-european-satellites-27...
1•robtherobber•23m ago•0 comments

Pdfpc: A presenter console with multi-monitor support for PDF files

https://pdfpc.github.io/
1•fanf2•24m ago•0 comments

Show HN: Who's Winning the AI Race?

https://whoswinningtheairace.com/
2•truffle_pig•26m ago•0 comments

8B tokens a day forced AT&T to rethink AI orchestration and cut costs by 90%

https://venturebeat.com/orchestration/8-billion-tokens-a-day-forced-at-and-t-to-rethink-ai-orches...
1•Daviey•27m ago•0 comments

Show HN: Codex builds a working NES Emulator in one hour

https://github.com/kaonashi-tyc/codex-nes-emulator
1•zi2zi-jit•28m ago•0 comments

Show HN: PsiGuard – real-time hallucination monitoring for LLM apps

1•brad_o_ley•29m ago•0 comments

Tech Monitor – Real-Time AI and Tech Industry Dashboard

https://tech.worldmonitor.app/
1•Daviey•30m ago•0 comments

Tell HN: YC companies scrape GitHub activity, send spam emails to users

7•miki123211•31m ago•0 comments

Thoughts on Coding Agents

https://dennybritz.com/posts/coding-agents/
1•dennybritz•34m ago•0 comments

SEO, AEO, and AI Visibility: The three metrics that define your Website's future

https://repuai.live/en/blog/seo-aeo-ai-visibility-metrics-website-analysis
1•bioneisme•35m ago•0 comments

I built a turn tracking app and I don't know if it's useful?

https://www.turnsies.app/signin?returnUrl=%2F
1•aidanw•35m ago•1 comments

Copland (Operating System)

https://en.wikipedia.org/wiki/Copland_(operating_system)
1•sanbor•35m ago•0 comments

PivotOrDie – a public startup survival tracker

https://pivotordie.club
1•fojia•36m ago•2 comments

Why "All we need is 1% of this large market" is a red flag

https://www.n47.com/insights/why-all-we-need-is-1-percent-of-this-very-large-market-is-a-red-flag
1•fzliu•46m ago•0 comments

ConTraSt – database of empirical results on consciousness theories

https://contrastdb.tau.ac.il/
1•paraschopra•47m ago•0 comments