frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: I'm building an open platform to submit, rate and discover lectures

1•blackbrokkoli•34s ago•0 comments

The AI Industry's Most Expensive Mistake

https://www.thealgorithmicbridge.com/p/inside-the-ai-industrys-most-expensive
1•dev_tty01•40s ago•0 comments

What is it like to be a human being?

https://iai.tv/articles/what-is-it-like-to-be-a-human-being-auid-3544
1•rrwilla•1m ago•1 comments

Show HN: Faster more accurate multimodal vector search

https://github.com/nickswami/dasein-python-sdk
2•GaneshSuriya•2m ago•0 comments

Show HN: Corvo – Free portfolio analytics with MonteCarlo simulation and AI chat

https://corvo.capital/
1•vinaybatra•3m ago•0 comments

Open-source AI models for 3D generation

https://firethering.com/open-source-ai-3d-generators/
1•steveharing1•4m ago•0 comments

The state of high-speed rail in the U.S. [video]

https://www.youtube.com/watch?v=9Hm0_-bOB4Y
1•barronlroth•4m ago•0 comments

MirrorCode: Evidence that AI can do some weeks-long coding tasks

https://epoch.ai/blog/mirrorcode-preliminary-results/
2•tadamcz•5m ago•0 comments

Trump's Changes Lock Some Employers Out of H-1B Visa Program

https://www.nytimes.com/2026/04/10/us/politics/h1b-visa-program-changes.html
2•mitchbob•6m ago•1 comments

Next Project

https://www.amantulsyan.com/next-project-after-commenda/
1•amantulsyan35•7m ago•0 comments

I built ClawIDE: A web-based IDE for managing multiple Claude Code sessions

3•aeroxis•7m ago•1 comments

TypeScript stack: modern dev tools and platforms for startups

https://www.paralect.com/stack
1•igorkrasnik•11m ago•0 comments

What to Know About OpenAI's Ideas for a World with 'Superintelligence'

https://www.wsj.com/tech/ai/what-to-know-about-openais-ideas-for-a-world-with-superintelligence-e...
1•gmays•11m ago•0 comments

To Fill Air Traffic Controller Shortage, FAA Turns to Gamers

https://www.nytimes.com/2026/04/10/us/politics/air-traffic-controller-gamer.html
1•mitchbob•11m ago•1 comments

Abandoning Apple and Learning to Love Linux

https://jimjeffers.com/blog/abandoning-apple-and-learning-to-love-linux/
3•jimjeffers•12m ago•1 comments

Agents fail because software stopped being readable

https://adaptivesoftware.substack.com/p/what-agents-cant-read-they-cant-change
2•iristenteije•12m ago•0 comments

Show HN: LuxShot – Open-source, native macOS OCR utility

https://github.com/lukebuild/LuxShot
1•lukeiodev•13m ago•0 comments

Show HN: Formally Verified Leaderless Log Protocol for Kafka

https://github.com/lakestream-io/leaderless-log-protocol
2•sijieg•13m ago•1 comments

I used Codex to upgrade my 2013 Nexus 7 to Android 11

https://opuslabs.substack.com/p/breathing-life-into-my-13-year-old
2•opuslabs•14m ago•0 comments

ChatGPT's bug with scanned PDFs

https://medium.com/@sirk390/chatgpts-bug-with-scanned-pdfs-9fc9d5be38ba
1•sirk390•14m ago•0 comments

Why I'm Building a Database Engine in C#

https://nockawa.github.io/blog/why-building-database-engine-in-csharp/
3•vyrotek•15m ago•0 comments

Wikimind: A CLI that compiles raw documents into an interlinked wiki using LLMs

https://github.com/akashikprotocol/wikimind
1•sahildavid•19m ago•0 comments

"Memflation": Cheaper RAM not expected until 2028, says Gartner

https://www.heise.de/en/news/Memflation-Cheaper-RAM-not-expected-until-2028-says-Gartner-11249607...
4•doener•19m ago•0 comments

Neural Computers

https://arxiv.org/abs/2604.06425
1•tosh•19m ago•0 comments

RFC 9019 – A Firmware Update Architecture for Internet of Things (2021)

https://datatracker.ietf.org/doc/rfc9019/
1•Tomte•21m ago•0 comments

Bluesky April 2026 Outage Post-Mortem

https://pckt.blog/b/jcalabro/april-2026-outage-post-mortem-219ebg2
5•jcalabro•22m ago•1 comments

Compute iOS XNU offset from kernel cache

https://blog.reversesociety.co/blog/2026/kernel-rw-not-enough-extract-offsets-from-xnu-kernelcaches
1•tonygo•22m ago•0 comments

WireGuard makes new Windows release following Microsoft signing resolution

https://lists.zx2c4.com/pipermail/wireguard/2026-April/009561.html
25•zx2c4•24m ago•8 comments

Mybets.gg – AI-powered sports bet tracker with browser extension and analytics

https://www.mybets.gg
1•mybets•24m ago•0 comments

Show HN: I built a $3/yr AI workflow to stop doomscrolling Twitter for tech news

1•JustinLee-DEV•24m ago•0 comments