Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
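
The workflow described above (plan, execute, test, self-correct, deliver one solution) can be illustrated with a toy loop. This is only a minimal sketch under stated assumptions, not Refact.ai's actual implementation: `AgentRun`, `scripted_policy`, and `make_toy_tools` are hypothetical names, the scripted policy stands in for the orchestrator model, and the tools are in-memory stubs rather than real repository or test tooling.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# All names below are hypothetical and chosen for illustration only.
Step = Tuple[str, str, str]                      # (tool name, argument, observation)
Tool = Callable[[str], str]                      # a tool maps an argument to an observation
Policy = Callable[[List[Step]], Tuple[str, str]] # picks the next (tool name, argument)


@dataclass
class AgentRun:
    tools: Dict[str, Tool]
    choose_action: Policy          # stand-in for the orchestrator model
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self) -> str:
        """Perform one multi-step run that ends with a single submitted solution."""
        for _ in range(self.max_steps):
            tool_name, arg = self.choose_action(self.history)
            if tool_name == "submit":
                return arg
            observation = self.tools[tool_name](arg)
            self.history.append((tool_name, arg, observation))
        raise RuntimeError("step budget exhausted without a solution")


def make_toy_tools() -> Dict[str, Tool]:
    """In-memory stubs: the 'test' tool passes only once the second patch is applied."""
    state = {"patch": ""}

    def explore(path: str) -> str:
        return f"listed {path}"

    def patch(text: str) -> str:
        state["patch"] = text
        return f"applied {text}"

    def test(_: str) -> str:
        return "pass" if state["patch"] == "fix v2" else "fail"

    return {"explore": explore, "patch": patch, "test": test}


def scripted_policy(history: List[Step]) -> Tuple[str, str]:
    """Explore first, then patch and test; on a failing test, patch again."""
    if not history:
        return ("explore", "repo/")
    tool, _, observation = history[-1]
    if tool == "explore":
        return ("patch", "fix v1")
    if tool == "patch":
        return ("test", "")
    # The last step was a test: submit on success, otherwise self-correct.
    patches = [arg for name, arg, _ in history if name == "patch"]
    if observation == "pass":
        return ("submit", patches[-1])
    return ("patch", f"fix v{len(patches) + 1}")
```

Calling `AgentRun(tools=make_toy_tools(), choose_action=scripted_policy).run()` walks through explore, patch, test, a second patch after the failing test, a passing test, and then a single submission. In a real agent the policy would be a model call and the tools would touch an actual repository.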

You can read the technical details of our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
