Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
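
As a rough illustration only (this is not Refact.ai's actual code), the plan, execute, test, self-correct cycle described above could be sketched as a single multi-step loop. Everything here is an assumption made for the example: the names `AgentRun`, `run_agent`, and `scripted_policy`, the tool names `explore`, `edit`, `test`, and `submit`, and the scripted policy that stands in for the orchestrator model.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical tool shape: one string argument in, one string observation out.
Tool = Callable[[str], str]
Step = Tuple[str, str, str]  # (tool name, argument, observation)


@dataclass
class AgentRun:
    task: str
    tools: Dict[str, Tool]
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)


def run_agent(run: AgentRun, choose_action) -> Optional[str]:
    """One multi-step run: the policy picks a tool each step until it submits."""
    for _ in range(run.max_steps):
        name, arg = choose_action(run.task, run.history)
        if name == "submit":
            return arg  # the single final solution
        observation = run.tools[name](arg)
        run.history.append((name, arg, observation))
    return None  # step budget exhausted without a solution


def scripted_policy(task: str, history: List[Step]) -> Tuple[str, str]:
    """Stand-in for the orchestrator model (assumption: a real agent calls an LLM here)."""
    if not history:
        return ("explore", task)
    last_tool, _, last_obs = history[-1]
    if last_tool == "explore":
        return ("edit", "first attempt")
    if last_tool == "edit":
        return ("test", last_obs)  # test the patch that was just produced
    if last_tool == "test" and last_obs == "fail":
        return ("edit", "second attempt")  # self-correct after a failing test
    return ("submit", history[-2][2])  # submit the patch that passed


# Stub tools, purely illustrative.
TOOLS: Dict[str, Tool] = {
    "explore": lambda query: "found utils.py",
    "edit": lambda note: f"patch({note})",
    "test": lambda patch: "pass" if "second" in patch else "fail",
}

run = AgentRun(task="fix bug", tools=TOOLS)
result = run_agent(run, scripted_policy)
print(result)
```

In this toy run the first patch fails its test, the policy edits again, and only the passing patch is submitted, which mirrors the "one correct solution through iteration" idea without claiming anything about how the real agent chooses its steps.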

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
