frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Is "build an audience before a product" bad advice? 3 AI models debated it

https://coldverdict.com/share/d438d0a37443
1•offbeatport•1m ago•0 comments

Waterfox: Firefox with privacy, usability, and speed enhancements

https://www.waterfox.com/
1•rankdiff•2m ago•0 comments

OpenSSL 4.0.0 Released

https://lwn.net/Articles/1067622/
1•Brajeshwar•2m ago•0 comments

The Journal Article Is Not the Job

https://scholarlykitchen.sspnet.org/2026/04/15/the-journal-article-is-not-the-job/
1•rustoo•2m ago•0 comments

Ask HN: Robotics engineers – how painful was setting up GPU SIM infra?

1•nikhilol•3m ago•1 comments

Single Module Lambda Calculus from Simply Typed to Martin Lof Type Theory

https://github.com/solomon-b/lambda-calculus-hs
1•birdculture•5m ago•0 comments

Why Chinese AI labs went open and will remain open

https://try.works/why-chinese-ai-labs-went-open-and-will-remain-open
1•try-working•5m ago•0 comments

The Chat Bar Isn't Lazy Design

https://metedata.substack.com/p/006-the-chat-bar-isnt-lazy-design
1•young_mete•7m ago•0 comments

Strategy to reduce >350k yearly deaths From heart disease by 2050

https://pubmed.ncbi.nlm.nih.gov/41738089/
1•brandonb•7m ago•0 comments

Designing the Transport Typeface

https://www.thamesandhudson.com/blogs/all-news-features/designing-the-transport-typeface-margaret...
1•speckx•7m ago•0 comments

The AlphaFold moment for materials is not any time soon

https://www.lesswrong.com/posts/6SaZ7z2fpKRYEBdff/the-alphafold-moment-for-materials-is-not-any-t...
1•gmays•8m ago•0 comments

The Cheap (Actual) AI Assistant Era Is Almost Here

https://hec.works/blog/ai-assistant-era/
2•dividedcomet•8m ago•0 comments

Why AI hasn't replaced human expertise–and what that means for your SaaS stack

https://stackoverflow.blog/2026/04/15/why-ai-hasn-t-replaced-human-expertise/
1•salkahfi•9m ago•0 comments

Ask HN: Opus Agent Drifting

1•ramon156•9m ago•0 comments

Flowly is live on Product Hunt today – would love your support

https://www.indiehackers.com/post/flowly-is-live-on-product-hunt-today-would-love-your-support-Az...
1•max_flowly_run•10m ago•0 comments

A guide to model quantization in fine-tuning (and how to pick the right GGUF)

https://www.siquick.com/blog/model-quantization-fine-tuning-pick-right-gguf
1•siquick•10m ago•0 comments

1-week challenge using resilient LLMs (starts this Sunday)

https://github.com/gitcommitshow/resilient-llm
1•prasadshankar•11m ago•0 comments

I built a personal dashboard to see what my agents cost and where they get stuck

https://debrief-app.com/
1•ameserop•11m ago•0 comments

Show HN: Lazyagent – TUI for to watch all your AI coding agents

https://github.com/chojs23/lazyagent
5•neozz•12m ago•0 comments

TollGate is the pay-as-you-go internet access on open networks

https://tollgate.me/
2•janandonly•12m ago•0 comments

God Sleeps in the Minerals

https://wchambliss.wordpress.com/2026/03/03/god-sleeps-in-the-minerals/
3•speckx•13m ago•0 comments

Don't Trust Password Managers? Hippo May Be the Answer

https://hackaday.com/2026/04/15/dont-trust-password-managers-hippo-may-be-the-answer/
1•beardyw•14m ago•0 comments

A Long History of Feeling Small

https://worldhistory.substack.com/p/a-long-history-of-feeling-small
1•crescit_eundo•14m ago•0 comments

Pausing new GitHub Copilot Pro trials

https://github.blog/changelog/2026-04-10-pausing-new-github-copilot-pro-trials/
2•ms7892•14m ago•0 comments

9 years building a task manager, last 2 years all in with AI

https://selfmanager.ai/
1•mariansorca•15m ago•1 comments

Ancient Excel bug comes out of retirement for active attacks

https://www.theregister.com/2026/04/15/excel_exploit/
1•Brajeshwar•16m ago•0 comments

Show HN: Chat-rs, yet another LLM provider

https://github.com/eggermarc/chat-rs
1•eggermarc•21m ago•0 comments

The Case for WordPress

https://randomwire.com/the-case-for-wordpress/
1•randomwire•21m ago•0 comments

Show HN: Kino is a Google TV app that turns the TV into a media server

https://play.google.com/store/apps/details?id=com.mrtksn.kino&hl=en_US
1•mrtksn•21m ago•0 comments

The Design Landscape of Robot Learning Is a Minefield

https://allevato.me/2026/04/15/robot-learning-is-a-minefield
1•kukanani•22m ago•0 comments