Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
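
As a rough illustration of the plan, execute, test, self-correct loop described above, here is a minimal Python sketch. It is not Refact.ai's code: `AgentRun`, `scripted_policy`, `make_tools`, and the tool names other than `deep_analysis` are hypothetical, the tools are stubs, and a fixed script stands in for the LLM orchestrator.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Every name in this sketch is hypothetical; nothing here is taken from the Refact.ai codebase.
Tool = Callable[[str], str]


@dataclass
class AgentRun:
    tools: Dict[str, Tool]
    max_steps: int = 10
    history: List[Tuple[str, str, str]] = field(default_factory=list)

    def run(self, task: str, policy: Callable[[str, list], Tuple[str, str]]) -> str:
        """One multi-step run: the policy picks a tool each step until it returns 'finish'."""
        for _ in range(self.max_steps):
            tool_name, arg = policy(task, self.history)
            if tool_name == "finish":
                return arg
            result = self.tools[tool_name](arg)
            self.history.append((tool_name, arg, result))
        return "gave up: step budget exhausted"


def scripted_policy(task: str, history: list) -> Tuple[str, str]:
    """Stand-in for the LLM orchestrator: explore, analyse, then edit and test until tests pass."""
    used = [name for name, _, _ in history]
    if "explore" not in used:
        return "explore", task
    if "deep_analysis" not in used:
        return "deep_analysis", history[-1][2]
    last_tool, _, last_result = history[-1]
    if last_tool == "run_tests":
        if last_result == "pass":
            return "finish", f"patch v{used.count('edit')}"
        return "edit", "retry"  # self-correct after a failing test run
    if last_tool == "edit":
        return "run_tests", ""
    return "edit", "first attempt"


def make_tools() -> Dict[str, Tool]:
    """Stub tools; the fake test suite only passes after the second edit."""
    state = {"edits": 0}

    def edit(_: str) -> str:
        state["edits"] += 1
        return f"applied edit #{state['edits']}"

    def run_tests(_: str) -> str:
        return "pass" if state["edits"] >= 2 else "fail"

    return {
        "explore": lambda task: f"candidate files for: {task}",
        "deep_analysis": lambda ctx: f"hypothesis from: {ctx}",
        "edit": edit,
        "run_tests": run_tests,
    }
```

Calling `AgentRun(tools=make_tools()).run("fix failing test", scripted_policy)` walks through explore, analysis, a failed first edit, and a second edit that passes, then returns a single result. The step budget keeps a run from looping forever if the tests never pass.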

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
