Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•12mo ago

Comments

kate_at_refact•12mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
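
To make the plan, execute, test, self-correct loop concrete, here is a minimal sketch in Python. It is not Refact.ai's actual code: `AgentRun`, `scripted_orchestrator`, and `make_tools` are hypothetical names, the orchestrator is a hard-coded stand-in for the LLM, and every tool is a stub. It only illustrates one multi-step run that picks tools dynamically and stops at a single solution once tests pass.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# All names here are hypothetical; this is an illustration, not Refact.ai's code.
Tool = Callable[[str], str]
History = List[Tuple[str, str]]  # (tool name, observation) pairs


@dataclass
class AgentRun:
    task: str
    tools: Dict[str, Tool]
    # Stand-in for the orchestrator model: returns (tool name, argument).
    choose_action: Callable[[str, History], Tuple[str, str]]
    max_steps: int = 20
    history: History = field(default_factory=list)

    def run(self) -> str:
        """One multi-step run that returns a single solution."""
        for _ in range(self.max_steps):
            name, arg = self.choose_action(self.task, self.history)
            if name == "finish":
                return arg
            observation = self.tools[name](arg)
            self.history.append((name, observation))
        raise RuntimeError("step budget exhausted without a solution")


def make_tools() -> Dict[str, Tool]:
    """Stub tools; the tests pass only after the second edit."""
    state = {"edits": 0}

    def explore(query: str) -> str:
        return "found files related to: " + query

    def deep_analysis(query: str) -> str:
        return "hypothesis for: " + query

    def edit(description: str) -> str:
        state["edits"] += 1
        return "applied " + description

    def run_tests(_: str) -> str:
        return "pass" if state["edits"] >= 2 else "fail"

    return {"explore": explore, "deep_analysis": deep_analysis,
            "edit": edit, "run_tests": run_tests}


def scripted_orchestrator(task: str, history: History) -> Tuple[str, str]:
    """Hard-coded policy standing in for the LLM orchestrator."""
    used = [name for name, _ in history]
    if "explore" not in used:
        return ("explore", task)
    if "deep_analysis" not in used:
        return ("deep_analysis", task)
    if history[-1] == ("run_tests", "pass"):
        return ("finish", "patch-v%d" % used.count("edit"))
    if history[-1][0] == "edit":
        return ("run_tests", "")
    # Either analysis just finished or the tests failed: try another edit.
    return ("edit", "attempt %d" % (used.count("edit") + 1))


if __name__ == "__main__":
    agent = AgentRun("fix failing test", make_tools(), scripted_orchestrator)
    print(agent.run())
    print([name for name, _ in agent.history])
```

In this toy run the first edit fails the stub tests, so the loop edits again before finishing; the step budget (`max_steps`) is an assumption added to keep the sketch from looping forever.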

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

A Tale of Two Job Markets

https://www.youtube.com/watch?v=ugzw5I3Vako
1•gandalfgeek•1m ago•0 comments

Astro Removed Its Llms.txt

https://dacharycarey.com/2026/05/04/astro-removed-llms-txt/
1•taubek•1m ago•0 comments

Karabiner-Elements 16.0.0

https://karabiner-elements.pqrs.org/docs/releasenotes/
1•pretext•2m ago•0 comments

Removable batteries in smartphones will be mandatory in the EU starting in 2027

https://www.ecopv-eu.com/en/blog-en/replaceable-smartphone-batteries-2027-eu-regulation/
2•rdeboo•4m ago•0 comments

ConsentFix v3 attacks target Azure with automated OAuth abuse

https://www.bleepingcomputer.com/news/security/consentfix-v3-attacks-target-azure-with-automated-...
1•Brajeshwar•6m ago•0 comments

Show HN: SharkAuth – Auth server for AI agent delegation

https://github.com/shark-auth/shark
1•raulgooo•7m ago•0 comments

Building a new enterprise AI services company with Blackstone, H&F, and Goldman

https://www.anthropic.com/news/enterprise-ai-services-company
1•yla92•10m ago•0 comments

You Were Tricked: An 8000 Word Response to Lars Lofgren's Viral Codesmith Piece

https://michaelnovati.substack.com/p/a-response-to-lars-lofgrens-codesmith
1•michaelnovati•11m ago•1 comment

29th August 2026: A Scenario

https://martinalderson.com/posts/august-29-2026-a-scenario/
1•martinald•12m ago•0 comments

Automatically switch Android's dark mode using ambient light sensor

https://www.howtogeek.com/i-ditched-sunrisesunset-dark-mode-for-this-android-app-it-uses-your-lig...
1•politelemon•12m ago•0 comments

Show HN: KIP Pattern – A React architecture pattern for true encapsulation

https://github.com/Miladxsar23/kip-pattern
1•milad_shirian•13m ago•0 comments

Send Large Files Online – Free, Secure and Unlimited

https://fromsmash.com/
1•janandonly•15m ago•0 comments

Show HN: BibCrit – LLM analysis grounded in real manuscript corpus data

https://bibcrit.app/
1•jossifresben•19m ago•1 comment

More than half of pilots have fallen asleep while in charge of a plane (2013)

https://www.bbc.com/news/uk-24296544
2•johnbarron•21m ago•1 comment

Flipper: Beautiful, performant feature flags for Ruby

https://github.com/flippercloud/flipper
1•thunderbong•22m ago•0 comments

Analyzing the Patterns of Numbers in 10M Passwords (2015)

https://minimaxir.com/2015/02/password-numbers/
1•downbad_•25m ago•1 comment

Show HN: Looq, the capabilities macOS Quick Look should have shipped with

https://parcse.com/looq
3•parcse•25m ago•0 comments

Show HN: Capsule Bash – Sandboxed Bash for Agents

https://github.com/capsulerun/bash
1•mavdol04•25m ago•2 comments

Pomiferous: The most extensive apples (pommes) database

https://pomiferous.com/
1•Ariarule•27m ago•0 comments

How citations ruined science

https://davidoks.blog/p/how-citations-ruined-science
1•jprs•29m ago•0 comments

Are closed social networks inevitable? (2010)

https://danluu.com/open-social-networks/
2•downbad_•30m ago•1 comment

Knowledge Infra for Agents and Humans

https://dosu.dev
1•devstein•30m ago•0 comments

LandingRank – community-ranked landing page directory with daily Elo battles

https://landingrank.com
1•_FakeBanana_•30m ago•0 comments

Systems Are Visual – This Is a Better Way to Write Them

https://toolkit.whysonil.dev/lab-notebook/
3•otterwilde2•33m ago•0 comments

Vitexec – allow agents to test Vite apps through injected code

https://www.youtube.com/watch?v=yhIOSjp6pqs
1•BelaBohlender•33m ago•0 comments

They Left Receipts: Inside Charming Kitten's Crypto Procurement Network

https://caudena.com/charming-kitten-crypto-procurement-network/
2•caudena•35m ago•0 comments

8 in 10 Chatbots Inclined to Assist Users in Planning Attacks

https://www.statista.com/chart/36156/instances-where-chatbots-assisted-users-plan-a-violent-attack/
2•laurex•35m ago•0 comments

AI evals are becoming the new compute bottleneck

https://huggingface.co/blog/evaleval/eval-costs-bottleneck
4•gmays•37m ago•0 comments

Tera – System for structuring and testing complex ideas

https://github.com/Yggdrasilcsui/TERA/discussions/1
2•Yggdrasilcsui•40m ago•0 comments

Ask HN: What's your favorite tech talk?

1•downbad_•41m ago•4 comments