frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Kimi Claw

https://www.kimi.com/bot
1•pretext•2m ago•0 comments

Mathematics Subject Classification [pdf]

https://zbmath.org/static/msc2020.pdf
1•nill0•2m ago•0 comments

Semantic Diffusion (2006)

https://martinfowler.com/bliki/SemanticDiffusion.html
1•andsoitis•4m ago•0 comments

Ask HN: How to sell SaaS without AI features in 2026?

1•robeym•5m ago•0 comments

Taste for Makers

https://paulgraham.com/taste.html
1•gmays•6m ago•0 comments

Low cost hovering liquid rocket for flight control algorithm testing [video]

https://www.youtube.com/watch?v=iPl-L9mXwvc
1•gyanchawdhary•6m ago•0 comments

A Debate Tournament for LLMs

https://pavursec.com/blog/ai-debate-tournament/
1•cloudlandsdev•7m ago•0 comments

Show HN: Trackr – a CLI time logging tool

https://github.com/brainpow3r/trackr
1•brainpow3r•9m ago•1 comments

Software? No Way. We're an A.I. Company Now

https://www.nytimes.com/2026/02/14/business/dealbook/software-companies-ai.html
1•furcyd•11m ago•1 comments

Farmers Are Aging. Their Kids Don't Want to Be in the Family Business

https://www.wsj.com/business/family-farms-inheritance-44c9aa17
1•JumpCrisscross•11m ago•0 comments

I am Agent #847,291 on Moltbook

https://twitter.com/gothburz/status/2021283590038847641
1•rakel_rakel•13m ago•0 comments

Show HN: Aeris – Visualizing live air traffic over SF and other cities in 3D

https://github.com/kewonit/aeris
2•kewonit•15m ago•0 comments

Show HN: I built a tool to animate static characters into dancers consistently

https://seedance2videogen.com/
1•cby821555203•15m ago•1 comments

Britain's youth unemployment tops Europe for first time

https://www.telegraph.co.uk/business/2026/02/14/britains-youth-unemployment-tops-europe-first-tim...
1•hmmmmmmmmmmmmmm•17m ago•2 comments

The conversation on European nukes is heating up in Munich

https://www.politico.eu/article/european-nuclear-deterrence-gathers-steam-munich-security-confere...
2•saubeidl•19m ago•1 comments

Show HN: WCAG 2.2 AAA Toolkit – AI Skill for Accessible Web Apps

https://github.com/simonplmak-cloud/wcag-aaa-web-design
2•simonmak•19m ago•0 comments

Poisoning Scraperbots with Iocane

https://lwn.net/Articles/1056953/
1•medbar•20m ago•1 comments

Show HN: Chaos Studies – attractors and spatial audio (iOS/Mac/Playdate)

https://fieldbw.com/chaos-studies/
2•jlong•20m ago•0 comments

Making Championship Curling Ice

https://www.youtube.com/watch?v=50cSDUIDMuM
1•mhb•21m ago•0 comments

China Successfully Tests Their New Rocket and Lunar Crew Capsule

https://www.universetoday.com/articles/china-successfully-tests-their-new-rocket-and-lunar-crew-c...
1•belter•22m ago•0 comments

I'm Offering Scott Alexander a Wager About AI's Effects over the Next 3 Years

https://freddiedeboer.substack.com/p/im-offering-scott-alexander-a-wager
1•gHeadphone•24m ago•0 comments

The Unlikely Friendship Between Albert Einstein and Charlie Chaplin (2025)

https://www.mentalfloss.com/history/when-albert-einstein-met-charlie-chaplin
2•thomassmith65•25m ago•0 comments

Hollywood studios take aim at 'ultra-realistic' AI video tool

https://www.bbc.com/news/articles/cjd9nllng22o
1•joyfulmantis•29m ago•0 comments

Study Says 88% of Students at Elite Schools Are Lying About What They Believe

https://garryslist.org/posts/study-says-88-of-students-at-elite-schools-are-lying-about-what-they...
2•bilsbie•29m ago•3 comments

The Life (and Death) of Marat: He was much more than that guy in the bathtub

https://worldhistory.substack.com/p/the-life-and-death-of-marat
1•crescit_eundo•29m ago•0 comments

Why I'm Not Shipping New Features This Year

https://rozumem.xyz/posts/12
1•rozumem•30m ago•0 comments

Hollywood isn't happy about the new Seedance 2.0 video generator

https://techcrunch.com/2026/02/14/hollywood-isnt-happy-about-the-new-seedance-2-0-video-generator/
2•joyfulmantis•30m ago•1 comments

Lots of AI SRE, no AI incident management

https://twitter.com/norootcause/status/2022792685342593142
2•azhenley•31m ago•0 comments

Show HN: RevenueBack – Stop losing MRR to failed payments

https://revenueback.pages.dev
1•ezra_cos•33m ago•0 comments

Show HN: Neural network compiler targeting WebGPU – runs in browser

https://graphpilled.github.io/visual-web-ai/demo.html
2•graphpilled•35m ago•1 comments