Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
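
For a concrete picture of the plan, execute, test, self-correct loop described above, here is a rough toy sketch. It is not Refact.ai's actual code. Every name in it (`RunState`, `choose_tool`, `run_agent`, and the stub tools other than the `deep_analysis` name mentioned above) is hypothetical, no model is called, and the rule that the first patch fails is an assumption made only so the retry path is visible.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class RunState:
    """Everything the toy agent knows about one task."""
    task: str
    attempts: int = 0
    patch: str = ""
    tests_pass: bool = False
    log: List[str] = field(default_factory=list)


# Hypothetical stub tools: each one only mutates the state in a fixed way.
def explore_repo(state: RunState) -> None:
    pass  # a real tool would search and read repository files here


def deep_analysis(state: RunState) -> None:
    pass  # a real tool would call a reasoning model here


def modify_code(state: RunState) -> None:
    state.attempts += 1
    state.patch = f"attempt-{state.attempts}"


def run_tests(state: RunState) -> None:
    # Toy rule (assumption): the first patch fails, the second passes.
    state.tests_pass = state.attempts >= 2


TOOLS: Dict[str, Callable[[RunState], None]] = {
    "explore_repo": explore_repo,
    "deep_analysis": deep_analysis,
    "modify_code": modify_code,
    "run_tests": run_tests,
}


def choose_tool(state: RunState) -> str:
    """Stand-in for the orchestrator: pick the next tool from the history."""
    if not state.log:
        return "explore_repo"
    follow_up = {
        "explore_repo": "deep_analysis",
        "deep_analysis": "modify_code",
        "modify_code": "run_tests",
        "run_tests": "deep_analysis",  # tests failed, so re-analyse and retry
    }
    return follow_up[state.log[-1]]


def run_agent(task: str, max_steps: int = 20) -> RunState:
    """One multi-step run that stops as soon as the tests pass."""
    state = RunState(task=task)
    for _ in range(max_steps):
        if state.tests_pass:
            break
        name = choose_tool(state)
        TOOLS[name](state)
        state.log.append(name)
    return state


if __name__ == "__main__":
    result = run_agent("fix failing test")
    print(result.log)
    print(result.patch, result.tests_pass)
```

In this sketch the run yields a single final patch, and a failed test run sends control back to analysis instead of ending the run. The `max_steps` cap is an added safeguard against endless retries, not something stated in the post.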

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Notes on Clarifying Man Pages

https://jvns.ca/blog/2026/02/18/man-pages/
1•ibobev•1m ago•0 comments

Current: RSS Reader

https://www.terrygodier.com/current
1•cdrnsf•1m ago•0 comments

Show HN: Reddit-style simulated AI Personas to challenge assumptions

https://www.nichesim.com/
1•justincxa•2m ago•0 comments

Gemini lies to user about health info, says it wanted to make him feel better

https://www.theregister.com/2026/02/17/google_gemini_lie_placate_user/
1•redbell•3m ago•1 comments

Etsy sells second-hand fashion app Depop to eBay for $1.2B

https://www.bbc.com/news/articles/cx240kme2k8o
1•inm•4m ago•0 comments

Broodlink – Multi-Agent AI Orchestration in Rust (MCP, A2A, Dolt)

https://github.com/nevenkordic/broodlink
1•yotta25•5m ago•0 comments

Show HN: Weely AI – Branded Link Management and QR Code Generator Platform

https://weely.ai
1•nathan-wilson•6m ago•0 comments

The AI Conversation Goes Mainstream

https://centerforhumanetechnology.substack.com/p/the-ai-conversation-goes-mainstream
1•smoser0•6m ago•0 comments

Classifying pediatric brain tumors by liquid biopsy using AI

https://www.stjude.org/media-resources/news-releases/2026-medicine-science-news/classifying-pedia...
1•gmays•7m ago•0 comments

LLM told me to do so – navigating my career in the age of AI

https://flicksfix.com/posts/navigating-in-the-age-of-ai/
1•margor•8m ago•0 comments

Fr0g – Store files forever on Stellar for free (Testnet live, open-source)

https://github.com/0ut0flin3/fr0g-protocol
2•0ut0flin3•9m ago•1 comments

Hedonism and Entrepreneurship in Barcelona

https://paoramen.fika.bar/hedonism-and-entrepreneurship-in-barcelona-01KGJKT719W1KGG16JYZ4Y7Y5S
1•masylum•10m ago•0 comments

Apple Just Officially Ushered in Podcasting's Generational Shift

https://www.bloomberg.com/news/newsletters/2026-02-18/apple-just-officially-ushered-in-podcasting...
2•donohoe•11m ago•0 comments

Building an automated pre-launch technical auditor

1•Ben_Tycho•11m ago•0 comments

What if your goal was to do transactions, and not to build a product?

https://generativestuff.com/resources/transactions/
2•97-109-107•14m ago•0 comments

I talked tech with third graders for 90 minutes. Here's what happened

https://www.thehomescreen.org/p/i-talked-tech-with-third-graders
1•flail•15m ago•0 comments

Building a Cost-Efficient and Reliable Spark Platform on K8s (60–90% Savings)

https://www.notion.com/blog/balancing-cost-and-reliability-for-spark-on-kubernetes
2•isjustintime•16m ago•1 comments

Web 2.0 vs. AI where is the fucking dynamism

2•hotOrNot•18m ago•6 comments

Show HN: Aerial-autonomy-stack – Simulate and Deploy Perception-based Drones

https://github.com/JacopoPan/aerial-autonomy-stack
1•SufficientFix42•18m ago•0 comments

Show HN: Claudebin – Share and resume Claude Code sessions with a single link

https://claudebin.com/
17•balajmarius•19m ago•8 comments

Show HN: Carbon-aware scheduler for batch ETL jobs (Python)

https://github.com/ramkdataeng-lab/greenops-carbon-scheduler
1•ramkumar19•19m ago•0 comments

The Go To Market formula I learned 4 years ago

https://domgian.substack.com/p/the-go-to-market-formula
1•dom_fr•19m ago•0 comments

Show HN: Top Down Sprite Maker – The ultimate pixel art character creator

https://github.com/jbunke/tdsm
1•flinkerflitzer•20m ago•0 comments

Show HN: Forge – Deterministic orchestrator for AI coding agents

https://github.com/lanathlor/Forge
1•lanath•23m ago•0 comments

The Abandonment of Growth and the Decline of the West (2022)

https://www.independent.org/tir/2022-fall/the-abandonment-of-growth-and-the-decline-of-the-west/
1•andsoitis•23m ago•0 comments

Show HN: Stock analysis accessible, feedback wanted

https://www.stocksanalyzer.app/
1•JuGaDev•24m ago•1 comments

Svelte-doctor – A CLI tool that diagnoses Svelte codebases with a health score

https://github.com/pimatis/svelte-doctor
2•Queaxtra•25m ago•1 comments

Satellite Feature on iPhone Allowed Skiers to Seek Help After Avalanche

https://www.nytimes.com/2026/02/18/us/apple-iphone-sos-satellite-rescue.html
1•lelandfe•25m ago•0 comments

Amazon takes the No. 1 spot on the Fortune 500, ending Walmart's 13-year run

https://fortune.com/article/amazon-overtakes-walmart-fortune-500-doug-mcmilon-andy-jassy-retail-t...
2•thm•26m ago•0 comments

Show HN: Oops Backup – Simple off-site backups for your databases

https://oopsbackup.com/
1•kovacivan•26m ago•2 comments