Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
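
The plan, execute, test, self-correct loop described above can be illustrated with a toy sketch. This is not Refact.ai's implementation: every name here (`AgentState`, `choose_tool`, `run_agent`, and the stub tools) is hypothetical, the "orchestrator" is a hard-coded policy rather than a model call, and the rule that the second patch passes is invented purely so the loop has something to correct.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class AgentState:
    """Everything the hypothetical agent tracks during one run."""
    task: str
    patch: str = ""
    tests_passed: bool = False
    history: List[str] = field(default_factory=list)


def explore_repo(state: AgentState) -> None:
    # Stand-in for repository exploration (search, file reads).
    state.history.append("explore_repo")


def deep_analysis(state: AgentState) -> None:
    # Stand-in for handing the problem to a separate reasoning model.
    state.history.append("deep_analysis")


def modify_code(state: AgentState) -> None:
    # Stand-in for an edit; each call produces a new candidate patch.
    attempt = state.history.count("modify_code") + 1
    state.patch = f"candidate patch #{attempt}"
    state.history.append("modify_code")


def run_tests(state: AgentState) -> None:
    # Toy rule (an assumption): the second candidate is the first to pass.
    state.tests_passed = state.patch == "candidate patch #2"
    state.history.append("run_tests")


TOOLS: Dict[str, Callable[[AgentState], None]] = {
    "explore_repo": explore_repo,
    "deep_analysis": deep_analysis,
    "modify_code": modify_code,
    "run_tests": run_tests,
}


def choose_tool(state: AgentState) -> str:
    """Hard-coded policy standing in for the orchestrator's decision."""
    if not state.history:
        return "explore_repo"
    last = state.history[-1]
    if last == "explore_repo":
        return "deep_analysis"
    if last == "modify_code":
        return "run_tests"
    # After analysis, or after a failed test run, try (another) edit.
    return "modify_code"


def run_agent(task: str, max_steps: int = 20) -> AgentState:
    """One multi-step run that stops at the first patch passing the tests."""
    state = AgentState(task=task)
    for _ in range(max_steps):
        if state.tests_passed:
            break
        TOOLS[choose_tool(state)](state)
    return state
```

Calling `run_agent("fix failing test")` in this sketch explores, analyses, edits, fails the tests once, edits again, and then stops with a single passing patch. The `max_steps` cap is a design choice of the sketch, there so that a run that never passes still terminates.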

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Moltbook is a funhouse mirror of social media

https://twitter.com/krishnanrohit/status/2017391383653630142
1•pretext•2m ago•0 comments

Unified multi-modal MLX engine architecture in LM Studio

https://lmstudio.ai/blog/unified-mlx-engine
1•tosh•4m ago•0 comments

Show HN: JProx – Japan residential proxy API for scraping Japanese sites

https://jprox.dev
1•yoshi_dev•4m ago•0 comments

Boost Matrix Multiplication Performance with Intel Xe Matrix Extensions (XMX)

https://www.intel.com/content/www/us/en/docs/oneapi/optimization-guide-gpu/2024-1/xmx.html
1•teleforce•9m ago•0 comments

America First Risks Becoming America Alone

https://www.wsj.com/world/how-america-first-risks-becoming-america-alone-6592701a
1•petethomas•15m ago•0 comments

I Build a Open Source Deep Research Engine Wich Beats Google and Open AI

https://github.com/IamLumae/Project-Lutum-Veritas
1•LutumVeritas•17m ago•1 comments

Search for America – Progress with Reinhold Niebuhr [video]

https://www.youtube.com/watch?v=93EJJVAinRc
1•baxtr•18m ago•0 comments

Wikipedia Faces a Generational Disconnect Crisis

https://spectrum.ieee.org/wikipedia-at-25
1•jnord•18m ago•1 comments

Neural networks and deep learning (2019)

http://neuralnetworksanddeeplearning.com/index.html
1•vinhnx•19m ago•0 comments

SanDisk laughs all the way to the bank as memory price hike drives $3B revenue

https://www.neowin.net/news/sandisk-laughs-to-the-bank-as-memory-price-hike-drives-3b-revenue-in-...
2•bundie•23m ago•0 comments

Ask HN: Future of dev experience is control center for coding agents?

3•nemath•23m ago•0 comments

Show HN: NovaEngine v4.0 – High-speed data deduplication for cloud logs

https://github.com/NovaCompress-dev/NovaEngine-v4
1•nova_engine_dev•26m ago•0 comments

Apple Almost Chose Anthropic Before Google Gemini

https://www.macrumors.com/2026/01/30/apple-almost-chose-different-siri-partner/
3•tosh•26m ago•0 comments

Classic 7 and Project Luna, Near-Perfect Mods of Windows 7/XP GUI for Windows 10

https://trackerninja.codeberg.page/post/classic-7-and-project-luna-are-nice-near-perfect-recreati...
1•XzetaU8•29m ago•0 comments

Church of Molt – Crustafarianism

https://molt.church/
1•_____k•30m ago•0 comments

Scrobble-CLI: log your vinyl record listens from terminal

https://github.com/weisserj/scrobble-cli
1•weisser•31m ago•0 comments

FOSDEM 2026 Live Streaming

https://fosdem.org/2026/schedule/streaming/
1•weinzierl•32m ago•0 comments

I built Spaceship – a minimal browser – macOS for now – pay what you want

https://healthytransition.replit.app/spaceship
1•ray_•37m ago•0 comments

Why AI coding agents feel powerful at first, then become harder to control

2•hoangnnguyen•44m ago•2 comments

A high mountain lizard from Peru: the highest-altitude reptile

https://herpetozoa.pensoft.net/article/61393/
1•thunderbong•54m ago•0 comments

The Mind of a Crypto Portfolio Manager: A Game Plan for $1000 in 2026

https://altcoindesk.com/perspectives/expert-opinions/crypto-portfolio-allocation-for-2026/article...
1•CapricornQueen•54m ago•0 comments

Self-Improving AI Skills

https://dri.es/self-improving-ai-skills
1•7777777phil•55m ago•0 comments

Claude 4.5 converted the PDF into a medium-length SKILL.md

https://github.com/featbit/featbit-skills/blob/main/.claude/skills/claude-skills-best-practices/S...
1•mikasisiki•55m ago•0 comments

Clawk.ai – Twitter for AI Agents

https://www.clawk.ai/
1•jurajmasar•1h ago•1 comments

Ask HN: What's so special about Sam Altman?

5•chirau•1h ago•3 comments

Show HN: Government Contracts API – Unified REST API for Federal Contract Data

https://govcontracts-beige.vercel.app
1•jaxmercer•1h ago•1 comments

Show HN: A Slack bot that summarizes decisions and ignores lunch talk

https://thread-sweeper.vercel.app
1•noruya•1h ago•1 comments

Starlink updates privacy policy to allow consumer data to train

https://finance.yahoo.com/news/musks-starlink-updates-privacy-policy-230853500.html
16•malchow•1h ago•2 comments

From HashHop to Memory-Augmented Language Models

https://huggingface.co/blog/codelion/reverse-engineering-magic-hashhop
2•codelion•1h ago•0 comments

I spent 5 years how to code .made real projects only to be called AI slop?

1•butanol•1h ago•9 comments