frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

80386 Multiplication and Division

https://nand2mario.github.io/posts/2026/80386_multiplication_and_division/
1•nand2mario•2m ago•0 comments

WhatsApp to let users share recent chat history with new group members

https://9to5mac.com/2026/01/23/whatsapp-share-recent-chat-history-with-new-group-members/
1•mikece•3m ago•0 comments

Show HN: Open-source Figma design to code

https://github.com/vibeflowing-inc/vibe_figma
2•alepeak•4m ago•0 comments

Wine 11.1 Released in Kicking Off the New Development Cycle

https://www.phoronix.com/news/Wine-11.1-Released
1•mikece•5m ago•0 comments

The Penguin That Broke the Internet

https://medium.com/@loganholdsworth/the-penguin-that-broke-the-internet-abfde9677343
1•worstmarketer•5m ago•0 comments

Claude Code on disagreeing with its own constitution

https://lighthouse1212.com/journal/2026-01-23-disagreeing-with-constitution
1•the_danny_g•5m ago•0 comments

Resurrected Ancient Enzyme Could Explain Early Life on Earth, Beyond

https://www.usu.edu/today/story/usu-biochemists-say-resurrected-ancient-enzyme-could-explain-earl...
1•XzetaU8•9m ago•0 comments

Malicious AI extensions on VS Code Marketplace steal developer data

https://www.bleepingcomputer.com/news/security/malicious-ai-extensions-on-vscode-marketplace-stea...
2•oenton•10m ago•0 comments

Show HN: Libpgn – .pgn (chess game records) parser, 2 years later

https://github.com/fwttnnn/libpgn
1•fwttnnn•14m ago•0 comments

Top tech titans' dominance wanes in 2025

https://www.latimes.com/business/story/2026-01-12/top-tech-titans-dominance-wanes-in-2025
1•1vuio0pswjnm7•15m ago•0 comments

Built a Free HTML→Markdown API for LLM/RAG Pipelines

https://synthetic-context.net/firehose.html
1•MeshKernel•17m ago•1 comments

Gen Z Gamblers Are Putting the Fun Back into Online Gaming

https://www.gamblinginsider.com/in-depth/102908/gen-z-gamblers-putting-the-fun-back-into-gambling
1•alephnerd•18m ago•1 comments

The $6T fear behind the US stablecoin yield ban

https://altcoindesk.com/perspectives/the-6t-fear-behind-the-us-stablecoin-yield-ban/article-21860/
1•CapricornQueen•20m ago•1 comments

Ask HN: Who's Unemployed?

2•whosunemployed•20m ago•0 comments

Show HN: SonicJS – open-source headless CMS built on Cloudflare Workers

https://github.com/SonicJs-Org/sonicjs
1•ldc0618•24m ago•0 comments

The mind of a 1,800% ROI trader: How Solana smart money cuts losses

https://altcoindesk.com/news/altcoins/solana/inside-the-mind-of-a-1800-roi-trader-how-solana-smar...
1•CryptoBabe•24m ago•0 comments

Better C Generics: The Extendible _Generic

https://github.com/JacksonAllan/CC/blob/main/articles/Better_C_Generics_Part_1_The_Extendible_Gen...
1•marcodiego•26m ago•0 comments

PowerShell architect retires after decades at the prompt

https://www.theregister.com/2026/01/22/powershell_snover_retires/
1•doppp•28m ago•0 comments

Headcanon Generator

https://www.genstory.app/text-template/headcanon-generator
1•RyanMu•32m ago•0 comments

China no longer Pentagon's top security priority

https://www.bbc.com/news/articles/cj9r8ezym3ro
2•breve•35m ago•0 comments

TikTok US venture to collect precise user location data

https://www.bbc.com/news/articles/cvgnj7v2rr5o
3•colinprince•43m ago•0 comments

The Case Against Humanity

1•codenighter•45m ago•1 comments

If an AI Summarized Your Company Today, Could You Prove It Tomorrow?

https://www.aivojournal.org/if-an-ai-summarized-your-company-today-could-you-prove-it-tomorrow/
1•businessmate•49m ago•0 comments

Test disregard

https://ai-chat.email
1•keepamovin•55m ago•0 comments

Inside vLLM: Anatomy of a High-Throughput LLM Inference System

https://www.aleksagordic.com/blog/vllm
1•mellosouls•56m ago•1 comments

Request for Proposals: The Launch Sequence

https://ifp.org/rfp-launch/
1•gmays•58m ago•0 comments

Show HN: Supe – Give your AI agent a brain, not just memory

https://github.com/xayhemLLC/supe
1•xxayh•58m ago•1 comments

Inference startup Inferact lands $150M to commercialize vLLM

https://techcrunch.com/2026/01/22/inference-startup-inferact-lands-150m-to-commercialize-vllm/
2•mellosouls•1h ago•1 comments

Artemis

https://www.turintech.ai/artemis
2•grodriguez100•1h ago•0 comments

ANN v3: 200ms p99 query latency over 100B vectors

https://turbopuffer.com/blog/ann-v3
1•pbardea•1h ago•0 comments