Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
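
As a rough illustration only, the plan / execute / test / self-correct cycle described above can be sketched as a small loop. This is not Refact.ai's actual code: `run_agent`, `AgentRun`, `toy_propose`, and `toy_run_tests` are hypothetical names, and the "model" here is a hard-coded stand-in rather than Claude 3.7 Sonnet or o4-mini.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class AgentRun:
    """Record of one multi-step run: every attempt plus the final patch."""
    attempts: List[Tuple[str, bool]] = field(default_factory=list)
    solution: Optional[str] = None


def run_agent(
    task: str,
    propose: Callable[[str, List[str]], str],
    run_tests: Callable[[str], Tuple[bool, str]],
    max_steps: int = 5,
) -> AgentRun:
    """Iterate propose -> test -> feed back failures until tests pass."""
    run = AgentRun()
    feedback: List[str] = []
    for _ in range(max_steps):
        patch = propose(task, feedback)      # plan + execute (hypothetical)
        passed, log = run_tests(patch)       # test
        run.attempts.append((patch, passed))
        if passed:
            run.solution = patch             # deliver a single solution
            break
        feedback.append(log)                 # self-correct on the next step
    return run


def toy_propose(task: str, feedback: List[str]) -> str:
    # Stand-in for a model call: wrong on the first try, fixed after feedback.
    return "return a + b" if feedback else "return a - b"


def toy_run_tests(patch: str) -> Tuple[bool, str]:
    # Stand-in for a test tool: build add() from the patch and check it.
    namespace: dict = {}
    exec(f"def add(a, b):\n    {patch}", namespace)
    ok = namespace["add"](2, 3) == 5
    return ok, ("ok" if ok else "add(2, 3) != 5")


result = run_agent("implement add(a, b)", toy_propose, toy_run_tests)
```

In this toy run the first attempt fails its test, the failure message is fed back, and the second attempt becomes the single delivered solution.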

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Charlie Eggins 3BLD double world record [video]

https://www.youtube.com/shorts/pzz3Ya5BFvs
1•ColinWright•3m ago•1 comments

Meta's VR layoffs, studio closures underscore Zuckerberg's pivot to AI

https://www.cnbc.com/2026/01/13/meta-lays-off-vr-employees-underscoring-zuckerbergs-pivot-to-ai.html
1•cebert•3m ago•0 comments

I Let the Internet Vote on Code Merges: Week 1 Results

https://blog.openchaos.dev/posts/week-1-the-first-merge
1•birdculture•4m ago•0 comments

Tesla driver-assist system FSD will switch to subscription-only

https://www.bloomberg.com/news/articles/2026-01-14/tesla-driver-assist-system-fsd-will-switch-to-...
1•teleforce•6m ago•0 comments

Show HN: Auto-fix Google Play Store translations that exceed character limits

https://chromewebstore.google.com/detail/play-console-translation/polceeifilniadjhgibdnlikpfnflhml
1•jelmervnuss•12m ago•0 comments

Show HN: Remio A second brain without headaches

https://www.remio.ai
1•AliceH0521•16m ago•0 comments

How to import ChatGPT conversations in Obsidian

https://blog.missioncontroltoolbox.xyz/blog/how-to-import-chatgpt-conversations-in-obsidian
1•awesomepotato•20m ago•0 comments

Show HN: SVGFix – transforms SVG path coordinates to origin, not just viewBox

https://svgfix.net/
1•stardeltaio•23m ago•0 comments

Why AI works better on existing codebases

https://www.stromcapital.fi/blog/brownfield-advantage
1•ronistrom•23m ago•0 comments

The effect of testosterone on human bargaining behaviour (2009)

https://www.nature.com/articles/nature08711
2•mpweiher•24m ago•0 comments

Elevated error rates on Opus 4.5

https://status.claude.com/incidents/tgzm3mf45wzc
1•rvz•24m ago•0 comments

System Programming in Linux: A Hands-On Introduction "Demo" Programs

https://github.com/stewartweiss/intro-linux-sys-prog
1•teleforce•25m ago•0 comments

Show HN: Imago – open-source AI portrait generator with guided creation

https://github.com/tenngoxars/Imago
1•tenngoxars•27m ago•0 comments

Ethernet Switching Hits New Highs

https://www.nextplatform.com/2026/01/08/pushed-by-genai-and-front-end-upgrades-ethernet-switching...
2•ankitg12•27m ago•0 comments

Uber Conquered Database Overload

https://www.uber.com/en-BG/blog/from-static-rate-limiting-to-intelligent-load-management/
2•matesz•28m ago•0 comments

Show HN: I built free calculators for THC, alcohol, and caffeine detox timelines

https://www.detoxwater.com/tools/
1•xohails•29m ago•1 comments

Microsoft Graveyard

https://microsoftgraveyard.com
2•elashri•32m ago•0 comments

Scout AI Revolutionizes Security Intelligence with Amazon OpenSearch Service

https://aws.amazon.com/solutions/case-studies/maxsecurity-bigdataboutique/
1•synhershko•37m ago•0 comments

The Befunge Programming Language

https://esolangs.org/wiki/Befunge
2•askl•37m ago•0 comments

Show HN: PhotoCraft – an AI photo editor I built and shipped as my first iOS app

https://apps.apple.com/us/app/photocraft-ai-photo-editor/id6756682393
2•devavinoth12•38m ago•2 comments

Achieving Kafka reliability at scale with the Streaming Platform (2025)

https://www.datadoghq.com/blog/engineering/streaming-platform-kafka-custom-abstractions/
1•teleforce•44m ago•0 comments

Kuo: Apple's AI Deal with Google Is Temporary and Buys It Time

https://www.macrumors.com/2026/01/13/apple-google-ai-deal-is-temporary/
1•mgh2•44m ago•0 comments

Lore, A reasoning engine that stores the "why" behind code changes

1•almonerthis•47m ago•1 comments

UK police chief admits policy relied on CoPilot hallucination

https://www.telegraph.co.uk/news/2026/01/14/maccabi-police-chief-admits-misleading-mps-by-using-ai/
5•nanna•48m ago•2 comments

Jensen Huang Is Begging You to Stop Being So Negative About AI

https://gizmodo.com/jensen-huang-is-begging-you-to-stop-being-so-negative-about-ai-2000709335
3•robin_reala•48m ago•0 comments

London cracked mobile phone coverage on the Underground

https://www.ianvisits.co.uk/articles/how-london-finally-cracked-mobile-phone-coverage-on-the-unde...
1•ganonm•51m ago•0 comments

Wine stable release 11.0.0 is now available for Linux FreeBSD and macOS

https://www.wine-reviews.net/2026/01/wine-stable-release-1100-is-now.html
4•twickline•52m ago•0 comments

Show HN: I got PyTorch models running on WebGPU without ONNX export

https://github.com/jmaczan/torch-webgpu
1•yu3zhou4•57m ago•1 comments

UK government rolls back key part of digital ID plans

https://www.theguardian.com/politics/2026/jan/13/government-rolls-back-digital-identity-card-plans
4•chrisjj•58m ago•0 comments

Premature Optimization in Entertainment Development

https://medium.com/luminasticity/on-premature-optimization-in-entertainment-development-d2f66083cb26
1•bryanrasmussen•1h ago•0 comments