frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

AI as a Compression Problem

https://dkg.fifthhorseman.net/blog/2025-ai-and-compression.html
1•pabs3•1m ago•0 comments

PanoptiCity – interactive map reveals the scale of mass surveillance worldwide

https://panopticity.fr/
1•pabs3•2m ago•0 comments

How Safe Is the Rust Ecosystem? A Deep Dive into Crates.io

https://mr-leshiy-blog.web.app/blog/crates_io_analysis/
1•RustSupremacist•6m ago•0 comments

Trump accepts Nobel Peace medal from Venezuelan opposition leader

https://www.smh.com.au/world/north-america/venezuelan-opposition-leader-says-she-presented-trump-...
2•KnuthIsGod•7m ago•2 comments

Gen X and Millennials Will Inherit Trillions in Real Estate over the Next Decade

https://www.wsj.com/real-estate/luxury-homes/millennial-genx-inherit-real-estate-wealth-d78b4454
1•alephnerd•12m ago•1 comments

From AI agent prototype to product: Lessons from building AWS DevOps Agent

https://aws.amazon.com/blogs/devops/from-ai-agent-prototype-to-product-lessons-from-building-aws-...
1•malahay•16m ago•1 comments

TranslateGemma: A new suite of open translation models

https://blog.google/innovation-and-ai/technology/developers-tools/translategemma/
2•anigbrowl•16m ago•0 comments

Show HN: Buildzr: Python DSL for Authoring C4 Models

https://github.com/amirulmenjeni/buildzr
1•amenji•18m ago•0 comments

Apple's Tactics Could Prevent Japan from Improving Browser Competition

https://open-web-advocacy.org/blog/how_apples_key_tactic_could_prevent_japans_smartphone_act_from...
1•donohoe•21m ago•0 comments

Boeing knew of flaw in part linked to UPS plane crash

https://www.bbc.com/news/articles/cly56w0p9e1o
8•1659447091•24m ago•1 comments

Microsoft Xbox Manufacturing in 2002

https://www.youtube.com/watch?v=YeQrQYFVlXA
1•guidedlight•26m ago•0 comments

Image FX – Free One-Click AI Photo Editor and Image Generator

https://image-fx.app
1•julian2026•26m ago•0 comments

European Alternatives for Digital Products

https://european-alternatives.eu
1•memset•28m ago•0 comments

Show HN: Dev Utility Hub – Client-side only developer tools (JSON, JWT, Cron)

1•hun-ing•31m ago•0 comments

vLLM-MLX – Run LLMs on Mac at 464 tok/s

https://github.com/waybarrios/vllm-mlx
2•waybarrios•37m ago•1 comments

Ericsson Doing Quiet Layoffs

5•allabouttech•38m ago•0 comments

Noninvasive brain treatment for depression proves helpful

https://www.cnn.com/2026/01/15/health/saint-tms-depression-therapy-wellness
5•1659447091•39m ago•0 comments

How to Speak LLM

https://chuanqisun.github.io/how-to-speak-llm/
1•osmoscraft•39m ago•0 comments

Cryptography 30 years apart: Ascon on an HP-16C

https://dram.page/p/ascon-hp16c/
2•todsacerdoti•42m ago•0 comments

Show HN: OneView – One-page website builder you can share OR embed anywhere

https://www.oneview.work/en
1•fengs•44m ago•0 comments

My Projects in 2025

https://simonhartcher.com/posts/2026-01-16-my-projects-in-2025/
2•deevus•45m ago•1 comments

Predictions for the New Year

https://lwn.net/Articles/1052269/
1•signa11•46m ago•0 comments

Hytale Calculator

https://hytalecalculator.com/
4•quchao•46m ago•1 comments

After Hostile Takeover Fail, Ellison's Paramount Skydance Sues WBD Netflix

https://finance.yahoo.com/news/failed-hostile-takeover-bid-david-023115712.html
7•stopbulying•48m ago•2 comments

Writing First, Tooling Second

https://susam.net/writing-first-tooling-second.html
2•thunderbong•48m ago•0 comments

Infant needs CPR after feds unleash flash-bangs on family van with 6 kids inside

https://www.rawstory.com/ice-minneapolis-2674900256/
11•perihelions•49m ago•1 comments

Ask HN: What is your opinion on workplace review websites like Glassdoor?

3•slaye•53m ago•1 comments

Open-source specification for building multi-provider LLM interfaces

https://www.openresponses.org/
3•fratellobigio•54m ago•0 comments

Legal framework for crypto is hitting some snags

https://www.nytimes.com/2026/01/15/business/dealbook/crypto-bill-coinbase.html
1•paulpauper•55m ago•0 comments

Should I have tried to insider trade on debunking a famous study?

https://coldbuttonissues.substack.com/p/should-i-have-tried-to-insider-trade
2•paulpauper•56m ago•0 comments