frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Unlock real power of Google trends

https://daily-trending.org
1•azamsayeedit•53s ago•1 comments

Show HN: I built a puzzle game for couples to connect

https://lovepuzzle.com/
1•lmq10•1m ago•0 comments

Crotch enhancements: The latest controversy at the Winter Olympics

https://news.sky.com/story/crotch-enhancements-the-latest-controversy-at-the-winter-olympics-1350...
1•austinallegro•2m ago•0 comments

A handful of wholesomely bad ads for 90s SPARC clones

https://buttondown.com/suchbadtechads/archive/integrix/
1•rfarley04•2m ago•0 comments

It's not finance, it's your pensions

https://theloop.ecpr.eu/its-not-finance-its-your-pensions/
1•kome•7m ago•0 comments

Show HN: Vfv – Ultra-lightweight terminal file viewer for vibe coding

https://github.com/noumi0k/vfv
1•noumi0k•9m ago•1 comments

Vibe Migrating over 1000 Pages and Losing 80% of Our Traffic

https://www.hopsworks.ai/post/vibe-migrating-1k-pages-and-losing-80-percent-of-our-traffic
1•LexSiga•10m ago•0 comments

FreeBSD Audio Diagnostics and Optimization

https://m4c.pl/blog/freebsd-audio-diagnostics-and-optimization/
1•m4c-pl•10m ago•1 comments

Show HN: LibTTAK- Explicit lifetime-as-data for C systems

https://github.com/gg582/libttak
1•gg582•11m ago•0 comments

President John F. Kennedy's national address on the extraterrestrial presence

https://www.youtube.com/watch?v=gWig9s2GylA
1•keepamovin•12m ago•0 comments

AMD OpenSIL and Open-Source Firmware Efforts for Confidential Compute

https://www.phoronix.com/news/3mdeb-FOSDEM-2026-Firmware
1•grigio•15m ago•0 comments

OpenTelemetry Baggage Enables Global Context for Distributed Systems

https://signoz.io/blog/otel-baggage/
1•dhruv_ahuja•17m ago•1 comments

Text classification with Python 3.14's ZSTD module

https://maxhalford.github.io/blog/text-classification-zstd/
4•Lemaxoxo•18m ago•1 comments

Show HN: PR Bro – a TUI that helps prioritize PRs

https://github.com/toniperic/pr-bro
1•toneric•19m ago•0 comments

NASA astronauts will soon fly with the latest smartphones

https://xcancel.com/NASAAdmin/status/2019259382962307393#m
1•doener•20m ago•0 comments

Ask HN: Do you use LLM memory features?

1•grigio•21m ago•1 comments

Chat Control 1.0: Civil Society Mobilizes Against Extending Mass Surveillance

https://www.patrick-breyer.de/en/chat-control-1-0-civil-society-mobilizes-against-extending-mass-...
3•latexr•22m ago•0 comments

Discover another instant gaming spot

https://t3.baent.top
1•TrendSpotterPro•24m ago•1 comments

UUP dump – download UUP files from Windows Update servers with ease

https://uupdump.net/
2•gjvc•30m ago•0 comments

I Built a 6 BIPS JIT in Five Months

https://unlikelyemphasis.substack.com/p/i-built-a-6-bips-jit-in-five-months
3•brazilofmux•37m ago•0 comments

Pandoc for the people: Convert documents without leaving the browser

https://pandoc.org/app/
3•romes•40m ago•0 comments

Type Variance

https://en.wikipedia.org/wiki/Type_variance
1•tosh•41m ago•0 comments

A 2.5x faster Postgres parser with Claude Code

https://multigres.com/blog/ai-parser-engineering
8•kiwicopple•47m ago•3 comments

The age of a treacherous, falling dollar

https://www.economist.com/leaders/2026/02/05/the-age-of-a-treacherous-falling-dollar
2•petethomas•47m ago•0 comments

The End of Human Relationships

https://www.hackyexperiments.com/blog/the-end-of-human-relationships
4•bilater•47m ago•0 comments

shamir secret sharing + age: my digital safe after a few 2 many bike concussions

https://eljojo.github.io/rememory/
3•eljojo•47m ago•1 comments

The Assault on Ukraine's Power Grid

https://www.newyorker.com/news/the-lede/the-assault-on-ukraines-power-grid
3•petethomas•48m ago•0 comments

Where Drupal Still Wins in 2026?

https://kokocinski.me/blog/where-drupal-still-wins-2026
2•firflant•49m ago•0 comments

Tesla likely can't escape 'Blade Runner 2049' lawsuit

https://www.reuters.com/legal/litigation/tesla-musk-likely-cant-escape-blade-runner-2049-lawsuit-...
3•jmkd•50m ago•1 comments

A tale of three kings (Python, Elixir, Go) (2017)

https://medium.com/@marcelo_lebre/a-tale-of-three-kings-e0be17a16e2b
3•tosh•51m ago•0 comments