frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

How to Use Trending Topic and Keyword Finder

https://metaconvert.blogspot.com/2025/12/how-to-use-trending-topic-and-keyword-finder.html
1•MetaConvert•37s ago•0 comments

Compressing Embedded Files in Go

https://vincent.bernat.ch/en/blog/2025-go-embed-compressed
1•todsacerdoti•2m ago•0 comments

Python Software Foundation end-of-year fundraiser

https://www.python.org/psf-landing/
1•CaliforniaKarl•2m ago•0 comments

Intermittent Hypoxia Increases Blood Flow and Benefits Executive Function

https://onlinelibrary.wiley.com/doi/10.1111/psyp.70161
1•PaulHoule•2m ago•0 comments

Just Right FM

https://justright.fm/
1•ddrscott•2m ago•0 comments

1Crossword: Crosswords for Your Password Manager

https://eieio.games/blog/1Crossword/
1•jstyles•5m ago•0 comments

Show HN: We open-sourced our internal tool for scoring PRs with Claude AI

https://github.com/MergeMint/mergemint-app
1•textcortex•7m ago•0 comments

SAGE: Semi-Automatic Ground Environment Air Defense System

https://www.ll.mit.edu/about/history/sage-semi-automatic-ground-environment-air-defense-system
1•stmw•7m ago•0 comments

Twins reared apart do not exist

https://davidbessis.substack.com/p/twins-reared-apart-do-not-exist
2•bookofjoe•9m ago•0 comments

Ask HN: YouTube Only Showing Ads?

1•OhMeadhbh•12m ago•3 comments

Urik – The Privacy focused Android keyboard is now in Open Beta

https://github.com/urikdev/Urik
2•urikdev•13m ago•1 comments

Japan's first hotel with a human washing machine is now ready for you

https://soranews24.com/2025/12/10/japans-first-hotel-with-a-human-washing-machine-is-now-ready-fo...
1•rawgabbit•15m ago•1 comments

What UI do you use on top of data engineering tools to look at data?

1•platypii•16m ago•1 comments

Powers of Ten (1977) [video]

https://www.youtube.com/watch?v=0fKBhvDjuy0
5•susam•19m ago•0 comments

3D-Agent

https://3d-agent.com
2•gsunshinel•22m ago•0 comments

Tourists to US would have to reveal 5 years of social media activity: new plan

https://www.theguardian.com/us-news/2025/dec/10/tourists-social-media-trump
7•mikhael•27m ago•1 comments

Creating a Benevolent Industrial Deployer for Memecoins

https://substack.com/@fitziswriting/p-181255460
2•fitzyap•28m ago•0 comments

Internal RFCs saved us months of wasted work

https://highimpactengineering.substack.com/p/the-illusion-of-shared-understanding
3•romannikolaev•29m ago•0 comments

Replican't

https://thinkhuman.com/replicant/
5•jamesgill•29m ago•1 comments

RAM Is Ruining Everything

https://www.theverge.com/report/839506/ram-shortage-price-increases-pc-gaming-smartphones
2•edward•29m ago•0 comments

Transcripts dialectics Gregory Bateson 19th of July 1967 [pdf]

https://villonfilms.ca/main/transcripts-dialectics-gregory-bateson-19-7-67.pdf
2•oriettaxx•33m ago•0 comments

Hark: Voice prompts for LLMs, meeting minutes, quick voice journaling

https://github.com/FPurchess/hark
2•FPurchess•34m ago•0 comments

I Pitched Netflix a Friends-Style Casablanca Sitcom

https://stohl.substack.com/p/i-pitched-netflix-a-casablanca-sitcom
1•FreeQueso•35m ago•0 comments

Show HN: A 2-row, 16-key keyboard designed for smartphones

https://k-keyboard.com/Why-QWERTY-mini
3•QWERTYmini•37m ago•0 comments

Verum 2 closed back option

https://www.pragmaticaudio.com/articles/2023/07/welcome-to-pragmatic-audio/
1•juniedai•38m ago•1 comments

Ask HN: Why is codex suddenly giving me this error?

1•fcpguru•39m ago•0 comments

Show HN: Hacker News, Disstilled

https://www.trydistilled.ai/
1•alexbemore•40m ago•0 comments

Pro-AI super PAC launches first candidate ads

https://www.cnn.com/2025/12/10/tech/pro-ai-super-pac-launches-first-candidate-ads
4•e12e•40m ago•2 comments

Show HN: Skald – open-source context layer API that runs in your VPC

https://www.useskald.com/
4•yakkomajuri•41m ago•1 comments

A Thousand and One Nights in Italy

https://publicdomainreview.org/essay/a-thousand-and-one-nights-in-italy
3•lermontov•42m ago•0 comments