frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

-tucky

https://languagelog.ldc.upenn.edu/nll/?p=58650
1•benatkin•1m ago•0 comments

InfiniDB: The Unreliable Database of Everything

https://tncardoso.com/blog/2025/12/infinidb-the-unreliable-database-of-everything/
1•zibisco•1m ago•0 comments

PlugOS – An isolated OS that runs on an untrusted phone

2•drding•10m ago•1 comments

Show HN: Open-source Planning Poker web app with no voting limits

https://github.com/rie03p/planning-poker
1•rie03p•13m ago•0 comments

Do we need semantic layer anymore? What are the limitations of LLMs?

https://motherduck.com/blog/who-needs-a-semantic-layer-anyway/
1•thamizhan2611•15m ago•0 comments

Björk does calm Teardown of Sony Trinitron TV, Christmas 1988 [video]

https://www.youtube.com/watch?v=SNQtQWjX-sA
2•keepamovin•18m ago•0 comments

Dolphin Progress Release 2512

https://dolphin-emu.org/blog/2025/12/22/dolphin-progress-report-release-2512/
1•zdw•19m ago•0 comments

Old English Computer Glossary

https://web.archive.org/web/20231120210517/http://www.u.arizona.edu/~ctb/wordhord.html
1•LAC-Tech•24m ago•0 comments

Claude Code with API Key?

https://old.reddit.com/r/ClaudeAI/comments/1jwvssa/comment/mtt0urz/
1•behnamoh•24m ago•0 comments

Microsoft wants to replace its C and C++ codebase, perhaps by 2030

https://www.theregister.com/2025/12/24/microsoft_rust_codebase_migration/
1•0in•29m ago•1 comments

Pennsylvania High Court Rules Police Can Access Google Searches Without Warrant

https://reclaimthenet.org/pennsylvania-court-rules-no-privacy-in-google-searches
2•imglorp•33m ago•1 comments

A new immunotherapy approach could work for many types of cancer

https://news.mit.edu/2025/new-immunotherapy-approach-could-work-many-types-cancer-1216
2•0in•37m ago•0 comments

QWED – Deterministic Verification for AI

https://docs.qwedai.com/
1•handfuloflight•39m ago•0 comments

Gave My RGB Fans a Job: 38-Pixel Screen Mirror

https://seg6.space/posts/rgb-sync/
1•seg6•40m ago•0 comments

Ask HN: Will SLMs be what bursts the LLM bubble cos you can run them on a phone?

1•aniijbod•45m ago•0 comments

They graduated from Stanford. Due to AI, they can't find a job

https://www.latimes.com/business/story/2025-12-19/they-graduated-from-stanford-due-to-ai-they-can...
2•osnium123•45m ago•0 comments

We interfaced single-threaded C++ with multi-threaded Rust and lived

https://antithesis.com/blog/2025/rust_cpp/
1•wwilson•47m ago•0 comments

Evaluating Context Compression for AI Agents

https://factory.ai/news/evaluating-compression
1•gmays•50m ago•0 comments

Zodiac Z13 Decryption

https://colab.research.google.com/drive/19p4n1aMyeYte1jC4P3GKflMgD6xuZAvV
3•sgustard•50m ago•1 comments

Manufactured Inevitability and the Need for Courage

https://theconvivialsociety.substack.com/p/manufactured-inevitability-and-the
4•danielam•50m ago•0 comments

Physicists found a way to make thermodynamics work in the quantum world

https://www.sciencedaily.com/releases/2025/12/251223084615.htm
3•ashishgupta2209•1h ago•0 comments

Don't Become the Machine

https://armeet.bearblog.dev/becoming-the-machine/
5•armeet•1h ago•2 comments

You Can Get Every AI Model for Free

https://infiniax.ai
2•ZacharyGolinger•1h ago•1 comments

Ask HN: Critique wanted — granular-physics pyramid preprint

https://zenodo.org/records/18036910
1•Sherlock_Blight•1h ago•1 comments

The semantic layer is dead. Long live the wiki

https://promptql.io/blog/semantic-layer-dead-long-live-wiki
4•tirumaraiselvan•1h ago•0 comments

Big Space Sandwich Broke a Record

https://nautil.us/this-big-space-sandwich-broke-a-record-1256821/
2•fleahunter•1h ago•0 comments

China bans sharing 'obscene' material – potentially including sexting

https://www.washingtonpost.com/world/2025/12/23/china-porn-ban-online-censorship/
3•0in•1h ago•0 comments

Yendor: A Zach-like, rogue-like game and language made in 7 days

https://github.com/olifog/YENDOR
2•azhenley•1h ago•0 comments

China Delays Plans for Mass Production of Self-Driving Cars After Accident

https://www.nytimes.com/2025/12/23/business/china-autonomous-cars-driving.html
2•bookofjoe•1h ago•1 comments

Poetiq achieves 75% at under $8 / problem using GPT-5.2 X-High on ARC-AGI-2

https://poetiq.ai/posts/arcagi_announcement/
3•mromanuk•1h ago•0 comments