frontpage.

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Meta releases open data to train General AI Co-Scientists

https://huggingface.co/datasets/facebook/research-plan-gen
1•shash42•8m ago•0 comments

Show HN: PokéPath TD – Free Pokémon tower defense game

https://pokepathgame.com
1•airobus•17m ago•0 comments

Building Privacy Preserving RAG with Homomorphic Encryption

https://www.subhashdasyam.com/2025/11/building-privacy-preserving-rag-with.html
1•dxsecarch•19m ago•0 comments

Question for Engineering Leaders

https://shadowscoping.com/
1•rezat•19m ago•1 comments

Rapid Validation of Product Concepts with AI

https://luvsheth.com/p/rapid-validation-of-product-concepts
1•Reviving1514•20m ago•1 comments

Somebody Build This

1•Caritaspax•22m ago•3 comments

Who's in charge of Venezuela and what happens next?

https://www.bbc.com/news/articles/crmlz7r0zrxo
4•SilverElfin•25m ago•3 comments

Show HN: CloudSlash – Find AWS waste and generate Terraform state rm commands

1•drskyle•26m ago•0 comments

AGI Is Here

https://www.robinsloan.com/winter-garden/agi-is-here/
2•cmod•29m ago•1 comments

'Chinese Peptides' Are the Latest Biohacking Trend in the Tech World

https://www.nytimes.com/2026/01/03/business/chinese-peptides-silicon-valley.html
1•bookofjoe•30m ago•1 comments

They Said AI Would Replace You by Now

https://www.youtube.com/watch?v=dH_UvWmvny0
1•cable2600•30m ago•0 comments

Americans Choosing Cremation at Historic Rates, NFDA Report Finds

https://nfda.org/news/media-center/nfda-news-releases/id/9772/americans-choosing-cremation-at-his...
3•toomuchtodo•30m ago•0 comments

Damn Vulnerable AI Bank – Practice AI Security

https://dvaib.com
1•dxsecarch•30m ago•0 comments

Show HN: A Android Color Detection Auto Clicker with no full-screen ads

1•dopifier•31m ago•0 comments

Berlin power outages after left-wing anarchist attack on power cables

https://www.irishtimes.com/world/europe/2026/01/04/berlin-power-outages-after-left-wing-anarchist...
3•wslh•37m ago•1 comments

Don't Forget the WAL: How I Lost SQLite Data in Podman Containers

https://bkiran.com/blog/sqlite-containers-data-loss
3•thunderbong•38m ago•1 comments

Wanderly AI Travel App Waitlist

https://waitlister.me/p/wanderly
1•CuylerM•38m ago•1 comments

During Helene, I just wanted a plain text website

https://sparkbox.com/foundry/helene_and_mobile_web_performance
22•CqtGLRGcukpy•48m ago•11 comments

Agent Orchestration Is Not the Future

https://moridinamael.github.io/agent-orchestration/
1•mordymoop•51m ago•1 comments

What is Agent context engine

https://ragflow.io/basics/what-is-agent-context-engine
1•yingfeng•53m ago•0 comments

Tempest Future Fighter Aims for "Extreme Range," Twice F-35 Payload

https://www.twz.com/air/tempest-future-fighter-aims-for-really-extreme-range-twice-f-35-payload
1•throwoutway•55m ago•0 comments

Politics and the English Language – George Orwell [Essay]

https://www.orwellfoundation.com/the-orwell-foundation/orwell/essays-and-other-works/politics-and...
3•nomilk•59m ago•1 comments

Show HN: Vho – AST-based analysis for better AI refactoring of large codebases

https://vue-hook-optimizer.vercel.app/
2•huali•59m ago•1 comments

vLLM: An Efficient Inference Engine for Large Language Models

https://www2.eecs.berkeley.edu/Pubs/TechRpts/2025/EECS-2025-192.html
2•matt_d•1h ago•0 comments

Linuxulator on FreeBSD Feels Like Magic

https://hayzam.com/blog/02-linuxulator-is-awesome/
6•arch1e•1h ago•0 comments

Ask HN: What app features actually help vocabulary stick long-term?

1•hussein-khalil•1h ago•2 comments

Ask HN: Is there a better alternative to email?

1•DinakarS•1h ago•1 comments

AI Safety ArXiv Scraper

https://theguardrail.net/
2•chiwilliams•1h ago•0 comments

Translating Cave Story into Classical Latin with Gemini

https://www.semilin.dev/blog/doukutsu-translator
2•semilin•1h ago•0 comments

Show HN: I Made a Gamma Clone with 1 Prompt

https://prompt-to-ppt.lovable.app/
1•nsemikey•1h ago•1 comments