frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•6mo ago

Comments

kate_at_refact•6mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Local LLMs are how nerds now justify a big computer they don't need

https://world.hey.com/dhh/local-llms-are-how-nerds-now-justify-a-big-computer-they-don-t-need-af2...
1•4dm1r4lg3n3r4l•25s ago•0 comments

NeuroCode – a structural IR engine for code (Infra for AI)

https://github.com/gabrielekarra/neurocode
1•gabrielekarra•1m ago•0 comments

Launching the Genesis Mission

https://www.whitehouse.gov/presidential-actions/2025/11/launching-the-genesis-mission/
1•jonbaer•3m ago•0 comments

What's next for AlphaFold: A conversation with a Google DeepMind Nobel laureate

https://www.technologyreview.com/2025/11/24/1128322/whats-next-for-alphafold-a-conversation-with-...
1•rbanffy•4m ago•0 comments

Reducing MCP token usage by 100x – you don't need code mode

https://www.speakeasy.com/blog/how-we-reduced-token-usage-by-100x-dynamic-toolsets-v2
1•subomi•4m ago•0 comments

What's Like to Be an AI/ML Engineer

https://newsletter.eng-leadership.com/p/whats-really-like-to-be-an-aiml-engineer
1•rbanffy•4m ago•0 comments

Blue Origin to Build a "Super Heavy" Rocket to Compete with Starship

https://www.universetoday.com/articles/blue-origin-to-build-a-super-heavy-rocket-to-compete-with-...
2•rbanffy•9m ago•0 comments

How Did REST Come to Mean the Opposite of REST? (2022)

https://htmx.org/essays/how-did-rest-come-to-mean-the-opposite-of-rest/
1•BerislavLopac•9m ago•0 comments

Can I work my 9-5 Job from Inside Skyrim [video]

https://www.youtube.com/watch?v=v9gIK4j1Ip0
1•ecares•16m ago•0 comments

HP 785M/780M Ultra Fast Scrolling Review and Stutter Fix

https://www.jacopofranco.com/projects/hp-785m780m-ultra-fast-scrolling-review
1•omblivion•16m ago•0 comments

GrapheneOS leaves OVH: "France isn't safe for open source privacy projects."

https://twitter.com/GrapheneOS/status/1993035936800584103
4•bocytron•21m ago•0 comments

Sustainable mycoprotein nutrition: metabolic engineering of Fusarium venenatum

https://www.cell.com/trends/biotechnology/fulltext/S0167-7799(25)00404-4?_returnURL=https%3A%2F%2...
1•PaulHoule•25m ago•0 comments

Show HN: AI search context engine for startups and engineering teams

https://crewmem.com/blog/AI-Search-Engine-for-Indie-Startups-and-Engineering-Teams
1•flabberghasted•26m ago•0 comments

China launches Shenzhou 22 spacecraft to return 3 stranded astronauts

https://economictimes.indiatimes.com/news/international/world-news/china-launches-shenzhou-22-spa...
2•ZeljkoS•27m ago•0 comments

Show HN: Pixeli – The CLI Tool for Creating Beautiful Image Grids and Mosaics

https://github.com/pakdad-mousavi/pixeli
1•zephyrrd•28m ago•0 comments

Disasters I've seen in a microservices world

https://world.hey.com/joaoqalves/disasters-i-ve-seen-in-a-microservices-world-a9137a51
2•enz•29m ago•0 comments

Human-AI Decision Making Costs in Synthetic Teams

https://arxiv.org/abs/2511.19312
1•undefinedmethod•30m ago•0 comments

Astrl

https://www.tryastrl.com/
2•jjwilkin•35m ago•0 comments

LLMs in Predicaments

https://dxdt.ch/blog.php?blog=predicaments
2•lkm0•36m ago•0 comments

When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection

https://arxiv.org/abs/2511.19436
1•badmonster•38m ago•1 comments

Show HN: ‘Big Bang’ AES Encryption Objects (1of1, offline, un-cross-decryptable)

https://github.com/0north-eth/zero-north-vault/releases/tag/v1.0-demo
2•0north•38m ago•1 comments

Show HN: Echosnap AI – A simple voice-first notes app

https://www.useechosnap.com/
2•pradeep3•38m ago•0 comments

How do we keep apps maintained on Flathub?

https://tim.siosm.fr/blog/2025/11/24/building-better-app-store-flathub/
2•aragilar•42m ago•1 comments

Show HN: DuckDuckGo Search Results Scraper

https://apify.com/johnvc/duckduckgoseoscraper
1•johncole•44m ago•0 comments

Show HN: I built a tool for instant, thoughtful X/LinkedIn replies in your feed

https://www.yapyap.fun/
1•kartik_malik•50m ago•1 comments

Show HN: Banana Studio – AI Image Editor Powered by Nano Banana

https://banana-studio-nano.vercel.app/
2•sumit-paul•51m ago•0 comments

Perplexity Comet UXSS

https://www.hacktron.ai/blog/perplexity-comet-uxss
2•Mohansrk•52m ago•0 comments

How the K-Shaped Economy Is Hurting Everyone but the Rich

https://www.bloomberg.com/news/articles/2025-11-24/how-the-k-shaped-economy-is-hurting-everyone-b...
1•zerosizedweasle•53m ago•2 comments

Show HN: Scrape Baidu

https://apify.com/johnvc/baidu-search-scraper
1•johncole•54m ago•0 comments

How to Create Advertising That Sells by David Ogilvy (1972)

https://borakaizen.medium.com/how-to-create-advertising-that-sells-ad-by-david-ogilvy-1972-a1b4af...
1•kaizenb•55m ago•0 comments