frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

I managed to get that lost UNIX v4 tape running on my Android tablet

https://old.reddit.com/r/termux/comments/1pv7si4/i_managed_to_get_that_lost_unix_v4_tape_running/
1•sipofwater•23s ago•0 comments

GitHub – rcarmo/feed-summarizer: The feed summarizer that powers feeds.carmo.io

https://github.com/rcarmo/feed-summarizer
1•janandonly•43s ago•0 comments

Show HN: VideoReview – Collaborative video review for games and animation

https://github.com/KirisameMarisa/video-review
1•KirisameMarisa•3m ago•0 comments

Ask HN: Payload in Amazon "Shipped" Emails

1•krautburglar•4m ago•0 comments

Show HN: Mandate – treating AI agents like economic actors, not scripts

https://github.com/kashaf12/mandate
1•kashaf12•5m ago•0 comments

I don't trust NPM install, so I built dev

https://github.com/f0i/dev
2•f0i•10m ago•1 comments

Agent Skills

https://github.com/skillmatic-ai/awesome-agent-skills
4•dergalem•14m ago•0 comments

Waymo Is Working on a Gemini AI Assistant. Here's the System Prompt

https://wongmjane.com/blog/waymo-gemini
1•kerim-ca•15m ago•0 comments

Ask HN: Share your favorite Christmas gimmicks

2•jFriedensreich•16m ago•0 comments

Exposing Honey's Evil Business Model [video]

https://www.youtube.com/watch?v=wwB3FmbcC88
1•echan00•16m ago•0 comments

Show HN: One AI API for word-accurate transcription, translation, and export

https://www.transcripthq.io/
1•ssreenithin•16m ago•0 comments

Show HN: Quintel, a Rule-Discovery Game

https://eternaldawn.itch.io/quintel
1•MageOfTheEast•16m ago•0 comments

How to Help Santa Claus Concurrently

https://wyounas.github.io/puzzles/concurrency/2025/12/23/how-to-help-santa-claus-concurrently/
2•simplegeek•18m ago•0 comments

Line scan camera image processing

https://daniel.lawrence.lu/blog/2025-09-21-line-scan-camera-image-processing/
1•vasco•20m ago•0 comments

How we solved for Cloudflare and Azure outages

https://thesidedish.flipdish.com/flipdish-shipping-forecast-multiple-clouds-with-a-chance-of-seve...
1•flibble•20m ago•0 comments

Ingestr: CLI tool to copy data between any databases with a single command

https://github.com/bruin-data/ingestr
2•saikatsg•24m ago•0 comments

We invited a man into our home at Christmas and he stayed with us for 45 years

https://www.bbc.co.uk/news/articles/cdxwllqz1l0o
25•rajeshrajappan•27m ago•2 comments

Agent-Skills-for-Context-Engineering

https://github.com/muratcankoylan/Agent-Skills-for-Context-Engineering
3•kim0•28m ago•0 comments

Good Morning to Those at Work

3•sandworm101•36m ago•0 comments

Extreme Ironing

https://en.wikipedia.org/wiki/Extreme_ironing
1•m4khno•37m ago•0 comments

Reflections on building internal tools after AI changed the workflow

1•kinj28•40m ago•0 comments

Indian rocket launches record-breaking BlueBird 6 smartphone satellite to orbit

https://www.space.com/space-exploration/launches-spacecraft/indian-rocket-launch-bluebird-6-satel...
1•saikatsg•44m ago•0 comments

Download Python Today

https://files.catbox.moe/taze7f.jpg
1•lihaciudaniel2•47m ago•0 comments

Contributing to Debezium: Fixing Logical Replication at Scale

https://engineering.zalando.com/posts/2025/12/contributing-to-debezium.html
2•saikatsg•49m ago•0 comments

Best practices for long-run LED strip installs (20–50M) to avoid flicker?

1•emmasuntech•51m ago•0 comments

Gluetun v3.41.0 Release – The Ranting Section

https://www.youtube.com/watch?v=SSkGpys40ck
1•FeelingGood•52m ago•0 comments

Prompts.chat: the social platform for AI prompts

https://prompts.chat
1•fka•55m ago•0 comments

Cursor UI is built with SolidJS

https://old.reddit.com/r/solidjs/comments/1puoifc/cursor_ui_is_built_with_solidjs/
2•itayadler•56m ago•0 comments

Show HN: FailCore – Execution-Time Safety Runtime for AI Agents

https://github.com/Zi-Ling/failcore
1•IntelliAvatar•57m ago•1 comments

Show HN: Awkward 90s Christmas Studio Portrait

https://picxstudio.com/templates/298-awkward-90s-christmas-studio-portrait
1•Yash16•59m ago•0 comments