frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: CodeProt – Filter static analysis false positives in CI

https://codeprot.com
1•allenz_cheung•9m ago•0 comments

(Norway) New Record: Almost 100% EV Registrations in November

https://www.electrive.com/2025/12/01/norway-sets-new-record-with-near-100-electric-vehicle-regist...
1•JojoFatsani•11m ago•0 comments

Roko's Dancing Basilisk

https://boston.conman.org/2025/12/02.1
2•todsacerdoti•12m ago•0 comments

Human art in a post-AI world should be strange

https://www.owlposting.com/p/art-in-a-post-ai-world-should-be
2•crescit_eundo•12m ago•0 comments

Young Ants Beg for Death When Sick, New Study Reveals

https://www.sciencealert.com/young-ants-beg-for-death-when-sick-new-study-reveals
1•ashishgupta2209•13m ago•0 comments

The Atari Jaguar's Last Roar

https://thedeletedscenes.substack.com/p/not-with-a-roar-but-with-a-whimper
1•adelmastro•14m ago•0 comments

"Diff-Focus: Reduce code review ramp-up time with heuristic diff summarization"

https://github.com/yksanjo/diff-focus-chrome
1•yksanjo•15m ago•0 comments

Show HN: The Forge Calculator for Roblox "The Forge"

https://theforgecalculator.org
1•takennap•23m ago•0 comments

Show HN: I built alwayswith.us to easily add deceased loved ones into photos

https://alwayswith.us
1•jrpribs•23m ago•0 comments

Gym workout set and reps tracker

https://www.setly.org/
1•abdullah9•24m ago•0 comments

Accounting red flags at PDD (2023)

https://www.transparently.ai/blog/accounting-red-flags-at-pinduoduo
2•mgh2•26m ago•0 comments

Bio-AI with a Conscience Kernel and Self-Correcting Identity

1•KIDDOUTLAW•28m ago•0 comments

Ambriel

https://ambriel.io
1•jesuscasdf•28m ago•0 comments

Openterface KVM-GO – Crowd Supply

https://www.crowdsupply.com/techxartisan/openterface-kvm-go
1•evanjrowley•30m ago•1 comments

AI Psychosis in First Person

https://kennethreitz.org/essays/2025-09-08-the_prophets_frequency_on_reading_divine_static
5•maraoz•30m ago•1 comments

AI-powered surveillance firms are gunning for a share of the Gaza spoils

https://www.972mag.com/ai-surveillance-gaza-palantir-dataminr/
2•cramsession•33m ago•0 comments

AI's Wrong Answers Are Bad. Its Wrong Reasoning Is Worse

https://spectrum.ieee.org/ai-reasoning-failures
3•pseudolus•35m ago•1 comments

Wikipedia's most-read articles of 2025

https://wikimediafoundation.org/news/2025/12/02/announcing-wikipedias-most-read-articles-of-2025/
1•andsoitis•38m ago•0 comments

Thoughts on AI Progress

https://www.dwarkesh.com/p/thoughts-on-ai-progress-dec-2025
2•tfirst•41m ago•0 comments

A Directory of Every AI Tool for Hardware Engineers

https://www.hardwareai.directory
1•anu_bonth•42m ago•1 comments

Basecamp/Fizzy

https://github.com/basecamp/fizzy
3•doppp•44m ago•0 comments

Remove Attachments from Gmail Messages

https://attachments-extractor.ybouane.com/
1•michaelrkn•46m ago•1 comments

Built a tool which sizes and selects water filters

https://hydroanalyze.tech/
1•harishiitkgp7•53m ago•0 comments

Intrarectal perfluorodecalin for enteral ventilation in a first-in-human trial

https://www.cell.com/med/abstract/S2666-6340(25)00314-9
1•surprisetalk•53m ago•0 comments

Ambsheet: A spreadsheet for exploring scenarios [video]

https://www.youtube.com/watch?v=EtC2XiGFh7E
2•surprisetalk•53m ago•0 comments

Animalcules and Their Motors

https://www.asimov.press/p/flagella
1•surprisetalk•53m ago•0 comments

Planetary Robotics. Beyond Humanoids.

https://akash.earth/
1•maxnajer•54m ago•1 comments

Kohler Can Access Pictures from "End-to-End Encrypted" Toilet Camera

https://varlogsimon.leaflet.pub/3m6zrw6k2bs2p?interactionDrawer=quotes
48•TimDotC•59m ago•32 comments

Reverse-engineering Claude's sandbox, then building my own

https://michaellivs.com/blog/sandboxed-execution-environment
1•handfuloflight•1h ago•0 comments

Unpaid Labour in Productive Capacity

https://danieltan.weblog.lol/2025/12/appendix-b-unpaid-labour-in-productive-capacity
2•danieltanfh95•1h ago•0 comments