frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Sooko.ai Launches AI Ecosysystem

https://www.sooko.ai/
1•Femiaguda•2m ago•1 comments

Show HN: QBridge, a clean, modern iOS alternative to Cordova and Capacitor

https://github.com/Qbix/QBridge/blob/main/README.md
1•EGreg•4m ago•0 comments

Paralysed man controls robots using China's BCI tech

https://scienceclock.com/china-brain-computer-interface-paralysed-man-controls-robots-neuralink/
1•ashishgupta2209•6m ago•0 comments

Show HN: Claudereview – Share Claude Code Sessions with PRs and More

https://claudereview.com/
1•eigen-vector•7m ago•0 comments

Pagebound is an independent Goodreads alternative

https://pagebound.co/
2•MajorBee•11m ago•0 comments

Deliberate Deliberation

1•Josf•12m ago•0 comments

Tracking Shell Scripts (and Python, Perl, etc.) with eBPF Is Hard

https://substack.bomfather.dev/p/tracking-shell-scripts-and-python
3•neil_naveen•13m ago•0 comments

The HTML Elements Time Forgot

https://www.htmhell.dev/adventcalendar/2025/22/
1•birdculture•13m ago•0 comments

Rolex Tries to Beat Watch Flippers at Their Own Game

https://www.wsj.com/finance/rolex-watch-secondhand-market-3ddb113e
1•bookofjoe•15m ago•1 comments

How uv got so fast

https://nesbitt.io/2025/12/26/how-uv-got-so-fast.html
1•zdw•15m ago•0 comments

Pre, Mid, Post-Training Way of Life

https://fakepixels.substack.com/p/pre-mid-post-training-way-of-life
1•jger15•17m ago•0 comments

Matz 1/2: A single email sparked Ruby's growth

https://en.kaigaiiju.ch/episodes/matz1
1•kibitan•19m ago•0 comments

Show HN: Ad-sentinel – An AI powered ad-blocker

https://github.com/johnmckay-reward/ad-sentinel
1•jmkni•20m ago•0 comments

Experts Explore New Mushroom Which Causes Fairytale-Like Hallucinations

https://nhmu.utah.edu/articles/experts-explore-new-mushroom-which-causes-fairytale-hallucinations
1•astronads•20m ago•1 comments

Matz 2/2: The trajectory of Ruby's growth, Open-Source Software today etc.

https://en.kaigaiiju.ch/episodes/matz2
1•kibitan•21m ago•0 comments

C/C++ Embedded Files

https://www.4rknova.com//blog/2013/01/27/cpp-embedded-files
11•ibobev•22m ago•2 comments

Bowie's ODE solver and the nonlinear pendulum

https://www.johndcook.com/blog/2025/12/23/bowie-integrator-and-the-nonlinear-pendulum/
2•ibobev•22m ago•0 comments

ZJIT is now available in Ruby 4.0

https://railsatscale.com/2025-12-24-launch-zjit/
2•ibobev•24m ago•0 comments

I Exposed Minnesota's Billion Dollar Fraud Scandal [video]

https://www.youtube.com/watch?v=r8AulCA1aOQ
1•almosthere•24m ago•0 comments

Poor Charlie's Almanack

https://www.stripe.press/poor-charlies-almanack
1•gregzeng95•30m ago•0 comments

Mostlymatter: A fork of Mattermost by Framasoft

https://packages.framasoft.org/projects/mostlymatter/
2•SubiculumCode•32m ago•0 comments

The Renaissance book that heralded growth

https://worksinprogress.co/issue/the-renaissance-book-that-heralded-growth/
1•pseudolus•32m ago•0 comments

Osint Your Future Employer

https://piotrmackowski.com/2025/03/28/OSINT-your-future-employer.html
2•ptrmc•34m ago•0 comments

New science points to 4 distinct types of autism

https://www.washingtonpost.com/health/2025/12/26/autism-research-diagnosis-subtypes/
1•pseudolus•35m ago•1 comments

Show HN: Turn your GitHub profile into a clean, shareable visual card

https://mygit.syigen.com/
2•dewmal•38m ago•0 comments

Depth on Demand

https://solmaz.io/depth-on-demand
2•hosolmaz•39m ago•0 comments

Optimal Classification Cutoffs

https://finite-sample.github.io/optimal-classification-cutoffs/
1•neehao•39m ago•0 comments

Fix Claude's Enter Key

https://chromewebstore.google.com/detail/fix-claudes-enter-key/odnbnplcfenobhmghdpiebbjdgchinjm
1•gjvc•41m ago•0 comments

China isn't just dumping cheap goods anymore – it's sending caviar

https://www.ft.com/content/461009e1-ec74-47ab-ae6b-72a32474df31
5•bookofjoe•43m ago•2 comments

Show HN: Loki Mode – 37 AI agents that autonomously build your startup

https://github.com/asklokesh/claudeskill-loki-mode
3•slogansand•44m ago•1 comments