frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•6mo ago

Comments

kate_at_refact•6mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

What kind of grownup I want to be

https://sashachapin.substack.com/p/what-kind-of-grownup-i-want-to-be
1•eatitraw•6m ago•0 comments

Rock Paper Scissors Is Not Solved, in Practice

https://inchpin.substack.com/p/rock-paper-scissors-is-not-solved
1•eatitraw•7m ago•0 comments

Better GitHub Notifications Dashboard

https://github-notifications.dev/
2•shrimalmadhur•7m ago•0 comments

Crypto hoarding company shares under pressure as risk appetite wanes

https://www.reuters.com/business/finance/crypto-hoarding-company-shares-under-pressure-risk-appet...
1•1vuio0pswjnm7•8m ago•0 comments

Italy's Competition Watchdog Broadens WhatsApp AI Policy Probe

https://www.wsj.com/tech/ai/italys-competition-watchdog-broadens-whatsapp-ai-policy-probe-64c904cd
1•1vuio0pswjnm7•12m ago•0 comments

X402 – protocol for micropayments and the rise of an agentic economy

https://oasis.net/blog/x402-https-internet-native-payments
1•adrianwaj•15m ago•1 comments

Show HN: Tacopy – Tail Call Optimization for Python

https://github.com/raaidrt/tacopy
1•raaid-rt•19m ago•0 comments

CrowdStrike: Security Flaws DeepSeek-Generated Code Linked to Political Triggers

https://www.crowdstrike.com/en-us/blog/crowdstrike-researchers-identify-hidden-vulnerabilities-ai...
1•fosco•20m ago•1 comments

Harvard Database Hacked in Latest Ivy League Cyberattacks

https://www.bloomberg.com/news/articles/2025-11-22/harvard-hacked-in-most-recent-case-of-ivy-leag...
1•1vuio0pswjnm7•24m ago•0 comments

Best Free Headshot Generator 2026: 9 AI Tools Tested and Compared

https://www.aiheadshotreviews.com/articles/best-free-headshot-generator
1•naveensky•31m ago•1 comments

Why Jailbreaking Calculators Is (Unfortunately) a Thing Now [video]

https://www.youtube.com/watch?v=3a-X6FZfl2Y
2•josephcsible•32m ago•0 comments

Discord for LLMs in a single 3.7k-line HTML file

https://modelarena.xyz
2•Estrick•34m ago•1 comments

Onion AI – AI Poster Maker

https://onionai.so/en
1•ovelv•41m ago•0 comments

Ask HN: How can we measure AI's impact on global developer productivity?

2•obilgic•44m ago•1 comments

Listen to music like it's 2005

https://lukecyca.com/2025/listen-to-music-like-its-2005.html
2•ireflect•50m ago•0 comments

Dennis Sullivan: Simplicity Is the Point (2014) [video]

https://www.youtube.com/watch?v=ixc0TNfT0ks
1•vismit2000•55m ago•0 comments

"Many students are simply refusing to do *anything*."

https://bsky.app/profile/jesbattis.bsky.social/post/3m6pvvkojqk2l
4•dougb5•56m ago•0 comments

AI just proved Erdos Problem #124

https://twitter.com/vladtenev/status/1994922827208663383
3•nl•1h ago•0 comments

Decoding Ageing: Blueprint Theory of Ageing [video]

https://www.youtube.com/watch?v=SQAFLDTvQfM
1•escargot•1h ago•0 comments

Show HN: LLM Newsletter Kit – A TypeScript Framework for AI Newsletters

https://github.com/kimhongyeon/llm-newsletter-kit-core
1•hongyeon•1h ago•0 comments

Kopi Luwak

https://en.wikipedia.org/wiki/Kopi_luwak
5•thunderbong•1h ago•0 comments

Tom Stoppard, Award-Winning Playwright of Witty Drama, Dies at 88

https://www.nytimes.com/2025/11/29/theater/tom-stoppard-dead.html
4•mitchbob•1h ago•1 comments

The Markets Aggregating Political Reality

https://freesystems.substack.com/p/inside-the-markets-aggregating-political
1•gwintrob•1h ago•0 comments

Adolescence lasts into 30s – new study shows four pivotal ages for your brain

https://www.bbc.co.uk/news/articles/cgl6klez226o
1•ifh-hn•1h ago•2 comments

Show HN: I built a Web 1.0 budgeting tool, Budget Pro 3000

https://budgetpro3000.com
1•adrianvxyz•1h ago•0 comments

Do we need a new GitHub for AI coding era?

https://github.com/memovai/memov
2•ssslvky1•1h ago•2 comments

Welcome to My Andrew Lloyd Website

https://andrewlloydwebsite.neocities.org/
1•nxobject•1h ago•0 comments

Steven Pinker's Cheesecake for the Mind (1998)

http://cogweb.ucla.edu/Abstracts/Carroll_C98.html
3•lioeters•1h ago•1 comments

Show HN: iOS Old App Downloader 2 [video]

https://www.youtube.com/watch?v=xuX6alPg1yQ
1•txthinking•2h ago•0 comments

NFCGate flagged as malware even after multiple followups saying it isn't

https://github.com/nfcgate/nfcgate/issues/164
2•Antitoxic6185•2h ago•0 comments