frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Encrypt Your .env in a Meme

https://github.com/thoughtlesslabs/memevault
1•thoughtlesslabs•3m ago•1 comments

Communication as a Product

https://ashgaikwad.substack.com/p/communication-as-a-product
3•ashgkwd•3m ago•0 comments

Key US Power Grid [PJM] Cuts Demand Outlook on Overstated AI Boom

https://www.bloomberg.com/news/articles/2026-01-14/biggest-us-power-grid-cuts-demand-outlook-on-o...
2•toomuchtodo•5m ago•0 comments

Social media time does not increase teenagers' mental health problems – study

https://www.theguardian.com/media/2026/jan/14/social-media-time-does-not-increase-teenagers-menta...
1•sien•8m ago•0 comments

The Inelastic Markets Hypothesis [pdf]

https://r.jordan.im/download/investing/gabaix2020.pdf
1•luu•13m ago•0 comments

The Single-Click Microsoft Copilot Attack That Silently Steals Personal Data

https://www.varonis.com/blog/reprompt
2•extesy•18m ago•0 comments

DeepSeek's technical papers show frontier innovation

https://www.scmp.com/tech/tech-trends/article/3339769/deepseek-stays-mum-next-ai-model-release-te...
2•nsoonhui•23m ago•0 comments

"Don't worry. Boys are hard to find." Trump/Epstein and... Criminal Enterprises

https://lisevoldeng.substack.com/p/dont-worry-boys-are-hard-to-find
3•Tadpole9181•23m ago•0 comments

We built a browser with GPT-5.2 in Cursor

https://xcancel.com/mntruell/status/2011562190286045552?s=20
2•aaraujo002•25m ago•0 comments

Show HN: Commosta – marketplace to share computing resources

https://www.commosta.io/
1•gkm25•25m ago•0 comments

JSON Render

https://json-render.dev/
1•handfuloflight•27m ago•0 comments

Show HN: IMSAI/Altair inspired microcomputer with web emulator

https://gzalo.github.io/microcomputer/
1•gzalo•28m ago•0 comments

Skillshare: Sync skills to all your AI CLI tools with one command

https://github.com/runkids/skillshare
1•handfuloflight•34m ago•0 comments

Show HN: Chklst – A Minimalist Checklist

https://www.chklst.xyz/
2•rgbjoy•37m ago•0 comments

Opinion: Why tech leaders can't regulate AI before releasing them?

1•lauraorchid•38m ago•1 comments

Vibe Coding Paradox

https://blog.kaplich.me/vibe-coding-paradox/
1•skaplich•38m ago•0 comments

Show HN: I built a satellite forensic engine to detect fraud in Carbon Markets

1•kccanarch•38m ago•1 comments

Google is shutting down the Tenor API

https://www.reddit.com/r/webdev/s/ZjlFO8kiW4
1•kull•38m ago•2 comments

Bubblewrap: A nimble way to prevent agents from accessing your .env files

https://patrickmccanna.net/a-better-way-to-limit-claude-code-and-other-coding-agents-access-to-se...
2•0o_MrPatrick_o0•40m ago•0 comments

Is passive investment inflating a stockmarket bubble?

https://www.economist.com/finance-and-economics/2026/01/14/is-passive-investment-inflating-a-stoc...
24•andsoitis•41m ago•28 comments

I beat Factorio on 1k Floppy disks [video]

https://www.youtube.com/watch?v=cTPBGZcTRqo
1•simonpure•42m ago•1 comments

ISS astronauts return to Earth early due to illness of crew member

https://www.cbc.ca/news/science/nasa-crew11-early-return-9.7045315?cmp=rss
2•gnabgib•44m ago•0 comments

2025 Berggruen Prize Essay Competition Winners

https://berggruen.org/eu/news/2025-berggruen-prize-essay-competition-winners
2•i7l•44m ago•0 comments

AgentDiscover Scanner – Multi-layer AI agent detection (code, network, K8s eBPF)

https://github.com/Defend-AI-Tech-Inc/agent-discover-scanner
1•DefendAI•44m ago•0 comments

Skrillex Releases Kora

https://skrlx.com/
3•Lucasoato•50m ago•0 comments

Kutt.ai – Free AI Video Generator, Text and Image to Video

https://kutt.ai/
2•zuoning•51m ago•2 comments

Personal Intelligence: Connecting Gemini to Google Apps

https://blog.google/innovation-and-ai/products/gemini-app/personal-intelligence/
1•simonpure•52m ago•1 comments

Mapping Nostr keys to DNS-based internet identifiers

https://github.com/nostr-protocol/nips/blob/master/05.md
1•gjvc•59m ago•0 comments

WAPlus' Guide to WhatsApp CRM

https://waplus.io/blog/whatsapp-crm
2•bocaiconnie•1h ago•1 comments

Verizon Is Down

https://www.macrumors.com/2026/01/14/verizon-is-down-iphone-sos/
7•vapemaster•1h ago•4 comments