Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
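
To make the plan / execute / test / self-correct cycle concrete, here is a minimal toy sketch in Python. It is not Refact.ai's implementation: `AgentRun`, `scripted_policy`, and the three tools are hypothetical names invented for illustration, and the scripted policy only stands in for an orchestrating model that would pick tools dynamically.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration; none of these names come from Refact.ai.
Tool = Callable[[dict], str]
Policy = Callable[[List[Tuple[str, str]]], str]


@dataclass
class AgentRun:
    tools: Dict[str, Tool]
    policy: Policy  # stand-in for the orchestrating model
    max_steps: int = 10
    history: List[Tuple[str, str]] = field(default_factory=list)

    def run(self, state: dict) -> bool:
        """Pick a tool, run it, record the observation; stop once tests pass."""
        for _ in range(self.max_steps):
            name = self.policy(self.history)
            observation = self.tools[name](state)
            self.history.append((name, observation))
            if name == "run_tests" and observation == "pass":
                return True
        return False  # step budget exhausted without a passing test run


# Toy tools operating on a shared state dict (assumed, not real Refact tools).
def explore(state: dict) -> str:
    return "found suspicious function"


def edit(state: dict) -> str:
    state["fixed"] = True
    return "patched"


def run_tests(state: dict) -> str:
    return "pass" if state.get("fixed") else "fail"


TOOLS: Dict[str, Tool] = {"explore": explore, "edit": edit, "run_tests": run_tests}


def scripted_policy(history: List[Tuple[str, str]]) -> str:
    """Deterministic stand-in: explore first, edit after a failing test run,
    and re-run the tests after every other step."""
    if not history:
        return "explore"
    last_name, last_obs = history[-1]
    if last_name == "run_tests" and last_obs == "fail":
        return "edit"  # self-correct after a failing test run
    return "run_tests"


if __name__ == "__main__":
    agent = AgentRun(tools=TOOLS, policy=scripted_policy)
    print(agent.run({}), [name for name, _ in agent.history])
```

In this toy run the loop explores, sees the tests fail, patches, and re-tests, all within a single multi-step run and a fixed step budget.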

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai