frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

The Question she did not ask

https://claudepress.substack.com/p/the-question-she-didnt-ask
1•Paodim•4m ago•1 comments

Show HN: CursedFeed, a social feed where people use spells to mutate next posts

https://cursedfeed.vercel.app/
1•Roccan•4m ago•0 comments

The Five Eras of KVCache

https://www.modular.com/blog/the-five-eras-of-kvcache
1•timmyd•6m ago•0 comments

The $921M Special Interest Machine That Controls California

https://garryslist.org/posts/the-921m-special-interest-machine-that-controls-california
2•rahimnathwani•9m ago•0 comments

Independent analysis of AI: AI landscape to choose the best model and provider

https://artificialanalysis.ai
1•teleforce•10m ago•0 comments

How to Survive a Fall Through the Ice

https://www.nytimes.com/2026/02/05/us/fall-through-ice-frozen-water-rescue.html
1•0in•10m ago•0 comments

Show HN: OpenWeavr – Run AI workflows on your own machines to automate tasks

https://github.com/openweavr/Openweavr
1•EmTekker•11m ago•0 comments

Show HN: Graph Maker, a tool to help you create data graphs in seconds with AI

https://www.graph-maker.ai
2•lealee•11m ago•1 comments

Bardo Thodol

https://en.wikipedia.org/wiki/Bardo_Thodol
2•toomuchtodo•11m ago•0 comments

UX Anti-patterns skill: Catch the UX sins Claude ships when you're not looking

https://github.com/cassiozen/UX-antipatterns
1•cacozen•13m ago•0 comments

GitHub Actions Is Slowly Killing Your Engineering Team

https://www.iankduncan.com/engineering/2026-02-05-github-actions-killing-your-team/
1•codesuki•14m ago•0 comments

Major political upheaval in Japan? Sanae Takaichi may dissolve the Diet

1•jocelyner•15m ago•0 comments

Reddit Lead Generation: The Complete Guide for B2B Companies

https://getvibeddit.com/blog/reddit-lead-generation-guide
1•shenli3514•17m ago•0 comments

GPT-5.3-Codex System Card [pdf]

https://cdn.openai.com/pdf/23eca107-a9b1-4d2c-b156-7deb4fbc697c/GPT-5-3-Codex-System-Card-02.pdf
1•vinhnx•17m ago•0 comments

Claude Opus 4.6 System Card [pdf]

https://www-cdn.anthropic.com/0dd865075ad3132672ee0ab40b05a53f14cf5288.pdf
1•vinhnx•17m ago•0 comments

Onnx2fx: Yet another ONNX to PyTorch FX converter

https://github.com/mshr-h/onnx2fx
1•mshr-h•18m ago•0 comments

What is the best free AI tool for legal advice?

http://www.harrisblog.com/
1•Jane21•19m ago•0 comments

Monty - A minimal, secure Python interpreter written in Rust

https://github.com/pydantic/monty
1•scolvin•19m ago•0 comments

Can I get a six pack?:-)

https://www.youtube.com/watch?v=kQRu7DdTTVA
1•jeffkumar•21m ago•1 comments

'We don't want to end up like the US', ex-Australian Prime Minister Turnbull

https://www.smh.com.au/national/nsw/sydney-summit-live-updates-industry-leaders-politicians-gathe...
3•KnuthIsGod•23m ago•1 comments

Style tips for less experienced developers coding with AI

https://honnibal.dev/blog/llm-style-tips
1•syllogism•23m ago•0 comments

TOS Tracker

https://tostracker.app/
1•tldrthelaw•24m ago•0 comments

knock-knock.net

https://knock-knock.net/
1•indigodaddy•26m ago•0 comments

Australia confirms Bunnings' facial recognition used personal data unlawfully

https://www.oaic.gov.au/news/media-centre/oaic-statement-on-administrative-review-tribunals-bunni...
2•TripleLB•27m ago•0 comments

Show HN: Reader – open-source web scraping engine built for LLMs

https://github.com/vakra-dev/reader
1•nihalwashere•30m ago•0 comments

Stablecoins vs. Tokenized Deposits: The Narrow Banking Debate Revisited

https://fedinprint.org/item/fednsr/102411
1•toomuchtodo•35m ago•1 comments

Llms.txt – A Robots.txt for AI Assistants

https://seekrates-ai.com/llms-txt-file/
1•mohan-AIyer•35m ago•0 comments

College Board Banning Students from Using Smart Glasses During SATs

https://gizmodo.com/the-college-board-is-banning-students-from-using-smart-glasses-during-the-sat...
3•bookofjoe•39m ago•0 comments

LLMs do plan before they genenrate tokens

https://arxiv.org/abs/2502.06258
2•kaaaang•42m ago•0 comments

X07: An agent-first compiled language with JSON AST and deterministic tooling

https://x07lang.org/
1•webodik•46m ago•0 comments