frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: AgentBridge – translate and govern calls between AI agent protocols

https://github.com/shadowhunter-92/agentbridge
1•Shadowhunter89•3m ago•0 comments

Demystifying Noise Contrastive Estimation

https://jxmo.io/posts/nce
1•jxmorris12•7m ago•0 comments

Apple Foundation Models

https://platform.claude.com/docs/en/cli-sdks-libraries/libraries/apple-foundation-models
2•MehrdadKhnzd•10m ago•0 comments

AI Agents Don't Need SMS APIs. They Need Infrastructure

https://blog.bridgexapi.io/from-rest-apis-to-mcp-making-messaging-infrastructure-ai-native
2•Bridgexapi•17m ago•0 comments

Fugee, an agentic AI assistant for displaced people and asylum seekers [video]

https://www.youtube.com/watch?v=PYGzJZj7LfM
2•helmo•23m ago•0 comments

'Worst Example of Misconduct': Court Affirms Sanctions for Erroneous AI Cites

https://www.law.com/therecorder/2026/06/12/worst-example-of-misconduct-appellate-court-affirms-sa...
1•1vuio0pswjnm7•26m ago•0 comments

'Strains Credulity': Judge Rejects Meta's Attempt to Dismiss Copyright Suit

https://www.law.com/therecorder/2026/06/12/strains-credulity-judge-rejects-metas-attempt-to-dismi...
1•1vuio0pswjnm7•27m ago•0 comments

Who is the Real Dennis Ritchie? (1991) [pdf]

https://dmrthesis.net/wp-content/uploads/2021/08/BLR-Article-UNIXWorld-Jan1991-A.pdf
2•caned•27m ago•0 comments

A brief tour of the PDP-11, the most influential minicomputer of all time (2022)

https://arstechnica.com/gadgets/2022/03/a-brief-tour-of-the-pdp-11-the-most-influential-minicompu...
2•jensgk•28m ago•0 comments

Ask HN: How can we democratize agentic coding

1•omot•28m ago•0 comments

US Lawmakers Pressing DOJ on fraud, bribery case against Indian Billionaire

https://www.law.com/international-edition/2026/06/12/us-lawmakers-are-pressing-the-doj-on-its-dec...
1•1vuio0pswjnm7•29m ago•1 comments

A Galois Field Arithmetic Primer

https://tomverbeure.github.io/2026/06/14/Galois-Field-Arithmetic-Primer.html
2•matt_d•29m ago•0 comments

Quantum Horizon: Quantum Computing as a Threat to Bitcoin and Ethereum

https://arxiv.org/abs/2606.14484
1•OutOfHere•29m ago•0 comments

Show HN: Prela – A Compositional and Controllable Query Language

https://prela-lang.org
1•remywang•33m ago•0 comments

ArkType: The Parse-Don't-Validate Sequel I Didn't Know I Needed

https://cekrem.github.io/posts/arktype-parse-dont-validate-sequel/
2•jcbhmr•36m ago•0 comments

Found this FREE open source APP which is an absolute GEM

https://github.com/Prithvi-Web/Treemap
1•DaGoat487•36m ago•2 comments

I indexed 936 Lex Fridman episodes into a RAG that cites its sources

https://github.com/aranajhonny/omnipod
1•akatsutki•41m ago•0 comments

Bretisilocin

https://en.wikipedia.org/wiki/Bretisilocin
2•isoprophlex•46m ago•0 comments

Ask HN: What does your local LLM setup looks like?

2•the-mitr•48m ago•0 comments

Stripping KiezelPay (and other network calls) from Pebble Watch Faces

http://luke.abq.nm.us/blog/2026-06-14-Pebble-Watchface-Strip-KiezelPay/
1•sleepyink•53m ago•0 comments

Catjam 2026

https://itch.io/jam/catjam-2026
1•pigeons•54m ago•0 comments

The Tragedy of the Commoner

https://hughhowey.com/the-tragedy-of-the-commoner/
2•bjhess•56m ago•0 comments

Ask HN: Data source used for training Anthropic's Mythos?

2•sandeepkd•56m ago•1 comments

Is using AI in school cheating?

https://www.totalnoise.ai/blog/educationandai/
2•hasmatt•1h ago•0 comments

OpenAI under investigation by group of state attorneys general

https://www.reuters.com/business/openai-under-investigation-by-coalition-state-attorneys-general-...
2•Soumya_Max•1h ago•0 comments

Google sues alleged operators of an AI-powered phishing kit

https://www.reuters.com/legal/government/google-targets-ai-powered-phishing-new-york-lawsuit-2026...
1•Soumya_Max•1h ago•0 comments

Building Software vs. Building a House

2•filup•1h ago•0 comments

Domain-Specific AI for Pharma, Biotech and Medical Professionals

https://www.noah.bio/
1•ilreb•1h ago•0 comments

UK may ban social media for children under 16

https://techcrunch.com/2026/06/14/uk-may-ban-social-media-for-children-under-16/
1•SilverElfin•1h ago•0 comments

Long-Form Video Understanding: Bottlenecks and Design Choices – Part 1

https://yinghonglan.substack.com/p/long-form-video-understanding-bottlenecks
1•rented_mule•1h ago•0 comments