Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
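
To make the plan / execute / test / self-correct cycle concrete, here is a minimal sketch of what such a loop could look like. This is not Refact.ai's actual implementation: every name in it (`run_agent`, `AgentState`, `fake_orchestrator`, `make_tools`) is hypothetical, and the orchestrator is a hard-coded stub standing in for a model call.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical types for illustration only; not Refact.ai's API.
Tool = Callable[[str], str]
Action = Tuple[str, str]  # (tool name, tool argument)


@dataclass
class AgentState:
    task: str
    # Each entry is (tool name, argument, tool result).
    history: List[Tuple[str, str, str]] = field(default_factory=list)


def run_agent(
    task: str,
    orchestrator: Callable[[AgentState], Action],
    tools: Dict[str, Tool],
    max_steps: int = 10,
) -> Optional[str]:
    """Run one multi-step attempt; return the final answer or None."""
    state = AgentState(task)
    for _ in range(max_steps):
        name, arg = orchestrator(state)  # the orchestrator picks the next action
        if name == "finish":
            return arg
        tool = tools.get(name)
        result = tool(arg) if tool else f"unknown tool: {name}"
        state.history.append((name, arg, result))
    return None  # step budget exhausted without a solution


def fake_orchestrator(state: AgentState) -> Action:
    """Stub policy: explore, patch, test, and re-patch after a failed test."""
    if not state.history:
        return ("explore", "src/")
    last_name, _, last_result = state.history[-1]
    patches = [arg for name, arg, _ in state.history if name == "patch"]
    if last_name == "explore":
        return ("patch", "attempt-1")
    if last_name == "patch":
        return ("test", "")
    if last_name == "test" and last_result == "pass":
        return ("finish", patches[-1])
    # The test failed: self-correct with a new patch.
    return ("patch", f"attempt-{len(patches) + 1}")


def make_tools() -> Dict[str, Tool]:
    """Toy tools: the 'test' tool passes only for the second patch."""
    applied = {"patch": ""}

    def explore(path: str) -> str:
        return f"listed {path}"

    def patch(name: str) -> str:
        applied["patch"] = name
        return f"applied {name}"

    def test(_: str) -> str:
        return "pass" if applied["patch"] == "attempt-2" else "fail"

    return {"explore": explore, "patch": patch, "test": test}


if __name__ == "__main__":
    print(run_agent("fix the bug", fake_orchestrator, make_tools()))
```

In this toy run the first patch fails its test, the stub orchestrator issues a second patch, and the loop ends with a single final answer once the test passes. A real agent would replace the stub with model calls and real repository tools.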

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
