frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

The Audience Nobody Saw

https://fromthelittoral.substack.com/p/the-audience-nobody-saw
1•MrVandemar•32s ago•0 comments

Nvidia releases CUDA-Oxide 0.1 for experimental Rust-to-CUDA compiler

https://www.phoronix.com/news/NVIDIA-CUDA-Oxide-0.1
1•birdculture•3m ago•0 comments

Job

1•askb_coder•4m ago•0 comments

Musk vs. Altman week 2: OpenAI fires back, and Shivon Zilis reveals that Musk tr

https://www.technologyreview.com/2026/05/08/1137008/musk-v-altman-week-2-openai-fires-back-and-sh...
1•joozio•5m ago•0 comments

GpxFix – A tool to repair recordings of outdoor activities

https://www.gpxfix.eu/
1•taccp•6m ago•0 comments

LLMs are underutilized due to sub optimal management

https://alexzhang13.github.io/blog/2026/mgh/
1•melonmars•8m ago•0 comments

Show HN: DuoSolve – Daily grammer practice game

https://duobook.co/duosolve
1•celltalk•11m ago•0 comments

Vladimir Putin is losing his grip on Russia

https://www.economist.com/by-invitation/2026/05/06/vladimir-putin-is-losing-his-grip-on-russia
2•bazzmt•11m ago•1 comments

Practice reviewing risky AI-generated engineering output

https://www.proreview.dev/
1•shaad1337•13m ago•0 comments

Your Computer Doesn't Belong to You Anymore

https://aquisthoughts.substack.com/p/your-computer-doesnt-belong-to-you
2•ethanplant•16m ago•0 comments

Show HN: Memory Vault – local-first memory, hybrid search, knowledge graph

https://github.com/MihaiBuilds/memory-vault
1•mihaibuilds•20m ago•0 comments

Show HN: Aptmatic – a TUI for managing apt across a bunch of Debian boxes

https://crates.io/crates/aptmatic
1•growse•20m ago•0 comments

Kanvly – notes and boards with AI, now on iOS

https://kanvly.com
1•trotskomain•27m ago•0 comments

Lua as a practical "soft-bedrock" language

https://portal.mozz.us/gemini/zaibatsu.circumlunar.space/~solderpunk/gemlog/lua-as-a-practical-so...
2•karl42•31m ago•2 comments

Agentwerk: A minimal Rust crate for agentic apps

https://github.com/canvascomputing/agentwerk
1•schirrmacher•36m ago•0 comments

The Chiplet Illusion: New Moore's Law or the Most Expensive Cover-Up?

https://sourceryintel.com/reports/the-chiplet-illusion
1•freakynit•36m ago•0 comments

Show HN: Transformer Math Explorer

https://simonramstedt.com/tools/transformer/
2•rmst•38m ago•1 comments

Solar on canals reduces water evaporation by 70% and algae growth by 85%

https://www.pv-magazine.com/2026/05/04/solar-on-canals-reduces-water-evaporation-by-70-and-algae-...
4•ndr42•38m ago•1 comments

Free guided journaling during the Mental Health Awareness Month

https://journal.cubitoo.com/en
2•pawelkomarnicki•41m ago•1 comments

The Cost of Downsizing Social Security

https://www.newyorker.com/news/deep-state-diaries/the-real-cost-of-downsizing-social-security
1•littlexsparkee•45m ago•1 comments

Experimental Rust-to-CUDA Compiler

https://github.com/NVlabs/cuda-oxide
1•cgravill•46m ago•1 comments

Goodbye Slack

https://ano.chat
1•bill-cupid•46m ago•0 comments

The World Inside Neural Networks

https://www.goodfire.ai/research/the-world-inside-neural-networks
3•wsgeorge•48m ago•0 comments

Terax – a 7mb AI terminal in Rust and Tauri [video]

https://www.youtube.com/watch?v=kykgXa7sm1g
3•netten•48m ago•0 comments

EU ban on Chinese inverters sparks strong response from Beijing

https://www.pv-magazine.com/2026/05/08/eu-ban-on-chinese-inverters-sparks-strong-response-from-be...
1•ndr42•51m ago•0 comments

Bun's experimental Rust rewrite hits 99.8% test compatibility on Linux x64 glibc

https://twitter.com/jarredsumner/status/2053047748191232310
5•heldrida•53m ago•2 comments

Steering Zig Fmt

https://matklad.github.io/2026/05/08/steering-zig-fmt.html
1•mpweiher•54m ago•0 comments

The troubled quest for tasty vegan cheese

https://www.economist.com/interactive/1843/2026/05/08/grate-expectations-the-troubled-quest-for-t...
1•pingou•55m ago•1 comments

Chrome's AI features may be hogging 4GB of your computer storage

https://www.theverge.com/tech/924933/google-chrome-4gb-gemini-nano-ai-features
2•elemar•59m ago•1 comments

Show HN: Cheapshot – GPS-based mobile game

https://cheapshot.co/
1•pakenrol•1h ago•3 comments