frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

What Bun Can Tell Us About AI, Open Source and Anthropic

https://redmonk.com/sogrady/2026/06/04/bun-two-lessons/
1•mkeeter•30s ago•0 comments

Good

https://www.quarter--mile.com/Really-Good
1•surprisetalk•56s ago•0 comments

Sangam Tamil literature search engine with AI explanations

http://sangam.tamilnlp.com/mp/json/ai.html
1•laxmena•1m ago•0 comments

First clinical pregnancy following AI-based sperm detection and recovery

https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(25)01623-X/fulltext
1•smusamashah•1m ago•0 comments

Garmin: The more miles ridden, the higher avg VO2 max and FTP

https://www.garmin.com/en-US/blog/fitness/the-beat-on-bikes-the-latest-global-trends-from-garmin-...
1•zdkaster•2m ago•0 comments

Cargo-Auditable Support on Ubuntu

https://documentation.ubuntu.com/project/contributors/language-specific/rust/cargo-auditable/
1•petrakat•2m ago•0 comments

Germany: Lufthansa plane suffers nose gear collapse

https://www.dw.com/en/germany-lufthansa-plane-suffers-nose-gear-collapse/a-77423104
1•N19PEDL2•3m ago•0 comments

Reddit Banned Me Without a Goodbye Note. Here's What I Did Wrong

https://medium.com/@lior_34997/reddit-banned-me-without-a-goodbye-note-heres-what-i-did-wrong-602...
2•liorlb•4m ago•1 comments

Uber's commitment to self-driving startup Nuro is close to $500M

https://www.reuters.com/business/autos-transportation/ubers-commitment-self-driving-startup-nuro-...
1•JumpCrisscross•4m ago•0 comments

Top AI CEOs Call for Law Protecting Against Biological Weapons

https://www.wsj.com/politics/policy/top-ai-ceos-call-for-law-protecting-against-biological-weapon...
1•wrsh07•4m ago•0 comments

It's Not Money or Success: Harvard's Longest Study Reveals the Key to Good Life

https://spacedaily.com/m-harvard-good-life-study-strongest-finding/
1•maxloh•5m ago•0 comments

Agent Skill for TDD

https://www.saturnci.com/my-agent-skill-for-test-driven-development.html
1•laxmena•5m ago•0 comments

Brazil's high-tech voting system is losing voters' trust

https://www.economist.com/the-americas/2026/05/31/brazils-high-tech-voting-system-is-losing-voter...
1•edward•7m ago•0 comments

Let us filter AI slop, you cowards

https://www.theverge.com/ai-artificial-intelligence/942909/let-us-filter-ai-slop-google-youtube-m...
1•stalfosknight•7m ago•0 comments

EU should expand to 40 states – including Canada

https://www.cnbc.com/2026/06/04/finland-stubb-eu-canada-turkey-norway.html
3•leopoldj•7m ago•0 comments

Show HN: Creating SQL queries with decision trees

https://inversql.rentruewang.com
1•renchuw•9m ago•1 comments

Rewiring software delivery for the agentic era

https://www.mckinsey.com/capabilities/technology/our-insights/rewiring-software-delivery-for-the-...
1•taubek•10m ago•0 comments

EU Parliament switches to Qwant search engine from Google in sovereignty push

https://www.reuters.com/business/eu-parliament-switch-french-search-engine-google-tech-sovereignt...
1•rvnx•11m ago•2 comments

microui+fenster=small gui (2024)

https://bernsteinbear.com/blog/fenster-microui/
1•tosh•15m ago•0 comments

Yarle – The ultimate converter of Evernote notes to Markdown

https://github.com/akosbalasko/yarle
1•amai•15m ago•0 comments

NimbusDB – Native macOS DB Manager for CloudKit, Supabase, Firebase and Appwrite

https://apps.apple.com/us/app/nimbusdb/id6769177806?mt=12
1•anton__dev•15m ago•0 comments

Sherpa missing for a week on Mount Everest is rescued

https://www.nbcnews.com/world/asia/sherpa-missing-week-mount-everest-no-food-oxygen-rescued-crawl...
2•gscott•18m ago•0 comments

Show HN: Will It Fit? – Opinionated Normal People Llama.cpp VRAM Estimator

https://hypfer.github.io/will-it-fit-llama-cpp/
1•hypfer•18m ago•1 comments

Oregon gets its first California condor visit in 122 years

https://www.opb.org/article/2026/06/02/oregon-first-california-condor-visit-122-years/
1•speckx•19m ago•0 comments

Teradata CEO to staff: You're not getting a raise. We're spending on AI instead

https://www.businessinsider.com/teradata-pauses-raises-employee-compensation-ai-budget-2026-6
2•healsdata•19m ago•0 comments

The Secret Life of Circuits with lcamtuf / Michał Zalewski (Audio Interview)

https://theamphour.com/725-the-secret-life-of-circuits-with-lcamtuf-michal-zalewski/
2•ChrisGammell•20m ago•1 comments

Faiss: Billion-Scale Similarity Search

https://fremaconsulting.ch/blog/faiss
1•tohms•21m ago•1 comments

US Marine Corps retires the first fighter jet that didn't need a runway

https://www.cnn.com/2026/06/04/us/us-marine-corps-harrier-jump-jet-retirement-intl-hnk-ml
2•everybodyknows•23m ago•3 comments

Get Started with Meko: Agent Memory with Built-In Discernment

https://www.yugabyte.com/blog/meko-agent-memory-with-built-in-discernment/
1•3littlefish•24m ago•0 comments

Breaking Changes in APIs: How to Detect and Prevent Them

https://apiguard.co/blog/openapi-breaking-changes
1•mkhorasani•24m ago•0 comments