Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!
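
To make the "tools used dynamically" point concrete, here is a rough sketch of how an orchestrator might dispatch named tools. It is not Refact.ai's actual code: the registry, `dispatch`, `search_repo`, and `run_tests` are hypothetical names invented for illustration, and `deep_analysis` is only a local stand-in for the reasoning tool mentioned above.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tool registry. Apart from the name deep_analysis, nothing here
# is taken from the Refact.ai codebase.
TOOLS: Dict[str, Callable[[str], str]] = {}


def tool(name: str):
    """Decorator that registers a function under a tool name."""
    def register(fn: Callable[[str], str]) -> Callable[[str], str]:
        TOOLS[name] = fn
        return fn
    return register


@tool("search_repo")
def search_repo(query: str) -> str:
    # Stand-in for repository exploration.
    return f"files matching {query!r}"


@tool("deep_analysis")
def deep_analysis(question: str) -> str:
    # Stand-in for a call to a separate reasoning model.
    return f"analysis of {question!r}"


@tool("run_tests")
def run_tests(target: str) -> str:
    # Stand-in for executing the project's test suite.
    return f"ran tests for {target!r}"


def dispatch(calls: List[Tuple[str, str]]) -> List[str]:
    """Run the tool calls chosen by the orchestrator, in order."""
    results = []
    for name, arg in calls:
        if name not in TOOLS:
            # Report unknown tools instead of raising, so the run can continue.
            results.append(f"unknown tool: {name}")
            continue
        results.append(TOOLS[name](arg))
    return results
```

In this sketch the orchestrating model would decide which `(tool, argument)` pairs to request on each step; the registry only guarantees that every requested name maps to a callable.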

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
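
The plan, execute, test, self-correct cycle described above could look roughly like the loop below. This is a minimal sketch under assumptions, not the Agent's real implementation: `autonomous_run`, `RunResult`, `propose_patch`, `run_tests`, and `max_steps` are all hypothetical names, and the model and test runner are passed in as plain callables.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class RunResult:
    patch: str          # the last patch produced
    passed: bool        # whether that patch passed the tests
    attempts: int       # how many iterations were used
    log: List[str] = field(default_factory=list)


def autonomous_run(
    propose_patch: Callable[[str], str],
    run_tests: Callable[[str], Tuple[bool, str]],
    max_steps: int = 5,
) -> RunResult:
    """One multi-step run: propose, test, feed failures back, stop on success."""
    log: List[str] = []
    feedback = ""  # empty on the first attempt, test output afterwards
    patch = ""
    for attempt in range(1, max_steps + 1):
        patch = propose_patch(feedback)
        ok, feedback = run_tests(patch)
        log.append(f"attempt {attempt}: {'pass' if ok else 'fail'}")
        if ok:
            return RunResult(patch, True, attempt, log)
    # Step budget exhausted without a passing patch.
    return RunResult(patch, False, max_steps, log)


# Usage with stub callables (hypothetical): the first patch fails, the second passes.
def fake_propose(feedback: str) -> str:
    return "fix-v2" if feedback else "fix-v1"


def fake_tests(patch: str) -> Tuple[bool, str]:
    ok = patch == "fix-v2"
    return ok, "" if ok else "test_x failed"


result = autonomous_run(fake_propose, fake_tests)
```

The loop returns a single patch per task, which mirrors the "one correct solution through iteration" idea; the step budget is an assumption added here so the sketch always terminates.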

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
