frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

AI isn't killing jobs, it's 'unbundling' them into lower-paid chunks

https://www.theregister.com/2026/03/24/ai_job_unbundling/
1•gnabgib•2m ago•0 comments

Para-Academic Techno-Philosophy

https://elftheory.substack.com/p/para-academic-techno-philosophy
1•lentoutcry•2m ago•0 comments

Generating one token at a time is a blessing in disguise

https://kachkach.com/blog/generating-one-token-at-a-time-is-a-blessing-in-disguise
1•halflings•4m ago•1 comments

The Acceleration of Addictiveness (2010)

https://paulgraham.com/addiction.html
1•microsoftedging•5m ago•0 comments

Show HN: OpsScaleIQ – The operational intelligence OS for franchise operators

https://opsscaleiq.com
1•dsptl•5m ago•0 comments

Personal story: BR airlines sites sucks. Struggling to cancel seat selection

https://blog.thisago.com/story/20260329-cancellingFlightSeatSelection.txt
1•thisago•6m ago•0 comments

Show HN: Tabical – Tinder-style city micro-itineraries, personalized by swipe

https://tabical.com/
1•akhilpotturi•7m ago•0 comments

Hundreds of strangers flock to San Francisco beach to dig a really big hole

https://www.sfgate.com/sf-culture/article/hundreds-strangers-flock-sf-beach-dig-really-big-221583...
1•Stratoscope•8m ago•0 comments

Ask HN: What is TensorFlow still good for now?

1•asxndu•10m ago•1 comments

What category theory teaches us about dataframes

https://mchav.github.io/what-category-theory-teaches-us-about-dataframes/
2•fanf2•12m ago•0 comments

Show HN: Crazierl – An Erlang Operating System

https://crazierl.org/demo/
3•toast0•15m ago•1 comments

The Agentic Passive Voice

https://lethain.com/agentic-passive-voice/
1•jbernardo95•16m ago•0 comments

AI on deck: assessing impact of MLB's new ball-strike system

https://news.cornell.edu/stories/2026/03/ai-deck-assessing-impact-mlbs-new-ball-strike-system
1•rmason•16m ago•0 comments

Magellan: AI agents for autonomous cross-disciplinary scientific discovery

https://github.com/kakashi-ventures/magellan-cli
1•ameft•16m ago•1 comments

An uncatchable CoreML crash: MLIR compiler failures on the iPhone SE 2

https://medium.com/@wagaodongo/the-uncatchable-crash-why-my-coreml-app-works-on-every-iphone-exce...
1•volvogradSaint•20m ago•1 comments

The road signs that teach travellers about France

https://www.bbc.com/travel/article/20260327-the-road-signs-that-teach-travellers-about-france
1•1659447091•24m ago•0 comments

Cleveland Clinic and IBM debut new quantum simulation workflow

https://www.ibm.com/quantum/blog/cleveland-clinic-protein-qcsc
1•rbanffy•26m ago•0 comments

Visual reasoning benchmark based on Analog Clocks

https://clockbench.ai/
1•yrds96•26m ago•0 comments

How the Media Is Failing to Hold Iran Accountable for War Crimes

https://www.camera.org/article/how-the-media-is-failing-to-hold-iran-accountable-for-war-crimes/
3•mhb•29m ago•1 comments

Secure Proxy Manager

https://fabriziosalmi.github.io/secure-proxy-manager/
1•TheTaytay•29m ago•0 comments

Zombie Netscape Won't Die

https://hackaday.com/2026/01/27/zombie-netscape-wont-die/
2•riffic•30m ago•0 comments

Toxic PFAS residue identified on 37% of California produce, new analysis finds

https://www.theguardian.com/environment/2026/mar/29/pfas-residue-california-produce-analysis
5•bookofjoe•33m ago•2 comments

ChatGPT Won't Let You Type Until Cloudflare Reads Your React State

https://www.buchodi.com/chatgpt-wont-let-you-type-until-cloudflare-reads-your-react-state-i-decry...
36•alberto-m•33m ago•7 comments

Aldus PageMaker on the Apple Macintosh

https://stonetools.ghost.io/pagemaker-mac/
2•rbanffy•36m ago•0 comments

Phantom of Heilbronn

https://en.wikipedia.org/wiki/Phantom_of_Heilbronn
2•omnibrain•36m ago•0 comments

Circumstantial Complexity, LLMs and Large Scale Architecture

https://datagubbe.se/aiarch/
3•rbanffy•37m ago•0 comments

AI drives companies to reorganize around loops, not functions

https://blog.sshh.io/p/the-transposed-organization
1•sshh12•39m ago•0 comments

Show HN: Nativly –Add One DNS record to rank your site in 60 languages on Google

https://www.nativly.app/
1•Francis221•41m ago•2 comments

Apple forces on-device age verification in UK release of iOS 26.4

https://support.apple.com/en-gb/126788
3•cdinu•42m ago•1 comments

Show HN: Beval – Simple evaluations for your AI product

https://www.beval.space/
1•raviisoccupied•44m ago•0 comments