Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
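
To make the plan / execute / test / self-correct loop a bit more concrete, here is a minimal sketch of what a single multi-step run could look like. This is not Refact.ai's implementation: `AgentRun`, `scripted_policy`, the tool names, and the toy `calc.py` repository are all hypothetical, and the orchestrator model is replaced by a hard-coded policy so the example runs on its own.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration; none of these names come from Refact.ai.
Action = Tuple[str, str]          # (tool name, argument)
Step = Tuple[Action, str]         # (action taken, tool result)


@dataclass
class AgentRun:
    """One multi-step run: pick a tool, call it, record the result, repeat."""
    tools: Dict[str, Callable[[str], str]]
    choose_action: Callable[[List[Step]], Action]  # stand-in for the orchestrator
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self) -> str:
        for _ in range(self.max_steps):
            tool, arg = self.choose_action(self.history)
            if tool == "finish":
                return arg  # the single solution delivered by this run
            result = self.tools[tool](arg)
            self.history.append(((tool, arg), result))
        raise RuntimeError("step budget exhausted without a solution")


def demo() -> Tuple[str, int]:
    # Toy "repository" with one buggy file.
    repo = {"calc.py": "def add(a, b): return a - b"}

    def read_file(path: str) -> str:
        return repo[path]

    def apply_patch(new_source: str) -> str:
        repo["calc.py"] = new_source
        return "patched"

    def run_tests(_: str) -> str:
        scope: dict = {}
        exec(repo["calc.py"], scope)
        return "pass" if scope["add"](2, 3) == 5 else "fail"

    def scripted_policy(history: List[Step]) -> Action:
        # A real agent would ask a model here; this fixed policy only
        # mimics explore -> test -> fix -> re-test -> deliver.
        if not history:
            return ("read_file", "calc.py")
        (last_tool, _), last_result = history[-1]
        if last_tool == "read_file":
            return ("run_tests", "")
        if last_tool == "run_tests" and last_result == "fail":
            return ("apply_patch", "def add(a, b): return a + b")
        if last_tool == "apply_patch":
            return ("run_tests", "")
        return ("finish", repo["calc.py"])

    agent = AgentRun(
        tools={"read_file": read_file, "apply_patch": apply_patch, "run_tests": run_tests},
        choose_action=scripted_policy,
    )
    solution = agent.run()
    return solution, len(agent.history)


if __name__ == "__main__":
    print(demo())
```

The point of the sketch is only the control flow: tools are chosen from the accumulated history rather than from a fixed script, and the run ends when the tests pass and one solution is returned.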

You can read the technical details of our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
