frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Myth of the Monolithic ERP: Why They Keep Failing [video]

https://www.youtube.com/watch?v=o6d94HNGV1s
1•rossdavidh•3m ago•0 comments

An A.I. Startup Says It Wants to Empower Workers, Not Replace Them

https://www.nytimes.com/2026/01/20/technology/humans-ai-anthropic-xai.html
1•bookofjoe•6m ago•1 comments

Testosterone went from prostate cancer villain to potential ally

https://theconversation.com/how-testosterone-went-from-prostate-cancer-villain-to-potential-ally-...
1•PaulHoule•7m ago•0 comments

Flashlabs releases the world’s first open-source voice cloning model

https://twitter.com/flashlabsdotai/status/2013993446047158550
1•sangwen•10m ago•0 comments

Show HN: iMessage-data-foundry – Synthetic iMessage Data Generator

https://github.com/johnlarkin1/imessage-data-foundry
2•jlarks32•12m ago•0 comments

Open4D – Open-Source 4D Geometry Processing, Compression and Streaming Library

https://github.com/SINRG-Lab/Open4D
1•hex823•14m ago•1 comments

Palantir CEO: With AI, economies won't need immigration

https://www.theregister.com/2026/01/21/palantir_ceo_karp_claims_ai/
1•abdelhousni•14m ago•0 comments

GPTZero finds 100 new hallucinations in NeurIPS 2025 accepted papers

https://gptzero.me/news/neurips/
1•dnw•14m ago•0 comments

MsgBored, Screaming into the Abyss

https://johntrager.net/projects/msg-bored/
2•jtrager•15m ago•0 comments

AI recruiters: faster, cheaper, and still clueless

https://pksunkara.com/thoughts/ai-recruiters-faster-cheaper-and-still-clueless/
1•pksunkara•15m ago•0 comments

Explore the Mandelbrot Set

https://math.hws.edu/eck/js/mandelbrot/MB.html
1•mooreds•16m ago•0 comments

Summary paper on the STAR-Vote system [pdf]

https://www.cs.rice.edu/~dwallach/pub/star-summative-2018.pdf
1•thechao•24m ago•0 comments

FCC: Late-night and daytime talk shows must offer equal time for candidates

https://www.nbcnews.com/politics/elections/fcc-late-night-daytime-talk-shows-equal-time-candidate...
1•ceejayoz•24m ago•0 comments

The divergence of centralized systems and individual agency

3•Kiplomat-SouCmp•27m ago•3 comments

Ukraine Holds Off on New Helsing Drone Orders After Setbacks

https://www.bloomberg.com/news/articles/2026-01-19/ukraine-holds-off-on-new-helsing-drone-orders-...
4•doener•28m ago•1 comments

Basic TTS – fast, free, and easy-to-use online text-to-speech tool

https://basictts.com/
2•sea-gold•28m ago•1 comments

Cyclic Subgroup Sum

https://m-slee.netlify.app/posts/cyclic-subgroup-sum
1•richard_chase•32m ago•1 comments

Being a tourist, can I film police in the US?

https://travel.stackexchange.com/questions/58566/being-a-tourist-can-i-film-police-in-the-us
2•beatthatflight•32m ago•2 comments

Show HN: HinkyPunk VPN

https://github.com/canaanmckenzie/HinkyPunk
2•prince_nez•33m ago•0 comments

List of Sales Closing Techniques

https://talnet.co/social/3KqvEuvb4p
1•bouia•35m ago•0 comments

Easy Delegation with C# Source Generators Library

https://davidvedvick.info/notes/2026/01/21/source-generators-easy-delegation-c-sharp
1•whoisthemachine•35m ago•0 comments

Tried Implementing DeepSeek's MHC

https://github.com/enochyearn/mhc-vs-resnet-mlx
1•enochyearn•36m ago•1 comments

Self-Hosting Discourse Just Got a Whole Lot Easier

https://meta.discourse.org/t/self-hosting-discourse-just-got-a-whole-lot-easier/393915
2•Curiositry•37m ago•0 comments

Cipher Copy – a fast, privacy-first text encoding tool (runs client-side)

https://cipher-06d59a47.base44.app/Encode
1•jasoncbuckley•38m ago•1 comments

Show HN: A fast, ad-free wiki for Where Winds Meet (built with Astro)

https://wwm-db.com/en/
1•causalzap•41m ago•0 comments

Bluementhals letter about ICE memo justifying entry into homes without warrant [pdf]

https://www.hsgac.senate.gov/wp-content/uploads/2026-01-21-Letter-from-Blumenthal-to-DHS-ICE.pdf
21•rawgabbit•44m ago•4 comments

Ask HN: How do you audit autonomous AI agent decisions?

1•credentum•45m ago•1 comments

AI Design Field Guide

https://www.aidesignfieldguide.com/
1•samuel246•46m ago•0 comments

Ask HN: Anyone seeing copy/paste reliability issues in ChatGPT Web on macOS?

1•sallyrideauto•48m ago•0 comments

What I Mean by "Dream Team"

https://theproductmindedqa.com/on-dream-teams/
1•sabdelrahman•49m ago•0 comments