Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
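
To make the plan, execute, test, self-correct cycle described above concrete, here is a minimal Python sketch. It is not Refact.ai's implementation: `run_agent`, `propose_patch`, `run_tests`, and the toy stand-ins below are hypothetical names invented for illustration, and a real agent would call a model and a test runner where the stubs sit.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class RunResult:
    patch: str
    passed: bool
    attempts: int
    log: List[str] = field(default_factory=list)


def run_agent(
    task: str,
    propose_patch: Callable[[str, str], str],        # hypothetical: (task, feedback) -> patch
    run_tests: Callable[[str], Tuple[bool, str]],    # hypothetical: patch -> (passed, feedback)
    max_steps: int = 5,
) -> RunResult:
    """One multi-step run: propose, test, feed failures back, stop on success."""
    feedback = ""
    patch = ""
    log: List[str] = []
    for attempt in range(1, max_steps + 1):
        patch = propose_patch(task, feedback)
        passed, feedback = run_tests(patch)
        log.append(f"attempt {attempt}: {'pass' if passed else 'fail'}")
        if passed:
            return RunResult(patch, True, attempt, log)
    # Step budget exhausted without a passing patch.
    return RunResult(patch, False, max_steps, log)


# Toy stand-ins (assumptions for the demo, not real model or test-runner calls).
def toy_propose(task: str, feedback: str) -> str:
    # The first guess is wrong; any test feedback triggers the corrected patch.
    return "return a + b" if feedback else "return a - b"


def toy_tests(patch: str) -> Tuple[bool, str]:
    ok = patch == "return a + b"
    return ok, ("" if ok else "expected add(2, 3) == 5")


if __name__ == "__main__":
    result = run_agent("fix add()", toy_propose, toy_tests)
    print(result.log)
```

The loop returns a single patch per task, which mirrors the "one multi-step run, one solution" framing; the step budget (`max_steps`) is an assumption added so the sketch always terminates.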

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Questions are welcome! You can also try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

Live Artemis II position tracker

https://issinfo.net/artemis
1•qingcharles•2m ago•1 comments

Quantum computers need fewer resources than thought to break vital encryption

https://arstechnica.com/security/2026/03/new-quantum-computing-advances-heighten-threat-to-ellipt...
2•rickcarlino•9m ago•0 comments

China's "pig semen eyedrop" could help deliver Alzheimer's treatment

https://www.scmp.com/news/china/science/article/3348726/chinas-brain-penetrating-pig-semen-eyedro...
4•nikolay•13m ago•1 comments

Remember Their Names

https://visualizingpalestine.org/visual/end-30-billion-of-us-military-aid-to-israel-green-jobs/
1•euler2100•13m ago•0 comments

Web server ratelimits are a precaution to let me stop worrying

https://utcc.utoronto.ca/~cks/space/blog/web/RatelimitsAreAPrecaution
1•LorenDB•14m ago•0 comments

After Fighting Malware for Decades, Cybersecurity Vet Now Hacking Drones

https://techcrunch.com/2026/04/04/after-fighting-malware-for-decades-this-cybersecurity-veteran-i...
1•yesensm•14m ago•0 comments

How Pope Leo is pushing back on divine justification of war

https://www.cnn.com/2026/04/04/middleeast/pope-leo-iran-war-analysis-latam-intl
1•1659447091•15m ago•0 comments

AGI Is Here

https://breaking-changes.blog/agi-is-here/
5•oakhan3•16m ago•3 comments

Show HN: Yoink functionality from dependencies and avoid supply chain attacks

https://github.com/theogbrand/yoink
2•kstonekuan•18m ago•0 comments

Half of social-science studies fail replication test in years-long project

https://www.nature.com/articles/d41586-026-00955-5
1•prabal97•19m ago•0 comments

The Rise of Worse Is Better

https://dreamsongs.com/RiseOfWorseIsBetter.html
1•kaladin-jasnah•34m ago•0 comments

Show HN: mailtrim – find what's actually filling your Gmail inbox

5•chevuru•38m ago•3 comments

Explore union types in C# 15

https://devblogs.microsoft.com/dotnet/csharp-15-union-types/
4•0x00C0FFEE•38m ago•0 comments

AI Whiz Kids Dropped Out of College and Got Investors to Pay Their Bills

https://www.wsj.com/tech/ai/ai-college-dropouts-ecc665b7
2•lxm•40m ago•0 comments

Mlx-VLM: Fast Local VLMs and Omni Models on Apple Silicon with MLX

https://github.com/Blaizzy/mlx-vlm
2•salkahfi•43m ago•0 comments

Towards end-to-end automation of AI research

https://www.nature.com/articles/s41586-026-10265-5
3•hardmaru•43m ago•0 comments

The Perils of Privatized Cyberwarfare

https://www.lawfaremedia.org/article/the-perils-of-privatized-cyberwarfare
2•gnabgib•43m ago•0 comments

Show HN: Simple Local Meme Generator

https://github.com/KyleTryon/Gemini-Meme-Generator
2•TechSquidTV•48m ago•0 comments

Nppexec

https://github.com/d0vgan/nppexec
2•downboots•49m ago•0 comments

Computer for Taxes

https://www.perplexity.ai/hub/blog/introducing-computer-for-taxes
2•wslh•50m ago•0 comments

Conductor – Durable Execution Engine

https://conductor-oss.github.io/conductor/index.html
3•opiniateddev•51m ago•0 comments

Notes for US Performers in Montreal

https://evanp.me/2026/04/04/notes-for-us-performers-in-montreal/
2•decimalenough•51m ago•0 comments

Meta-Harness: End-to-End Optimization of Model Harnesses

https://arxiv.org/abs/2603.28052
2•kstonekuan•56m ago•0 comments

Karpathy's knowledge base matches our Grep-is-All-You-Need paper

https://www.localkin.dev/papers/grep-is-all-you-need
3•localkin•1h ago•0 comments

We Made Technology Easy to Use. That Was a Mistake

https://slate.com/technology/2026/04/usability-complexity-apple-iphone-facebook-donald-norman.html
2•kawera•1h ago•0 comments

The Awake "Sleep" Loop: Why Attention Lapses Occur in ADHD

https://neurosciencenews.com/adhd-attention-sleep-activity-30324/
3•ivewonyoung•1h ago•0 comments

Show HN: Signals – finding the most informative agent traces without LLM judges

https://arxiv.org/abs/2604.00356
3•sparacha•1h ago•0 comments

I built a $19.99 flat-fee EU261 flight compensation letter generator

https://www.sovereign-suite.com/
2•oneprofiledev•1h ago•0 comments

The Crazy Nastyass Honey Badger (2011)

https://www.youtube.com/watch?v=4r7wHMg5Yjg
2•kaycebasques•1h ago•0 comments

Beat Cancer Off

https://beatcanceroff.com/
3•cebert•1h ago•0 comments