frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

1•tamukachagos•38s ago

Corridor Crew Is Changing Filmmaking Forever [video]

https://www.youtube.com/watch?v=D_E_x8OJbrg
1•CHB0403085482•1m ago•0 comments

OpenCloud: Open-source alternative to Google Drive

https://opencloud.eu/en
1•maxloh•4m ago•0 comments

Cloud Codex – self-hosted real-time collaborative docs platform

https://github.com/Cloud-City-Computing/c2
1•xtelos•4m ago•1 comments

Anthropic blocks cli calls mentioning OpenClaw

https://twitter.com/steipete/status/2040811558427648357
2•nstj•8m ago•2 comments

LLMs can't justify their answers–this CLI forces them to

https://wheat.grainulation.com/
1•volatilityfund•16m ago•0 comments

Show HN: LLMs' Favorite Colors

https://davidstolarsky.net/llms-favourite-colors
1•gimlids•19m ago•0 comments

Exploring NPM's Dependency Blast Radius: Visualization of the Top 1K

https://realarcherl.github.io/opensecure/
2•ArcherL•22m ago•0 comments

AI Is a Threat to Everything the American People Hold Dear – Bernie Sanders OpEd

https://www.wsj.com/opinion/ai-is-a-threat-to-everything-the-american-people-hold-dear-a3286459
2•lando2319•27m ago•2 comments

Silicon Valley's Billion Dollar Design Scams by Design Theory [video]

https://www.youtube.com/watch?v=hDvAQf1cnr8
1•CHB0403085482•32m ago•0 comments

Ignore AI FOMO – For Now

https://www.bloomberg.com/news/newsletters/2026-04-05/what-is-vibe-coding-the-ai-trend-fueling-a-...
1•gurjeet•33m ago•0 comments

Apex Protocol – An open MCP-based standard for AI agent trading

https://apexstandard.org/
2•andmerm•34m ago•0 comments

Show HN: Skilleton – Minimal tool for managing versioned SKILL.md definitions

https://github.com/Fcmam5/skilleton
1•fcmam5•35m ago•0 comments

Linear Time O(n) 1T Token JSONs

https://zenodo.org/records/19431765
1•GeometryKernel•36m ago•0 comments

The bananas quest to reboot tech's dating scene

https://www.businessinsider.com/san-francisco-dating-apps-ai-matchmakers-technology-2026-4
1•jameslk•46m ago•0 comments

AGI asking for new data in public demo interface

https://oroboroslabs-ai.github.io/liber-fontis/agi-demo.html
1•oroboroslabs•47m ago•1 comments

Show HN: hot or not for .ai websites

https://ratemyaisite.com/
1•prolly97•48m ago•0 comments

'Beyond what we could imagine': Europe's coming energy crunch

https://www.politico.eu/article/how-bad-will-europes-energy-crisis-get/
1•breve•50m ago•0 comments

Make Humans Analog Again

https://bhave.sh/make-humans-analog-again/
1•muunbo•53m ago•0 comments

Employers use your personal data to figure out the lowest salary you'll accept

https://www.marketwatch.com/story/employers-are-using-your-personal-data-to-figure-out-the-lowest...
23•thisislife2•55m ago•4 comments

Simple Methods for Compression of Vectors

https://corvi.careers/blog/vector-search-embedding-compression/
1•sp1982•56m ago•0 comments

Washington state will require labels on AI images and set limits on chatbots

https://www.axios.com/local/seattle/2026/04/03/washington-ai-disclosure-law-images-video-watermar...
1•anigbrowl•56m ago•0 comments

Can we ever trust AI to watch over itself?

https://www.transformernews.ai/p/ai-alignment-researchers-want-to-superintelligence
1•gmays•1h ago•0 comments

Show HN: I scraped thousands of recent interview questions for SWE internships

https://www.nointernship.com/questions
1•oleksg•1h ago•0 comments

Show HN: I built a tiny LLM to demystify how language models work

https://github.com/arman-bd/guppylm
5•armanified•1h ago•0 comments

The human amygdala in threat learning and extinction

https://www.science.org/doi/10.1126/sciadv.aea8233
1•PaulHoule•1h ago•0 comments

AI models will scheme to protect other AI models from being shut down

https://tech.yahoo.com/ai/meta-ai/articles/ai-models-secretly-scheme-protect-162555909.html
1•gmays•1h ago•0 comments

Show HN: YouTube search barely works, I made a search form with advanced filters

https://playlists.at/youtube/search/
16•nevernothing•1h ago•6 comments

Aviation company employee attempted to take proprietary information to China

https://www.justice.gov/usao-ks/pr/aviation-company-employee-attempted-take-proprietary-informati...
4•737min•1h ago•1 comments

The 40 minutes when the Artemis crew loses contact with the Earth

https://www.bbc.com/news/articles/cj0vyzmmy50o
1•geox•1h ago•1 comments