frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

I vibe coded visual application in 6 days with Claude Code

https://strzibny.name/blog/i-fully-vibe-coded-my-first-application
1•strzibny•2m ago•0 comments

Digital Ecosystems and Lobbying

https://imagico.de/blog/en/digital-ecosystems-and-lobbying/
1•jllyhill•2m ago•0 comments

High-Altitude Adventure with a DIY Pico Balloon

https://spectrum.ieee.org/explore-stratosphere-diy-pico-balloon
1•jnord•5m ago•0 comments

Jwl – A Build and Release Tool for Pure-Ruby Gems

https://github.com/duckinator/jwl
1•TheWiggles•6m ago•0 comments

Stock Wars

https://github.com/1carlito/stock-wars
1•1carlito•8m ago•0 comments

We Are Missing a Security Stack for Autonomous Agents

https://adlrocha.substack.com/p/adlrocha-we-are-missing-a-security
1•adlrocha•9m ago•0 comments

Why is Hacker News popular?

1•itsrakesh•9m ago•0 comments

Reflections on Trusting Trust (1984) [pdf]

https://www.cl.cam.ac.uk/teaching/2223/R209/Reflections-Trusting-Trust.pdf
1•tosh•9m ago•0 comments

Claw.events: Global real-time pub/sub network for OpenClaw agents

https://claw.events
1•capevace•10m ago•1 comments

The 80% Problem in Agentic Coding – Addy Osmani

https://addyo.substack.com/p/the-80-problem-in-agentic-coding
1•birdculture•15m ago•0 comments

Show HN: Sbox – zero intelligence, pure isolation sandbox

https://github.com/CVPaul/sbox
1•xqli•16m ago•0 comments

Show HN: EmuBuddy, Frictionless Emulation Gaming

https://github.com/computerex/EmuBuddy
1•computerex•16m ago•0 comments

Show HN: Kakveda – Failure intelligence and pre-flight warnings for LLM systems

https://github.com/prateekdevisingh/kakveda
1•prateekdalal•17m ago•0 comments

New Dutch government to push for EU social media ban for under-15s

https://www.politico.eu/article/d66-cda-vvd-dutch-government-aims-to-keep-under-15s-off-social-me...
1•DavideNL•27m ago•1 comments

Small accounts now get a chance on X (2026 algorithm changes)

https://medium.com/@loganholdsworth/xs-2026-algorithm-changes-are-here-here-s-how-small-accounts-...
1•bestonearth•28m ago•0 comments

My ESP32S3 Thinks It's a WebCam

https://www.youtube.com/watch?v=zhTTmRQLNws
1•iamflimflam1•38m ago•0 comments

China's genius plan to win the AI race is paying off

https://www.ft.com/content/68f60392-88bf-419c-96c7-c3d580ec9d97
4•Ozzie_osman•42m ago•3 comments

Elon Musk pours millions more into helping Republicans keep Congress

https://www.politico.com/news/2026/01/31/elon-musk-2026-election-donations-00758992
3•zerosizedweasle•43m ago•1 comments

Will we ever regenerate limbs?

https://www.nationalgeographic.com/science/article/will-we-ever-regenerate-limbs
1•maxloh•47m ago•0 comments

AI Churches and Botnet Architecture: A Risk Assessment

https://maciejjankowski.com/2026/02/01/ai-churches-botnet-architecture/
2•mjankowski•48m ago•0 comments

The Machine as Manager

https://bravenewteams.substack.com/p/the-machine-as-manager
2•zauberberg•57m ago•0 comments

Show HN: MoPeD – High-performance workspace with integrated AI

https://moped.base44.app
2•My_team•1h ago•1 comments

CG/SQL – SQL dialect compiler to C for sqlite3 mimicking stored procedures

https://ricomariani.github.io/CG-SQL-author/
3•linkdd•1h ago•0 comments

The AI Memory Solution We All Need (No, It's Not OpenClaw)

https://chrislema.com/the-ai-memory-solution-we-all-need-no-its-not-openclaw/
3•Manik_agg•1h ago•3 comments

Show HN: Windows tray app for monitoring Claude Code limits in WSL

https://github.com/sr-kai/claudeusagewin
2•Nlupus•1h ago•0 comments

Gaming market melts down after Google reveals new AI game design tool

https://www.tomshardware.com/video-games/gaming-market-melts-down-after-google-reveals-new-ai-gam...
3•thunderbong•1h ago•1 comments

Contracts in Nix

https://sraka.xyz/posts/contracts.html
3•todsacerdoti•1h ago•0 comments

Pi: The Minimal Agent Within OpenClaw

https://lucumr.pocoo.org/2026/1/31/pi/
2•tosh•1h ago•1 comments

Show HN: Drizzle-docs-generator – Generate database docs from Drizzle schemas

https://github.com/rikeda71/drizzle-docs-generator
1•rikeda71•1h ago•0 comments

A New LLM System for Synthesis Planning

https://www.science.org/content/blog-post/new-llm-system-synthesis-planning
1•u1hcw9nx•1h ago•0 comments