Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•11mo ago

Comments

kate_at_refact•11mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
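
For readers who want a concrete picture of the plan / execute / test / self-correct loop, here is a minimal sketch in Python. It is not Refact.ai's code. `AgentRun`, `scripted_policy`, `search_repo`, `apply_patch`, and `run_tests` are hypothetical names invented for illustration, and `deep_analysis` is only a stub that borrows the tool name from the list above. A real agent would replace `scripted_policy` with a call to the orchestrator model.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Tool = Callable[[str], str]          # a tool maps an argument to an observation
Step = Tuple[str, str, str]          # (tool name, argument, observation)


@dataclass
class AgentRun:
    """One multi-step run that ends with a single answer (hypothetical sketch)."""
    tools: Dict[str, Tool]
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self, choose_action: Callable[[List[Step]], Tuple[str, str]]) -> str:
        for _ in range(self.max_steps):
            name, arg = choose_action(self.history)   # the "orchestrator" decides
            if name == "finish":
                return arg
            tool = self.tools.get(name)
            observation = tool(arg) if tool else f"unknown tool: {name}"
            self.history.append((name, arg, observation))
        return "gave up: step budget exhausted"


def make_tools() -> Dict[str, Tool]:
    """Stub tools; the tests pass only after a second patch is applied."""
    state = {"patches": 0}

    def search_repo(query: str) -> str:
        return f"candidate location for '{query}'"

    def deep_analysis(context: str) -> str:
        return f"hypothesis based on: {context}"

    def apply_patch(patch: str) -> str:
        state["patches"] += 1
        return f"applied {patch}"

    def run_tests(_: str) -> str:
        return "PASSED" if state["patches"] >= 2 else "FAILED: 1 test"

    return {
        "search_repo": search_repo,
        "deep_analysis": deep_analysis,
        "apply_patch": apply_patch,
        "run_tests": run_tests,
    }


def scripted_policy(history: List[Step]) -> Tuple[str, str]:
    """Stand-in for the orchestrator model: picks the next tool from the last result."""
    if not history:
        return ("search_repo", "failing function")
    last_name, _, last_obs = history[-1]
    if last_name == "search_repo":
        return ("deep_analysis", last_obs)
    if last_name == "deep_analysis":
        return ("apply_patch", "fix v1")
    if last_name == "apply_patch":
        return ("run_tests", "")
    if last_name == "run_tests" and "FAILED" in last_obs:
        return ("apply_patch", "fix v2")          # self-correct and retry
    return ("finish", "patch ready")


if __name__ == "__main__":
    agent = AgentRun(tools=make_tools())
    print(agent.run(scripted_policy))
    for step in agent.history:
        print(step)
```

The point of the sketch is that the next tool is chosen from the history at each step rather than from a fixed script, and a failing test sends the run back to patching instead of ending it.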

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You're also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai
