frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•10mo ago

Comments

kate_at_refact•10mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Show HN: Tool to test brand presence across major LLMs

https://usefox.ai/tools/ai-audit
1•loose_booze•46s ago•0 comments

AI could end online anonymity

https://techxplore.com/news/2026-03-ai-online-anonymity.html
1•daoboy•2m ago•0 comments

Show HN: I made a to-do list app where users use LLMs to rewrite their front end

https://malleabletodo.app
1•maxharrison•2m ago•0 comments

Field notes from the circus of corporate AI adoption

https://mildlyverbose.mataroa.blog/blog/come-one-come-all-to-the-spectacular-corporate-ai-circus/
1•vorpalcoil•6m ago•0 comments

Traffic to top tech publications has plummeted since 2024, new analysis shows

https://www.niemanlab.org/2026/03/traffic-to-top-tech-publications-has-plummeted-since-2024-new-a...
1•giuliomagnifico•8m ago•0 comments

Will Claude Code Consume Legaltech?

https://lexifina.com/blog/agentic-ai-vs-legaltech
1•alansaber•10m ago•0 comments

Show HN: Building a WebSocket Chat App with C#, Redis Pub/Sub and .NET

https://github.com/sanzor/Ctesiphon
1•adrian_berco•10m ago•0 comments

Show HN: I built an AI desktop Waifu that remembers you

https://github.com/buyve/OpenMaiWaifu
1•openmaiwaifu•11m ago•0 comments

Coding agent rewrites (and improves) LGPL library and releases under MIT license

https://github.com/chardet/chardet/releases/tag/7.0.0
2•nemoniac•12m ago•0 comments

Package Manager Magic Files

https://nesbitt.io/2026/03/05/package-manager-magic-files.html
1•chmaynard•13m ago•0 comments

I Have an Archivist in My AI Coding Agents Crew

https://www.metateam.ai/blog/how-archivist-works
1•falsename•15m ago•0 comments

25mm Particle Board: Superior Rigidity for Your DHS Double Wardrobe

https://dreamhomestore.co.uk/collections/wardrobes
1•garryclarke1•15m ago•1 comments

US Military reportedly used Claude in Iran strikes despite Trump's ban

https://www.theguardian.com/technology/2026/mar/01/claude-anthropic-iran-strikes-us-military
2•_____k•17m ago•0 comments

The Management Myth [pdf]

http://pareto.uab.es/fsancho/The%20Management%20Myth.pdf
1•harperlee•18m ago•0 comments

Scientists study comet 3I/ATLAS to understand material from other star systems

https://comuniq.xyz/post?t=838
1•01-_-•19m ago•0 comments

Teaching LLMs to reason like Bayesians

https://research.google/blog/teaching-llms-to-reason-like-bayesians/
1•sebg•19m ago•0 comments

Styx Document Language

https://styx.bearcove.eu/
1•todsacerdoti•26m ago•0 comments

A Call for Meaningful Work at a Slower Pace

https://jenteottenburghs.wordpress.com/2025/11/18/a-call-for-meaningful-work-at-a-slower-pace/
3•carschno•26m ago•1 comments

Show HN: Anaya – CLI that scans codebases for DPDP compliance violations

https://github.com/sandip-pathe/anaya-scan
3•sandippathe•26m ago•1 comments

Show HN: Parsewise – Cursor for Business Documents

https://www.parsewise.ai/platform
2•maxhofer•26m ago•0 comments

Show HN: Chartle – Describe a chart in plain English and it creates it

https://www.chartle.app/
1•moorst•27m ago•1 comments

Show HN: A framework for building nexuses of agents

https://github.com/NetMindAI-Open/NexusAgent
1•Demi369•27m ago•0 comments

Fresh claim of making elusive 'hexagonal' diamond is the strongest yet

https://www.nature.com/articles/d41586-026-00711-9
1•Brajeshwar•31m ago•0 comments

AI can write genomes – how long until it creates synthetic life?

https://www.nature.com/articles/d41586-026-00681-y
1•Brajeshwar•32m ago•0 comments

Ex-Google PM Builds God's Eye to Monitor Iran in 4D [Text]

https://www.spatialintelligence.ai/p/the-intelligence-monopoly-is-over
2•fragmede•32m ago•1 comments

Show HN: Mnemora – Serverless memory DB for AI agents (no LLM in your CRUD path)

https://github.com/mnemora-db/mnemora
2•isaacgbc•35m ago•1 comments

Show HN: Slay the Spire 2 Wiki (database and card maker tool)

https://slaythespire2.gg
1•WanderZil•36m ago•1 comments

Top K is a deceptively hard problem in relational databases

https://www.paradedb.com/blog/optimizing-top-k
1•birdculture•36m ago•0 comments

Frak – a simple code deployment utility

https://github.com/frakjs/frak
1•strube•40m ago•0 comments

Ex-Google PM Builds God's Eye to Monitor Iran in 4D [video]

https://www.youtube.com/watch?v=0p8o7AeHDzg
2•KellyCriterion•41m ago•1 comments