frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Drug Laws Have Prevented Scientists from Studying Mushrooms

https://thesporereport.com/?p=606
1•speckx•40s ago•0 comments

Bitwarden launches enhanced premium plan

https://bitwarden.com/blog/bitwarden-launches-enhanced-premium-plan/
1•brycewray•1m ago•0 comments

Devin Review: AI to Stop Slop

https://cognition.ai/blog/devin-review#the-birth-and-stagnation-of-code-review
1•swyx•1m ago•0 comments

Microsoft CEO warns AI must 'do something useful' or lose 'social permission'

https://www.pcgamer.com/software/ai/microsoft-ceo-warns-that-we-must-do-something-useful-with-ai-...
1•akyuu•2m ago•0 comments

DOGE Employees Shared Social Security Data, Court Filing Shows

https://www.nytimes.com/2026/01/20/us/politics/doge-employees-social-security-data.html
3•pseudolus•2m ago•1 comments

Verizon carriers start switching to 365day device unlock policy, up from 60 days

https://9to5google.com/2026/01/20/verizon-device-unlock-policy-365-day/
1•thunderbong•3m ago•0 comments

More diversity means better science, says Nature journal chief

https://www.thetimes.com/uk/science/article/dei-diversity-better-science-nature-journal-boss-tgb7...
1•binning•4m ago•0 comments

How the NHS became the battleground in the trans debate facing workplaces

https://www.bbc.co.uk/news/articles/c7v0l25mr2ro
1•binning•7m ago•0 comments

Power, Consumption and Gender: An analysis of Barbara Kruger's political art

https://feminisminindia.com/2026/01/14/power-consumption-and-gender-an-analysis-of-barbara-kruger...
1•binning•8m ago•0 comments

Every big lab is putting resources in building world models

https://ankitmaloo.com/world-models/
1•ankit219•9m ago•0 comments

Show HN: Remember Me – O(1) Client-Side Memory (40x cheaper than Vector DBs)

https://github.com/merchantmoh-debug/Remember-Me-AI
1•MohskiBroskiAI•9m ago•0 comments

Manipulating blood CO₂ levels may help clear toxic proteins from the brain

https://medicalxpress.com/news/2026-01-blood-co8322-toxic-proteins-brain.html
1•bikenaga•9m ago•0 comments

480k-Year-Old Elephant Bone Tool Is the Oldest Ever Found Outside Africa

https://www.iflscience.com/this-480000-year-old-elephant-bone-tool-is-the-oldest-ever-found-outsi...
1•geox•11m ago•0 comments

How are you automating your coding work?

4•manthangupta109•12m ago•1 comments

Tracking Kernel Development with Korgalore

https://people.kernel.org/monsieuricon/tracking-kernel-development-with-korgalore
1•atomlib•13m ago•0 comments

Data Modeling: Living notes on levels, techniques, and patterns

https://www.ssp.sh/brain/data-modeling/
2•articsputnik•13m ago•0 comments

Doctors declare effects of child phone use a public health emergency

https://www.thetimes.com/uk/politics/article/phone-impact-on-children-is-public-health-emergency-...
1•chrisjj•14m ago•1 comments

Show HN: Snapbyte – personalized email digests from HN/Reddit/Lobsters

https://snapbyte.dev
3•onatm•16m ago•0 comments

Disrupted brain balance in alcohol dependence involves two signaling pathways

https://medicalxpress.com/news/2025-12-disrupted-brain-alcohol-involves-pathways.html
1•PaulHoule•16m ago•0 comments

Show HN: I vibecoded a Test Management app for Jira

https://marketplace.atlassian.com/apps/695702622/bestest-requirement-test-management
1•pakosteve•17m ago•0 comments

Setting Up a Cluster of Tiny PCs for Parallel Computing

https://www.kenkoonwong.com/blog/parallel-computing/
3•speckx•18m ago•0 comments

Getting Cited as a Source on Wikipedia

https://www.coryd.dev/posts/2026/getting-cited-as-a-source-on-wikipedia
1•cdrnsf•18m ago•0 comments

Ask HN: Is OBD-II telematics data more private than mobile app tracking?

1•insuranceguru•18m ago•0 comments

Show HN: JitAPI – An MCP server that treats OpenAPI specs as dependency graphs

https://github.com/nk3750/jitapi
1•peaknk•19m ago•0 comments

Element Pro Web Introduces Grid View

https://element.io/blog/element-pro-web-introduces-grid-view/
1•Arcuru•20m ago•0 comments

If you're struggling to get your engineers to adopt AI, read this

https://www.geocod.io/code-and-coordinates/2026-01-21-hand-chiseling-code/
1•mijustin•20m ago•1 comments

The Treachery of Signs Semiotic Mediation

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5987495
2•spacebacon•21m ago•0 comments

How to track your AI Search visibility

https://www.scriptbee.ai/
1•Rayn_11•22m ago•0 comments

How much glycogen is stored in a runner's liver?

https://runningwritings.com/2023/10/how-much-glycogen-is-stored-in-a-runners-liver.html
2•galeaspablo•24m ago•0 comments

Show HN: DockerHoster – Self-hosted alternative to Vercel with auto-deployments

https://twitter.com/jaequery/status/2014049948195815585
1•jaequery•24m ago•0 comments