frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

OpenAI Partners with Cerebras

https://openai.com/index/cerebras-partnership/
1•rcarmo•38s ago•0 comments

Richard Linklater's love letter to the New Wave

https://observer.co.uk/culture/film/article/richard-linklaters-love-letter-to-the-new-wave
1•tintinnabula•1m ago•0 comments

Have Taken Up Farming

https://dylan.gr/1768295794
1•djnaraps•4m ago•0 comments

Show HN: Semantic search for MTG

https://mtgbuilder.ai/search
1•Strift•4m ago•0 comments

Zhipu AI breaks US chip reliance with first major model trained on Huawei stack

https://www.scmp.com/tech/tech-war/article/3339869/zhipu-ai-breaks-us-chip-reliance-first-major-m...
1•thunderbong•9m ago•0 comments

After Every Clue

https://www.thenation.com/article/society/seymour-hersh-cover-up/
1•petethomas•9m ago•0 comments

Passenger 6.1.1

https://blog.phusion.nl/passenger-6-1-1/
1•amalinovic•10m ago•0 comments

Move Over, ChatGPT

https://www.theatlantic.com/technology/2026/01/claude-code-ai-hype/685617/
1•petethomas•11m ago•0 comments

Ask HN: Are you worried, and care, about AI stealing your code/secrets?

1•fnoef•13m ago•3 comments

Raspberry Pi AI HAT+ 2: Generative AI on Raspberry Pi 5

https://www.raspberrypi.com/news/introducing-the-raspberry-pi-ai-hat-plus-2-generative-ai-on-rasp...
1•schappim•16m ago•0 comments

A letter to those who fired tech writers because of AI

https://passo.uno/letter-those-who-fired-tech-writers-ai/
1•theletterf•18m ago•0 comments

AI tools boost individual scientists but could limit research as a whole

https://www.nature.com/articles/d41586-025-04092-3
2•oliverulerich•23m ago•0 comments

Bags and the Creator Economy

https://steve-yegge.medium.com/bags-and-the-creator-economy-249b924a621a
1•casparvitch•26m ago•0 comments

Increasing the performance of WebAssembly Text Format parser by 350%

https://blog.gplane.win/posts/improve-wat-parser-perf.html
1•gplane•27m ago•0 comments

Built a CLI package to create and maintain project structures

https://pypi.org/project/seed-cli/1.0.0/
1•hunterx•28m ago•0 comments

Voidlink – A Stealthy, Cloud-Native Linux Malware Framework

https://research.checkpoint.com/2026/voidlink-the-cloud-native-malware-framework/
1•fork-bomber•31m ago•0 comments

KR Customizer Shopify Product Configurator and Live Preview

1•krcweb•33m ago•0 comments

Apple, Google face pressure to remove X and Grok from their app stores

https://vechron.com/2026/01/apple-google-face-pressure-to-remove-x-and-grok-from-their-app-stores/
3•GeorgeWoff25•36m ago•2 comments

Best Business Plan Software for Startup and Entrepreneur?

1•selmas58•36m ago•0 comments

Show HN: I Indexed 4000 Agent Skills for Claude and OpenAI

https://agentskills.guide
1•superhuang•37m ago•1 comments

Use of Bayesian Methodology in Clinical Trials of Drug and Biological Products

https://www.fda.gov/media/190505/download
1•brendanashworth•40m ago•0 comments

Cyber+ – A versatile programming language for cybersecurity and automation

1•Czax225•41m ago•0 comments

Are You Dead?: The viral Chinese app for young people living alone

https://www.bbc.com/news/articles/c3381r5nnn6o
2•potatowaffle•42m ago•0 comments

DataRiver – Bank statement parsing using a private AI model

https://www.datariver.co
1•sandra_vu•42m ago•1 comments

Would you listen to my playlist while you work today?

https://suno.com/@runmutlu
1•spotlayn•50m ago•0 comments

Show HN: I built a satellite forensic engine to detect fraud in Carbon Markets

1•kccanarch•53m ago•0 comments

China's Z.ai claims it trained a model using only Huawei hardware

https://www.theregister.com/2026/01/15/zhipu_glm_image_huawei_hardware/
4•50kIters•54m ago•0 comments

Show HN: Matriq – Search inside video files using natural language

https://www.matriq.video/
2•Daviduche03•55m ago•0 comments

MailPilot.Chat - Email for AI Agents

1•keepamovin•57m ago•0 comments

France fines telcos €42M for sub-par security prior to 24M customer breach

https://www.theregister.com/2026/01/14/france_fines_free_free_mobile/
1•pjmlp•1h ago•0 comments