frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•6mo ago

Comments

kate_at_refact•6mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Matter 1.5 Officially Adds Support for Smart Cameras and Energy Management

https://www.iclarified.com/99104/matter-15-officially-adds-support-for-smart-cameras-and-energy-m...
1•Brajeshwar•7m ago•0 comments

AWS ECS and EKS now have remote MCP servers

https://aws.amazon.com/about-aws/whats-new/2025/11/amazon-eks-ecs-fully-managed-mcp-servers-preview/
1•stellastah•8m ago•0 comments

Cincinnati Subway

https://en.wikipedia.org/wiki/Cincinnati_Subway
1•keiferski•15m ago•0 comments

International Crypto Association elections botched by loss of key

https://iacr.org/news/item/27138
3•tomgag•21m ago•0 comments

The risk of round numbers and sharp thresholds in clinical practice

https://www.nature.com/articles/s41746-025-02079-y
1•asplake•32m ago•0 comments

Show HN: LexiForge – Auto-generate vocabulary flashcards from Kindle lookups

https://medium.com/@mr.thantsintoe/i-kept-forgetting-every-word-i-looked-up-on-my-kindle-so-i-bui...
1•thantsintoe•33m ago•0 comments

soul16 – Vibecoding native iOS and Android Apps

https://www.soul16.com
1•rendernos•39m ago•0 comments

Chromium reconsiders JPEG-XL implementation

https://groups.google.com/a/chromium.org/g/blink-dev/c/WjCKcBw219k/m/NmOyvMCCBAAJ
3•OuterVale•39m ago•0 comments

Ask HN: What recent thing you've been tasked improved your skills significantly?

1•setnone•43m ago•0 comments

I built TestCrew to solve the Android 12-tester problem

https://play.google.com/store/apps/details?id=com.testcrew&hl=en_US
1•akira-freeweb•43m ago•1 comments

UN human rights expert urges US to lift sanctions on Cuba

https://www.dw.com/en/un-human-rights-expert-urges-us-to-lift-sanctions-on-cuba/a-74845654
2•rguiscard•43m ago•0 comments

Preserving Historical Cryptography with Modern Python

https://github.com/denismaggior8/enigma-python
1•denismaggior8•44m ago•0 comments

hfsearch: a fast cli tool to discover models and datasets on HuggingFace

https://github.com/HenokB/hfsearch
1•henok_ademtew•49m ago•1 comments

Cloudflare error page of every HTTP status code (reload to show random page)

https://cloudflare-error-page-3th.pages.dev/
1•Donlon•52m ago•0 comments

Show HN: PokeSuite – Pokémon TCG pack simulator and competitive team builder

https://www.pokesuite.com
1•Fsen•52m ago•1 comments

Serflings is a remake of The Settlers 1

https://www.simpleguide.net/serflings.xhtml
1•doener•57m ago•0 comments

When AI Goes Wrong

https://whenaifail.com/category/ai-coding/
1•daco•58m ago•0 comments

Show HN: Minimal Start/Stop app for tracking billable hours (Tauri)

https://github.com/HustleCoding/time-tracker
1•FlorinDobinciuc•58m ago•0 comments

Self-destructing thumb drive can brick itself and wipe your secret files away

https://www.theregister.com/2025/11/21/selfdestructing_external_ssd/
2•beardyw•1h ago•1 comments

Open Source Village

https://opensourcevillage.org/
2•me_bx•1h ago•0 comments

Australia's High Court Chief Justice says judges have become "human filters"

https://www.theguardian.com/law/2025/nov/21/judges-have-become-human-filters-as-ai-in-australian-...
2•ubutler•1h ago•0 comments

Inside Korea's Extreme Labor System [video]

https://www.youtube.com/watch?v=pjjhrwVYPE8
1•rendall•1h ago•0 comments

Structural Inducements for Hallucination in Large Language Models

https://zenodo.org/records/17655375
1•taubek•1h ago•0 comments

Enshittification of Arduino Begins? Qualcomm Starts Clamping Down

https://itsfoss.com/news/enshittification-of-arduino-begins/
5•Teknoman117•1h ago•1 comments

Can you take an ox to Oxford?

https://alexwlchan.net/2025/ox-in-oxford/
1•surprisetalk•1h ago•0 comments

All warfare is information warfare

https://shakeddown.substack.com/p/all-warfare-is-information-warfare
2•surprisetalk•1h ago•0 comments

The 45-year period when America got things done [video]

https://www.youtube.com/watch?v=r9xdvOATny0
1•surprisetalk•1h ago•0 comments

Claude for PHP Developers

https://codewithphp.com/series/claude-php-developers/
1•dalemhurley•1h ago•1 comments

Callspark now let's you call any US number for $0.02 per minute

http://x.com/callsparkapp
1•ahmaliic•1h ago•0 comments

Navy Salvage Ship Trying to Fish Super Hornet and Seahawk Out of South China Sea

https://www.twz.com/sea/navy-salvage-ship-trying-to-fish-crashed-super-hornet-and-seahawk-out-of-...
3•breve•1h ago•0 comments