frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Elon Musk becomes first person worth $700B following pay package ruling

https://www.reuters.com/business/autos-transportation/elon-musk-becomes-first-person-worth-700-bi...
1•ksec•1m ago•0 comments

Show HN: The Official National Train Map Sucked, So I Made My Own

https://www.bdzmap.com/
1•Pavlinbg•2m ago•0 comments

Concurrent JavaScript: It can work

https://webkit.org/blog/7846/concurrent-javascript-it-can-work/
1•samwillis•3m ago•0 comments

Show HN: A practical database of AI SEO strategies for founders and marketers

https://www.aiseodatabase.com/
1•mohitvaswani•4m ago•0 comments

Show HN: Free tool to blur images instantly without signup or watermarks

https://www.blurimageonline.com/
2•teroquyiqwu•6m ago•0 comments

Against Likes and Subscribers

https://metanomad.blog/against-likes-and-subscribers/
1•speckx•8m ago•0 comments

Inception X TicTacToe: a fractal game

https://tic-tac-toe-inception-49280791970.us-west1.run.app/
1•apitaru•10m ago•1 comments

An 11-qubit atom processor in silicon with all fidelities from 99.10% to 99.99%

https://www.nature.com/articles/s41586-025-09827-w
1•giuliomagnifico•10m ago•0 comments

Eye Sentry: 5-Day Built macOS Eye Care Tool

https://eye-sentry.vercel.app
1•lispking•12m ago•1 comments

Language Switcher Guide: Java, JavaScript, Python, Go Comparison for B

https://blog.blockingqueue.com/language-switcher-cheatsheet-java-javascript-python-go
1•liviu31•12m ago•0 comments

Research Reveals the Optimal Way to Optimize

https://www.wired.com/story/researchers-discover-the-optimal-way-to-optimize/
2•quapster•13m ago•0 comments

NY Gov. vetoes bill to mandate 2-person subway train crews

https://gothamist.com/news/ny-gov-hochul-vetoes-bill-to-mandate-2-person-subway-train-crews
1•geox•13m ago•0 comments

A Homebuilt CO2 Meter as a Virus Risk Proxy (Shallow Thoughts)

https://shallowsky.com/blog/hardware/co2-meter.html
1•speckx•20m ago•0 comments

Stop Saying "Agent" – Name the Work, Own the Output

https://btriani.medium.com/stop-saying-agent-name-the-work-own-the-output-a9a2902a5314
1•btriani•20m ago•1 comments

Mnemonics for Hidden Controls in Win32

https://www.abareplace.com/blog/hidden_mnemonics/
1•todsacerdoti•24m ago•0 comments

Loud Rabbit (Music Video / Lyrics)

https://www.tolmix.com/loud-rabbit
1•n-fuselius•24m ago•1 comments

Show HN: Lockify – developer-friendly CLI for managing encrypted env variables

https://github.com/ahmed-abdelgawad92/lockify
2•ahmedabdelgawad•25m ago•0 comments

Lightweight is a metric, not an adjective

https://simpleobservability.com/blog/lightweight-is-metric
1•khazit•36m ago•0 comments

12 days on Wolf Rock Lighthouse [video]

https://www.youtube.com/watch?v=PhRbJ3DQdlQ
1•bschne•38m ago•0 comments

Jingle Bells (Batman Smells): an incomplete festive folk-rhyme taxonomy

https://loreandordure.com/2025/12/16/jingle-bells/
2•helsinkiandrew•39m ago•0 comments

Yann LeCun Is Raising Half a Billion Dollars to Build Nothing (Yet)

https://medium.com/@anwarzaid76/yann-lecun-is-raising-half-a-billion-dollars-to-build-nothing-yet...
2•MindBreaker2605•41m ago•0 comments

No Fun Allowed

https://josevalerio.com/no-fun-allowed
1•josevalerio•45m ago•0 comments

Show HN: Zimage2.online – An AI image tool built on Alibaba's Z-Image model

https://zimage2.online/
1•chenliang001•47m ago•0 comments

The ML Trench

https://deep-ml-trench.vercel.app/
1•hexhowells•47m ago•0 comments

The iPhone 16e Is Good

https://manualdousuario.net/en/iphone-16e-is-good-actually/
1•rpgbr•48m ago•0 comments

AI in 2026 and beyond ⊗ Bioregionalism's tech-driven revival

https://sentiers.media/ai-in-2026-and-beyond-bioregionalisms-tech-driven-revival-no-384/
1•speckx•49m ago•0 comments

Show HN: Dots: a bullet journal I built to understand my migraines

https://dotsjournal.app/
1•tubignaaso•50m ago•0 comments

US submarines are outnumbered in the Pacific. South Korea has a plan to help

https://www.cnn.com/2025/12/20/asia/south-korea-nuclear-powered-submarines-intl-hnk-ml-dst
1•breve•51m ago•0 comments

Construct in 2025: Year in Review

https://www.construct.net/en/blogs/construct-official-blog-1/construct-2025-year-review-1898
1•AshleysBrain•55m ago•0 comments

Teardown of the Gigaset CL660HX DECT phone and how to disable annoying flash LED

https://github.com/hn/gigaset-cl660hx
2•hn___•57m ago•0 comments