frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•8mo ago

Comments

kate_at_refact•8mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Type Narrowing Patterns in Pyrefly That Make Type Checking More Intuitive

https://pyrefly.org/blog/type-narrowing/
1•ocamoss•50s ago•0 comments

Nitrate source and dementia risk: vegetables-dec. risk; water, animal foods-inc

https://medicalxpress.com/news/2026-01-nitrate-linked-dementia-vegetables.html
1•bikenaga•2m ago•0 comments

Ask HN: Will AIs Need Psychiatrists?

1•toddh•2m ago•0 comments

Prism

https://openai.com/index/introducing-prism
1•meetpateltech•3m ago•0 comments

Methods for protecting yourself against an LRAD system – Tech Ingredients (2020) [video]

https://www.youtube.com/watch?v=CXKTBQBugIA
2•goda90•3m ago•0 comments

Forever Overhead – David Foster Wallace

https://welcometotheloonybin.wordpress.com/2008/09/17/forever-overhead/
1•ofalkaed•4m ago•0 comments

MCP Apps

http://blog.modelcontextprotocol.io/posts/2026-01-26-mcp-apps/
1•sanj•6m ago•0 comments

Ask HN: How to avoid skill atrophy in LLM-assisted programming era?

2•py4•7m ago•0 comments

Pretty much 100% of our code is written by Claude Code and Opus 4.5

https://twitter.com/bcherny/status/2015979257038831967
1•sysoleg•7m ago•0 comments

Stanford scientists reveal oldest map of the night sky

https://www.kqed.org/news/12070647/stanford-scientists-reveal-oldest-map-of-the-night-sky-previou...
1•dr_dshiv•9m ago•0 comments

AI and Society: The Three Phases of Technological Adoption

https://ure.us/articles/ai-and-society-the-three-phases-of-technological-adoption/
1•sschotten•10m ago•0 comments

OpenAI Prism

https://openai.com/prism/
1•davidbarker•10m ago•0 comments

Show HN: LemonSlice – Give your voice agents a face

7•lcolucci•11m ago•0 comments

Ag-jail – Sandbox antigravity to avoid persistant/background process

https://github.com/M-Wham/ag-jail
1•mwham•12m ago•1 comments

Clawdbot is a security nightmare [video]

https://www.youtube.com/watch?v=kSno1-xOjwI
4•carlos-menezes•12m ago•0 comments

Southwest's Open-Seating Era Comes to an End

https://www.wsj.com/lifestyle/travel/my-last-dash-for-open-seats-on-southwest-90aec391
1•JumpCrisscross•13m ago•0 comments

Show HN: AnalysisXYZ – Browser-based CSV/Excel analyzer (privacy focused)

https://www.analysisxyz.dev
1•kushagarwal2907•15m ago•1 comments

Ask HN: How do you manage memory and context across Claude Code sessions?

1•nadis•16m ago•0 comments

Prep Early to Land an Overseas Job

https://relocateme.substack.com/p/how-to-prepare-for-an-overseas-job
1•andrewstetsenko•18m ago•0 comments

The Doomsday Clock is now at 85 seconds to midnight

https://thebulletin.org/doomsday-clock/
4•pbhak•18m ago•0 comments

Show HN: An open-source starter for developing with Postgres and ClickHouse

https://github.com/ClickHouse/postgres-clickhouse-stack
1•saisrirampur•19m ago•0 comments

UPS to cut additional 30,000 jobs in Amazon unwind, turnaround plan

https://www.cnbc.com/2026/01/27/ups-job-cuts-amazon-unwind-turnaround-plan.html
5•belter•20m ago•4 comments

VibeCodingBench: Benchmark Vibe Coding Models for Fun

https://twitter.com/yq_acc/status/2016201908181205358
1•jiayaoqijia•20m ago•1 comments

Former astronaut on lunar spacesuits: "I don't think they're great "

https://arstechnica.com/space/2026/01/former-astronaut-on-lunar-spacesuits-i-dont-think-theyre-gr...
1•rbanffy•21m ago•0 comments

How to Enable ProMotion 120Hz Mode in Safari (Mac, iPhone, and iPad)

https://birchtree.me/blog/how-to-enable-120hz-mode-in-safari-mac-iphone-and-ipad/
1•alwillis•23m ago•0 comments

37signals Isn't Smarter Than You, but They Are Different

https://www.nateberkopec.com/blog/37signals-is-not-smarter-than-you/
1•gaws•23m ago•0 comments

The Peptide Craze, a Surge in Use of Off-Label and Non-FDA Approved Peptides

https://erictopol.substack.com/p/the-peptide-craze
3•ck2•24m ago•1 comments

Bankers at Morgan Stanley are eviscerating Tesla's "robotaxi" performance

https://bsky.app/profile/niedermeyer.online/post/3mdg6hlruzk2o
4•doener•25m ago•0 comments

Will It Rain

https://rainycheck.com/
1•slowinthehead•26m ago•0 comments

Show HN: I built a tool that broke my 15-year doomscrolling habit in one week

https://tolerance.lol
1•wduncan•27m ago•1 comments