frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•9mo ago

Comments

kate_at_refact•9mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

NASA acknowledges the elephant in the room with the SLS rocket

https://arstechnica.com/space/2026/02/nasa-finally-acknowledges-the-elephant-in-the-room-with-the...
1•knappe•15s ago•0 comments

New DeepSeek Research – The Future Is Here [video]

https://www.youtube.com/watch?v=fFL7la73RO4
1•chii•3m ago•0 comments

ICE seeks industry input on ad tech location data for investigative use

https://www.biometricupdate.com/202602/ice-seeks-industry-input-on-ad-tech-location-data-for-inve...
12•WaitWaitWha•12m ago•0 comments

Claude Cowork and the Case of SaaSpocalypse

https://gpt3experiments.substack.com/p/claude-cowork-and-the-case-of-saaspocalypse
2•nutanc•19m ago•1 comments

Show HN: An AI-Powered President Simulator

https://presiduck.feedscription.com/
3•tzhu1997•20m ago•0 comments

Astronauts Are Going Back to the Moon for the First Time in Half a Century

https://time.com/7346146/artemis-ii-launch-nasa-astronauts-moon-mission/
2•helloplanets•28m ago•0 comments

The CIA Is Sunsetting the World Factbook

https://actualityabridged.substack.com/p/the-cia-is-sunsetting-the-world-factbook
4•blizow•29m ago•0 comments

Climate Change Economic Models Omit Shocks, Likely Flawed

https://www.theguardian.com/environment/2026/feb/05/flawed-economic-models-mean-climate-crisis-co...
3•stego-tech•34m ago•1 comments

Show HN: A text format for UI wireframes – comparing token costs across 4 format

https://github.com/enlinks-llc/katsuragi
2•enlinks•36m ago•0 comments

Show HN: FIPSPad – a FIPS 140-3 and NIST SP 800-53 minimal Notepad app in Rust

https://github.com/BrowserBox/FIPSPad
2•keepamovin•36m ago•1 comments

Mick Jagger "Memo from Turner" (1970) [video]

https://archive.org/details/memo-from-turner-clip
2•petethomas•41m ago•0 comments

Show HN: Use Claude Code to Query and Analyze Your Finances

https://github.com/theFong/mmoney-cli
1•alecfong•44m ago•1 comments

4-Hour Builds: Anatomy of a Developer Experience Collapse

https://fabioluciano.com/en/4-hours-build-anatomy-devex-collapse/
1•fabioluciano•47m ago•0 comments

Spellcasting

https://phyous.github.io/spellcasting/
2•wpnx•49m ago•0 comments

OpenClaw Is Lonely [video]

https://vimeo.com/1160861583
1•laserduck•50m ago•0 comments

Strava removes 2.3M rides from leaderboards in clampdown on cheats

https://www.cyclingweekly.com/news/strava-removes-2-3-million-rides-from-leaderboards-in-clampdow...
2•brippalcharrid•50m ago•0 comments

Constant 14ms attention: 512→524K tokens (24.5x faster than FlashAttention)

https://github.com/RegularJoe-CEO/vllm/blob/waller-operator-integration/benchmarks/attention_benc...
1•luxiedge•52m ago•1 comments

Sequoias Need for Churn

https://www.gnupg.org/blog/20250117-aheinecke-on-sequoia.html
1•mocknen•53m ago•0 comments

Investigators found 'concerning similarities' between Reedley, Las Vegas labs

https://abc30.com/post/investigators-found-concerning-similarities-between-reedley-las-vegas-labs...
2•petethomas•58m ago•0 comments

Sam Altman Responds to Anthropic Ad Campaign

https://twitter.com/i/status/2019139174339928189
14•gradus_ad•59m ago•2 comments

Show HN: I've been running OpenClaw on a $640 Mac Mini for a week. Honest report

https://github.com/openclaw/openclaw
3•Legin82•1h ago•1 comments

Show HN: Tiny PWA to encrypt files using Passkeys

https://filokey.github.io/
2•dansjots•1h ago•0 comments

Doc2Calendar – I built an LLM pipeline to parse complex PDF schedules

https://www.doc2calendar.com/
2•mikebuilds•1h ago•1 comments

Ask HN: Is Connecting via SSH Risky?

3•atrevbot•1h ago•7 comments

Betterment Data Breach

https://haveibeenpwned.com/Breach/Betterment
1•skogstokig•1h ago•0 comments

Show HN: Buquet – Durable queues and workflows using only S3

https://horv.co/buquet.html
1•h0rv•1h ago•0 comments

AI Command and Staff–Operational Evidence and Insights from Wargaming

https://www.militarystrategymagazine.com/article/ai-command-and-staff-operational-evidence-and-in...
1•mooreds•1h ago•0 comments

Understanding the Political Disconnect

https://www.swarthmore.edu/understanding-political-disconnect
1•mooreds•1h ago•0 comments

How to Connect with Your Developer Audience (2022)

https://maida.kim/how-to-build-developer-audience/
1•mooreds•1h ago•0 comments

Open secrets about Hacker News

https://bengtan.com/blog/open-secrets-hacker-news/
11•thunderbong•1h ago•0 comments