Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•7mo ago

Comments

kate_at_refact•7mo ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: a fully autonomous Agent, with no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom...
• Claude 3.7 Sonnet as orchestrator
• deep_analysis() tool (powered by o4-mini) for reasoning
• Tool suite for repository exploration, code modification, and testing, used dynamically based on task needs
• One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.
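To make the plan, execute, test, self-correct loop described above concrete, here is a minimal toy sketch. It is not Refact.ai's implementation: the tool names (`explore`, `apply_patch`, `run_tests`), the `AgentState` type, and the scripted `choose_action` policy are all hypothetical stand-ins, and the scripted policy takes the place of an orchestrator model so the example can run without any LLM.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class AgentState:
    """Toy workspace: an in-memory 'repository' plus a log of tool calls."""
    files: Dict[str, str]
    history: List[Tuple[str, str]] = field(default_factory=list)


def explore(state: AgentState) -> str:
    # Hypothetical exploration tool: just returns the file contents.
    return state.files["calc.py"]


def apply_patch(state: AgentState) -> str:
    # Hypothetical modification tool: writes a fixed, hard-coded patch.
    state.files["calc.py"] = "def add(a, b):\n    return a + b\n"
    return "patched"


def run_tests(state: AgentState) -> str:
    # Hypothetical testing tool: executes the file and checks add(2, 3) == 5.
    namespace: dict = {}
    try:
        exec(state.files["calc.py"], namespace)
        return "pass" if namespace["add"](2, 3) == 5 else "fail"
    except Exception as exc:
        return f"error: {exc}"


TOOLS: Dict[str, Callable[[AgentState], str]] = {
    "explore": explore,
    "apply_patch": apply_patch,
    "run_tests": run_tests,
}


def choose_action(state: AgentState) -> str:
    # Scripted policy standing in for the orchestrator model (assumption).
    if not state.history:
        return "explore"
    last_tool, last_result = state.history[-1]
    if last_tool == "run_tests":
        return "done" if last_result == "pass" else "apply_patch"
    return "run_tests"  # after exploring or patching, (re)run the tests


def run_agent(state: AgentState, max_steps: int = 10) -> bool:
    """Loop until the tests pass or the step budget runs out."""
    for _ in range(max_steps):
        action = choose_action(state)
        if action == "done":
            return True
        state.history.append((action, TOOLS[action](state)))
    return False


if __name__ == "__main__":
    buggy = AgentState(files={"calc.py": "def add(a, b):\n    return a - b\n"})
    print(run_agent(buggy), [tool for tool, _ in buggy.history])
```

In this sketch the single run explores, sees a failing test, patches, and re-tests before stopping, which mirrors the "one multi-step run, one solution" idea only in shape; a real agent would pick tools dynamically rather than follow a fixed script.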

You can read the technical details of our SWE-bench approach here: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! You are also welcome to try Refact.ai Agent in VS Code and JetBrains: https://linktr.ee/refactai

China Seen Overtaking U.S. as Global Superpower (2011)

https://www.pewresearch.org/global/2011/07/13/china-seen-overtaking-us-as-global-superpower/
1•lawrenceyan•3m ago•0 comments

A development tool I cannot live without: bin/merge_master_into_all_git_branches

https://www.semicolonandsons.com/articles/merge-master-into-all-git-branches
1•jackkinsella•8m ago•0 comments

Grok Official Full Fixed Point Engine Release Google Suppressing

https://github.com/AnalyticalAgnosticAndrewRusher/VCH-Fixed-Point-Game-Engine-VIsualizer
1•ApexSignalAndy•9m ago•1 comments

Data center deals hit record $61B in 2025 amid construction frenzy

https://www.cnbc.com/2025/12/19/data-center-deals-hit-record-amid-ai-funding-concerns-grip-invest...
2•1vuio0pswjnm7•11m ago•0 comments

DraftKings hopes to score big with new prediction markets app

https://www.cbsnews.com/news/draftkings-prediction-markets-app-sports-betting/
2•mhb•19m ago•0 comments

Laws That Do Harm (1982)

https://miltonfriedman.hoover.org/internal/media/dispatcher/214279/full
2•mhb•22m ago•0 comments

From Zero to RAG (Part 1)

https://turtosa.com/blog/from-zero-to-rag
1•kevinroleke•23m ago•0 comments

Google and Apple warn employees on visas to avoid international travel

https://techcrunch.com/2025/12/20/google-and-apple-reportedly-warn-employees-on-visas-to-avoid-in...
6•SilverElfin•24m ago•2 comments

Climate change's hidden price tag: a drop in our income

https://news.arizona.edu/news/climate-changes-hidden-price-tag-drop-our-income
1•geox•27m ago•1 comments

HoustonTracker2 – A Music Sequencer for the Texas TI-82

https://www.irrlichtproject.de/houston/
1•austinallegro•28m ago•0 comments

TailwindSQL: Like TailwindCSS but SQL.className your way to database queries

https://tailwindsql.xyz/
1•sawirricardo•29m ago•0 comments

This is a duplicate. Please delete it.

https://community.ntppool.org/t/ntp-at-nist-boulder-has-lost-power/4192
1•nobody9999•32m ago•1 comments

HBM Supply Curve Gets Steeper, but Still Can't Meet Demand

https://www.nextplatform.com/2025/12/19/hbm-supply-curve-gets-steeper-but-still-cant-meet-demand/
1•rbanffy•33m ago•0 comments

U.S. Plans $80B Nuclear Power Expansion

https://spectrum.ieee.org/80-billion-us-nuclear-power
2•rbanffy•35m ago•1 comments

When creating images, AI keeps remixing the same 12 stock photo clichés

https://www.science.org/content/article/when-creating-images-ai-keeps-remixing-same-12-stock-phot...
1•rbanffy•36m ago•0 comments

C-reactive protein outpaced 'bad' cholesterol as leading heart disease risk marker

https://theconversation.com/how-c-reactive-protein-outpaced-bad-cholesterol-as-leading-heart-dise...
3•bikenaga•39m ago•0 comments

STPA (System Theoretic Process Analysis) at Google

https://sre.google/resources/practices-and-processes/stpa/
1•motxilo•42m ago•0 comments

Rcarmo/Guerite: A Watchtower Replacement

https://github.com/rcarmo/guerite
1•rcarmo•44m ago•0 comments

OpenWRT 25.12.0-RC1 Released

https://downloads.openwrt.org/releases/25.12.0-rc1/
2•josteink•51m ago•0 comments

OpenWRT 24.10.5 Released

https://openwrt.org/releases/24.10/notes-24.10.5
2•josteink•52m ago•0 comments

Why the fuel-switch story does not explain the AI171 crash

https://frontline.thehindu.com/the-nation/ai-171-crash-boeing-787-electrical-failure-core-network...
1•sltr•52m ago•1 comments

Show HN: Calcu-gator.com – Financial calculators for Canadians

https://calcu-gator.com/
2•Nitromax•57m ago•0 comments

Monte Carlo Cubes

https://thevesselshortstories.substack.com/p/monte-carlo-cubes
1•kawrydav•1h ago•0 comments

I wrote a code editor in C and now I'm a changed man

https://github.com/thisismars-x/light
15•birdculture•1h ago•6 comments

Show HN: Prove your compliance posture with automated evidence (OSCAL)

https://github.com/clay-good/attestful
1•hireclay•1h ago•0 comments

I built a tool to do my bookkeeping for me (freelancer)

https://billpal.io/
2•romanleeb•1h ago•2 comments

FrontierScience Benchmark by OpenAI

https://openai.com/index/frontierscience/
2•mustaphah•1h ago•0 comments

Show HN: SolarSystem, a Solarized-like theme generator using OKHSL and APCA

https://solarsys.dev/
2•zacharyvoase•1h ago•0 comments

More databases should be single-threaded

https://blog.konsti.xyz/p/8c8a399f-8cfe-47dd-9278-9527105d07dc/
3•lawrencechen•1h ago•0 comments

Titan's strong tidal dissipation precludes a subsurface ocean

https://www.sciencedaily.com/releases/2025/12/251220104621.htm
2•gradus_ad•1h ago•0 comments