Cool paper! The authors use the fact that the M1 chip supports both ARM's weaker memory consistency model and x86's total order to investigate the performance hit from using the latter, ceteris paribus.
They see an average of 10% degradation on SPEC and show some synthetic benchmarks with a 2x hit.
charles_irl•2h ago
They see an average of 10% degradation on SPEC and show some synthetic benchmarks with a 2x hit.