Why Nothing Matters: The Impact of Zeroing

Date

October 28, 2011

Speaker

Xi Yang

Affiliation

Australian National University

Overview

Managed languages use memory safety to defend against inadvertent and malicious misuse of memory. Unmanaged native languages are increasingly integrating memory safety for the same reasons. A critical element of memory safety is initializing new memory before the program obtains it. Our experiments show that zero initialization is surprisingly expensive in a highly optimized managed runtime — on average the direct cost of zeroing is 4% to 6% and up to 50% of total application time on a variety of modern processors. Zeroing incurs indirect costs as well, which include memory bandwidth consumption and cache displacement. Existing virtual machines (VMs) either: a) minimize direct costs by zeroing in large blocks, or b) minimize indirect costs by integrating zeroing into the allocation sequence to reduce cache displacement.

This paper first describes and evaluates zero initialization costs and the two existing design points. Our micro architectural analysis of prior designs inspires two better designs that exploit concurrency and non-temporal cache-bypassing instructions to reduce the direct and indirect costs simultaneously. We show that the best strategy is to adaptively choose between the two new designs based on CPU utilization. This approach improves over widely used hot-path zeroing by 3% on average and up to 15% on the newest Intel i7-2600 processor, without slowing down any of the benchmarks. These results indicate that zero initialization is a surprisingly important source of overhead in existing VMs and that our new software strategies are effective at reducing this overhead. These findings also invite other optimizations, including software elision of zeroing and micro architectural support.

Speakers

Xi Yang

Xi Yang is a PHD student in the Department of Computer Science at Australian National University. His research focus on memory subsystem, especially the interface between computer architecture and language run-time system.