Emerging Capabilities in HPCToolkit - Mellor-Crummey, Tallent, Liu
Over the past year, Rice University has been extending the HPCToolkit performance tools with new capabilities. This talk will describe new strategies for analyzing performance measurements of parallel programs collected using asynchronous sampling, including scalable analysis of call path profiles to identify load imbalance, call path tracing to understand the temporal behavior of parallel programs, and data-centric profiling to associate memory hierarchy costs with program variables.