A Region-Driven Analysis of Processor Power Consumption for Various HPC Workloads
Citation
MLA Style: Sharda Dixit, Amarjeet Sharma, Anil Gupta "A Region-Driven Analysis of Processor Power Consumption for Various HPC Workloads" International Journal of P2P Network Trends and Technology 9.1 (2019): 13-20.
APA Style:Sharda Dixit, Amarjeet Sharma, Anil Gupta (2019). A Region-Driven Analysis of Processor Power Consumption for Various HPC Workloads. International Journal of P2P Network Trends and Technology, 9(1), 13-20.
Abstract
Power consumption is one of the significant challenges for present and future HPC systems. The processor is one of the most power-hungry components at the node level of an HPC cluster. Various features are already implemented in processor hardware and OS/kernel, which optimize power consumption at the node level. We performed several experiments on a variety of workloads and observed power consumption patterns for CPU, memory and IO intensive regions of workloads. The investigations focused on correlating the CPU utilization, memory access, idle state residency, clock gated residency, and power consumption. Our results show that the scope of reducing energy consumed by workload, using employing frequency scaling, for compute-intensive workloads, is slim at the system software level.
References
[1] Exascale Challenges, Scientific Computing World, July 2014, https://www.scientificcomputing.com/feature/exascale-challenges
[2] Power Problems Threaten to Strangle Exascale Computing, IEEE Spectrum, Dec 2015, http://spectrum.ieee.org/computing/hardware/power-problems-threaten-to-strangle-exascale-computing
[3] Technical Challenges of Exascale Computing, JASON, The MITRE Corporation, April 2013, https://fas.org/irp/agency/dod/jason/exascale.pdf
[4] Amarasinghe S, Campbell D, Carlson W, et al. (2009) Exascale Software Study: Software Challenges in Extreme Scale Systems. DARPA Report http://mrichards.ece.gatech.edu/wp-content/uploads/sites/462/2016/08/ECSS-report-101909.pdf
[5] ExaScale Computing Study: Technology Challenges in Achieving Exascale Systems, DAPRA IPTO, September 28, 2008, http://www.cse.nd.edu/Reports/2008/TR-2008-13.pdf
[6] John Shalf, Sudip Dosanjh, and John Morrison, Exascale Computing Technology Challenges, Springer-Verlag Berlin Heidelberg 2011, https://www.nersc.gov/assets/NERSC-Staff-Publications/2010/ShalfVecpar2010.pdf
[7] Top Ten Exascale Research Challenges, ASCAC Subcommittee Report, Department of Energy, USA, February 10, 2014, https://science.energy.gov/˜/media/ascr/ascac/pdf/meetings/20140210/ Top10reportFEB14.pdf
[8] Top500, https://www.top500.org/
[9] James H. Laros III, Measuring and Tuning Energy Efficiency on Large-Scale High-Performance Computing Platforms, SAND2011-5702, August 2011
[10] Sundriyal, V. & Sosonkina, M. J Supercomput (2016) 72: 1549. https://doi.org/10.1007/s11227-016-1680-4
[11] J. Eastep, S. Sylvester, C. Cantalupo, B. Geltz, F. Ardanaz, A. Al- Rawi, Livingston, F. Keceli, M. Maiterth, and S. Jana, Global extensible open power manager: A vehicle for HPC community collaboration on co-designed energy management solutions, in High Performance Computing - 32nd International Conference, ISC High Performance 2017, Frankfurt, Germany, June 18-22, 2017, Proceedings, 2017, pp. 394412.
[12] R. E. Grant, M. Levenhagen, S. L. Olivier, D. DeBonis, K. T. Pedretti, and J. H. L. III, Standardizing power monitoring and control at exascale, Computer, vol. 49, no. 10, pp. 3846, Oct 2016, ISSN: 0018-9162.
[13] Cotes-Ruiz IT, Prado RP, Garca-Galn S, Muoz-Expsito JE, Ruiz-Reyes N (2017) Dynamic Voltage Frequency Scaling Simulator for Real Workflows Energy-Aware Management in Green Cloud Computing. PLoS ONE 12(1): e0169803. https://doi.org/10.1371/journal.pone.0169803
[14] Georges Da Costa, Jean-Marc Pierson. DVFS governor for HPC: Higher, Faster, Greener. 23rd Euromicro International Conference on Parallel, Distributed and network-based Processing (PDP 2015), Mar 2015, Turku, Finland. 2015 23rd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), pp.533-540, 2015, ¡10.1109/PDP.2015.73¿. ¡hal-01387826¿
[15] Ghislain Landry Tsafack Chetsa, Laurent Lefevre, Jean-Marc Pierson, Patricia Stolf, Georges Da Costa. Beyond CPU Frequency Scaling for a Fine-grained Energy Control of HPC Systems. 24th International Symposium on Computer Architecture and High-Performance Computing, Oct 2012, New York City, United States. 2012.
[16] Sundriyal, V., Sosonkina, M. & Zhang, Z. J Supercomput (2014) 68: 777. https://doi.org/10.1007/s11227-013-1062-0
[17] Nikl V., Hradecky M., Keleceni J., Jaros J. (2017) The Investigation of the ARMv7 and Intel Haswell Architectures Suitability for Performance and Energy-Aware Computing. In: Kunkel J., Yokota R., Balaji P., Keyes (eds) High-Performance Computing. ISC 2017. Lecture Notes in Computer Science, vol 10266. Springer, Cham
[18] ” HPL Benchmark,” http://www.netlib.org/benchmark/hpl/
[19] ” Intel MKL Benchmarks,” https://software.intel.com/en-us/articles/intel-mkl-benchmarks-suite
[20] ” Stream Benchmark,” https://www.cs.virginia.edu/stream/
[21] Stress-ng benchmark, http://kernel.ubuntu.com/cking/stress-ng/
[22] ” Lammps Benchmarks,” http://lammps.sandia.gov/bench.html
[23] ” WRV V3 Parallel Benchmark”, http://www2.mmm.ucar.edu/wrf/WG2/ benchv3/
[24] ” Gromacs Benchmark,” http://www.gromacs.org/About Gromacs/ Benchmarks
[25] ” Running Average Power Limit,” https://01.org/blogs/2014/running-average-power-limit-%E2%80%93-rapl
[26] ” Performance Counter Monitor,” https://software.intel.com/en-us/articles/intel-performance-counter-monitor
[27] ” PowerTop,” https://01.org/powertop
[28] ” Intel Xeon Processor E5-1600/E5-2600/E5-4600 Product Families”, Datasheet - Volume One, May 2012, https://www.mouser.com/ds/2/612/xeon-e5-1600-2600-vol-1-datasheet-263646.pdf
[29] ” Intel Turbo Boost Technology 2.0”, https://www.intel.in/content/www/in/en/architecture-and-technology/turbo-boost/turbo-boost-technology.html
[30] ” HWPM - Hardware Power Management,” https://software.intel.com/en-us/articles/intel-xeon-processor-e5-2600-v4-product-family-technical-overview
Keywords
DVFS, Clock-Gating, C-State, P-State