TProf: An energy profiler for task-parallel programs

Bibliographic Details
Published in: Sustainable Computing: Informatics and Systems, Vol. 5, pp. 1-13
Main Authors: Manousakis, Ioannis; Zakkak, Foivos S.; Pratikakis, Polyvios; Nikolopoulos, Dimitrios S.
Format: Journal Article
Language: English
Published: Elsevier Inc, 1 March 2015
ISSN: 2210-5379
Abstract: We present TProf, an energy profiling tool for OpenMP-like task-parallel programs. To compute the energy consumed by each task in a parallel application, TProf dynamically traces the parallel execution and uses a novel technique to estimate the per-task energy consumption. To achieve this estimation, TProf apportions the total processor energy among cores and overcomes the limitation of current works which would otherwise make parallel accounting impossible to achieve. We demonstrate the value of TProf by characterizing a set of task-parallel programs, where we find that data locality, memory access patterns and task working sets are responsible for significant variance in energy consumption between seemingly homogeneous tasks. In addition, we identify opportunities for fine-grain energy optimization by applying per-task Dynamic Voltage and Frequency Scaling (DVFS).
Author 1: Manousakis, Ioannis (im159@cs.rutgers.edu), Department of Computer Science, Rutgers University, United States
Author 2: Zakkak, Foivos S. (zakkak@ics.forth.gr), Institute of Computer Science, Foundation for Research and Technology, Hellas, Greece
Author 3: Pratikakis, Polyvios (polyvios@ics.forth.gr), Institute of Computer Science, Foundation for Research and Technology, Hellas, Greece
Author 4: Nikolopoulos, Dimitrios S. (d.nikolopoulos@qub.ac.uk; ORCID 0000-0003-0217-8307), School of Electronics, Electrical Engineering and Computer Science, Queen's University of Belfast, United Kingdom
Copyright: 2014
DOI: 10.1016/j.suscom.2014.07.004
Discipline: Computer Science
Keywords: Parallel programming models; Energy profiling; Task parallelism; Parallel runtime systems
URI: https://dx.doi.org/10.1016/j.suscom.2014.07.004