TY - GEN
T1 - Capturing provenance as a diagnostic tool for workflow performance evaluation and optimization
AU - Pouchard, Line
AU - Malik, Abid
AU - Van Dam, Huub
AU - Xie, Cong
AU - Xu, Wei
AU - Van Dam, Kerstin Kleese
N1 - Publisher Copyright: © 2017 IEEE.
PY - 2017/10/25
Y1 - 2017/10/25
N2 - In extreme-scale computing environments such as the DOE Leadership Computing Facilities scientific workflows are routinely used to coordinate software processes for the execution of complex, computational applications that perform in-silico experiments. Monitoring the performance of workflows without also simultaneously tracking provenance is not sufficient to understand variations between runs, configurations, versions of a code, and between changes in an implemented stack, and systems, i.e. the variability of performance metrics data in their historical context. We take a provenance-based approach and demonstrate that provenance is useful as a tool for evaluating and optimizing workflow performance in extreme- scale HPC environments. We present Chimbuko, a framework for the analysis and visualization of the provenance of performance. Chimbuko implements a method for the evaluation of workflow performance from multiple components that enables the exploration of performance metrics data at scale.
AB - In extreme-scale computing environments such as the DOE Leadership Computing Facilities scientific workflows are routinely used to coordinate software processes for the execution of complex, computational applications that perform in-silico experiments. Monitoring the performance of workflows without also simultaneously tracking provenance is not sufficient to understand variations between runs, configurations, versions of a code, and between changes in an implemented stack, and systems, i.e. the variability of performance metrics data in their historical context. We take a provenance-based approach and demonstrate that provenance is useful as a tool for evaluating and optimizing workflow performance in extreme- scale HPC environments. We present Chimbuko, a framework for the analysis and visualization of the provenance of performance. Chimbuko implements a method for the evaluation of workflow performance from multiple components that enables the exploration of performance metrics data at scale.
KW - Chimbuko
KW - WFPP
KW - performance
KW - provenance
KW - scientific workflows
KW - workflow performance provenance ontology
UR - https://www.scopus.com/pages/publications/85040169514
U2 - 10.1109/NYSDS.2017.8085043
DO - 10.1109/NYSDS.2017.8085043
M3 - Conference contribution
T3 - 2017 New York Scientific Data Summit, NYSDS 2017 - Proceedings
BT - 2017 New York Scientific Data Summit, NYSDS 2017 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 New York Scientific Data Summit, NYSDS 2017
Y2 - 6 August 2017 through 9 August 2017
ER -