Probabilistic Timing Estimates in Scenarios Under Testing Constraints

Saved in:
Bibliographic Details
Title: Probabilistic Timing Estimates in Scenarios Under Testing Constraints
Authors: Sergi Vilardell, Francesco Rossi, Gabriele Giordana, Isabel Serra, Enrico Mezzetti, Jaume Abella, Francisco J. Cazorla
Source: UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Publisher Information: ACM, 2025.
Publication Year: 2025
Subject Terms: Probability and statistics, Computer systems organization, Àrees temàtiques de la UPC::Informàtica::Enginyeria del software, Embedded systems, Validation, Verification, General and reference, Reliability, Mathematics of computing
Description: Measurement-based probabilistic (MBP) methods like Extreme Value Theory (EVT) and the Markov's Inequality have been exploited to derive probabilistic Worst-Case Execution Time (pWCET) estimates. Usually, the reliability and accuracy of pWCET techniques have been evaluated on medium to large sample sizes, N = [103, 105]. However, several works increasingly advocate for containing the cost of carrying out the test campaign by reducing the number of executions (i.e. the sample size) required by pWCET analysis. Specific scenarios, for example, impose inherent limitations on the collection of timing measurements due to cost and availability of appropriate testing facilities. In this work, we analyze the impact of small sample sizes on MBP. Our analysis shows that classical EVT models for tail estimation require a threshold that estimates where the tail of the distribution begins. In low sample scenarios, the uncertainty in determining this threshold can compromise the reliability of EVT estimates. We also assess the impact of small samples on RESTK, a time forecast method based on Markov's Inequality. Our results with synthetic data and representative kernels show that RESTK provides the best trade-off in terms of trustworthiness and tightness for small samples, partly due to not relying on the selection of any threshold, as opposed to EVT.
The research leading to these results has received funding from the European Union’s Horizon Europe Programme under the SAF- EXPLAIN Project (www.safeexplain.eu), grant agreement num. 101069595. Authors also appreciate the support given to the Re- search Group SSAS (Code: 2021 SGR 00637) by the Research and University Department of the Generalitat de Catalunya.
Document Type: Article
Conference object
File Description: application/pdf
DOI: 10.1145/3672608.3707895
Rights: CC BY
URL: https://www.acm.org/publications/policies/copyright_policy#Background
Accession Number: edsair.doi.dedup.....44c9c828cd678d071db719d0ca50f09e
Database: OpenAIRE
Description
Abstract:Measurement-based probabilistic (MBP) methods like Extreme Value Theory (EVT) and the Markov's Inequality have been exploited to derive probabilistic Worst-Case Execution Time (pWCET) estimates. Usually, the reliability and accuracy of pWCET techniques have been evaluated on medium to large sample sizes, N = [103, 105]. However, several works increasingly advocate for containing the cost of carrying out the test campaign by reducing the number of executions (i.e. the sample size) required by pWCET analysis. Specific scenarios, for example, impose inherent limitations on the collection of timing measurements due to cost and availability of appropriate testing facilities. In this work, we analyze the impact of small sample sizes on MBP. Our analysis shows that classical EVT models for tail estimation require a threshold that estimates where the tail of the distribution begins. In low sample scenarios, the uncertainty in determining this threshold can compromise the reliability of EVT estimates. We also assess the impact of small samples on RESTK, a time forecast method based on Markov's Inequality. Our results with synthetic data and representative kernels show that RESTK provides the best trade-off in terms of trustworthiness and tightness for small samples, partly due to not relying on the selection of any threshold, as opposed to EVT.<br />The research leading to these results has received funding from the European Union’s Horizon Europe Programme under the SAF- EXPLAIN Project (www.safeexplain.eu), grant agreement num. 101069595. Authors also appreciate the support given to the Re- search Group SSAS (Code: 2021 SGR 00637) by the Research and University Department of the Generalitat de Catalunya.
DOI:10.1145/3672608.3707895