Search results - General AND reference Cross-computing tools AND techniques*

  1.

    OptiWISE: Combining Sampling and Instrumentation for Granular CPI Analysis by Guo, Yuxin, Chadwick, Alex W., Erdos, Marton, Bora, Utpal, Vougioukas, Ilias, Gabrielli, Giacomo, Jones, Timothy M.

    ISSN: 2643-2838
    Published: IEEE, 2 March 2024
    “… Existing profiling tools typically either sample hardware performance counters or instrument the program with extra instructions to analyze its execution …”
    Full text
    Conference proceedings
  2.
  3.

    GPA: A GPU Performance Advisor Based on Instruction Sampling by Zhou, Keren, Meng, Xiaozhu, Sai, Ryuichi, Mellor-Crummey, John

    Published: IEEE, 27 February 2021
    “… Existing performance tools only provide coarse-grained tuning advice at the kernel level …”
    Full text
    Conference proceedings
  4.

    Revamping Sampling-Based PGO with Context-Sensitivity and Pseudo-instrumentation by He, Wenlei, Yu, Hongtao, Wang, Lei, Oh, Taewook

    ISSN: 2643-2838
    Published: IEEE, 2 March 2024
    “… The ever increasing scale of modern data center demands more effective optimizations, as even a small percentage of performance improvement can result in a …”
    Full text
    Conference proceedings
  5.

    Performance Analysis with Bayesian Inference by Couderc, Noric, Reichenbach, Christoph, Soderberg, Emma

    ISBN: 9798350300406, 9798350300390
    ISSN: 2832-7632
    Published: IEEE, 1 May 2023
    “… However, for non-statisticians, picking the right statistical tool to answer a research question can be challenging …”
    Full text
    Conference proceedings, book chapter
  6.

    Performance Characterization of Popular DNN Models on Out-of-Order CPUs by Prieto, Pablo, Abad, Pablo, Gregorio, Jose Angel, Puente, Valentin

    Published: IEEE, 21 October 2023
    “… However, the ubiquity of DNN models is rapidly extending the presence of this software to general-purpose CPUs …”
    Full text
    Conference proceedings
  7.

    White-Box Performance-Influence Models: A Profiling and Learning Approach (Replication Package) by Weber, Max, Apel, Sven, Siegmund, Norbert

    ISBN: 1665412194, 9781665412193
    Published: IEEE, 1 May 2021
    “… Specifically, we describe the general steps and tools that we have used to implement our approach, the data we have obtained, and the evaluation setup …”
    Full text
    Conference proceedings
  8.

    The Architecture value engine: Measuring and delivering sustainable SoC improvement by Carballo, Juan-Antonio, Bangqi Xu

    ISSN: 1558-2434
    Published: ACM, 1 November 2016
    “… The value of semiconductor-based systems continues to increase rapidly especially when considering the cost associated with building it. As such, Moore's Law …”
    Full text
    Conference proceedings
  9.

    Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations by Mars, Jason, Lingjia Tang, Hundt, Robert, Skadron, Kevin, Soffa, Mary Lou

    Published: ACM, 1 December 2011
    “… As much of the world's computing continues to move into the cloud, the overprovisioning of computing resources to ensure the performance isolation of …”
    Full text
    Conference proceedings
  10.

    Top-Down Microarchitecture Analysis Approximation Based on Performance Counter Architecture for SiFive RISC-V Processors by Mou, Chan-Yu, Hsiao, Chao-Chieh, Chou, Jerry

    Published: IEEE, 17 November 2024
    “… Modern Out-of-Order RISC-V CPUs have complex mechanisms, making microarchitecture-level performance analysis challenging. Despite increasing Performance …”
    Full text
    Conference proceedings
  11.

    Multi-level Memory-Centric Profiling on ARM Processors with ARM SPE by Miksits, Samuel, Shi, Ruimin, Gokhale, Maya, Wahlgren, Jacob, Schieffer, Gabin, Peng, Ivy

    Published: IEEE, 17 November 2024
    “… Many existing memory profiling tools leverage hardware performance counters and precise event sampling, such as Intel PEBS and AMD IBS, to achieve high accuracy and low overhead …”
    Full text
    Conference proceedings
  12.

    Probing for Requirements Knowledge to Stimulate Architectural Thinking by Anish, Preethu Rose, Balasubramaniam, Balaji, Sainani, Abhishek, Cleland-Huang, Jane, Daneva, Maya, Wieringa, Roel J., Ghaisas, Smita

    ISSN: 1558-1225
    Published: ACM, 1 May 2016
    “… Software requirements specifications (SRSs) often lack the detail needed to make informed architectural decisions. Architects therefore either make …”
    Full text
    Conference proceedings
  13.

    Ponte Vecchio Across the Atlantic: Single-Node Benchmarking of Two Intel GPU Systems by Applencourt, Thomas, Sadawarte, Aditya, Muralidharan, Servesh, Bertoni, Colleen, Kwack, JaeHyuk, Luo, Ye, Rangel, Esteban, Tramm, John, Ghadar, Yasaman, Tamerus, Arjen, Edsall, Chris, Deakin, Tom

    Published: IEEE, 17 November 2024
    “… Intel Data Center GPU Max 1550, known as Ponte Vecchio (PVC), is a new Intel GPU architecture for high-performance computing. It is the basis of two systems on …”
    Full text
    Conference proceedings
  14.

    SLEMI: Equivalence Modulo Input (EMI) Based Mutation of CPS Models for Finding Compiler Bugs in Simulink by Chowdhury, Shafiul Azam, Shrestha, Sohil Lal, Johnson, Taylor T., Csallner, Christoph

    ISSN: 1558-1225
    Published: ACM, 1 October 2020
    “… Finding bugs in commercial cyber-physical system development tools (or "model-based design" tools …”
    Full text
    Conference proceedings
  15.

    Exploiting Input Sanitization for Regex Denial of Service by Barlas, Efe, Du, Xin, Davis, James C.

    ISSN: 1558-1225
    Published: ACM, 1 May 2022
    “… Web services use server-side input sanitization to guard against harmful input. Some web services publish their sanitization logic to make their client …”
    Full text
    Conference proceedings
  16.

    Student research poster: A low complexity cache sharing mechanism to address system fairness by Selfa, Vicent, Sahuquillo, Julio, Petit, Salvador, Gomez, Maria E.

    Published: ACM, 1 September 2016
    “… Shared caches have become, de facto, the common design choice in current multi-cores, ranging from embedded devices to high-performance processors. In these …”
    Full text
    Conference proceedings
  17.

    White-Box Performance-Influence Models: A Profiling and Learning Approach by Weber, Max, Apel, Sven, Siegmund, Norbert

    ISBN: 1665402962, 9781665402965
    ISSN: 1558-1225
    Published: IEEE, 1 May 2021
    “… Many modern software systems are highly configurable, allowing the user to tune them for performance and more. Current performance modeling approaches aim at …”
    Full text
    Conference proceedings
  18.

    Analyzing and Improving Resilience and Robustness of Autonomous Systems (Invited Paper) by Wan, Zishen, Swaminathan, Karthik, Chen, Pin-Yu, Chandramoorthy, Nandhini, Raychowdhury, Arijit

    ISSN: 1558-2434
    Published: ACM, 29 October 2022
    “… Autonomous systems have reached a tipping point, with a myriad of self-driving cars, unmanned aerial vehicles (UAVs), and robots being widely applied and …”
    Full text
    Conference proceedings
  19.

    MemSeer: Leverage Memory Failure Distinctions and Multi-Grained Prediction in Ultra-Scale Heterogeneous X86/ARM Clusters by Gu, Yunfei, Liu, Yixuan, Wu, Xinyuan, Shao, Bo, Wu, Chentao, Li, Shiyi, Zhao, Jieru, Li, Jie, Guo, Minyi, Yang, Kunlin, Zhang, Wengui, Lin, Feilong

    Published: IEEE, 22 June 2025
    “… We introduce MemSeer, an AIOps-integrated tool that utilizes a multi-grained memory failure prediction approach for x86/ARM heterogeneous clusters …”
    Full text
    Conference proceedings
  20.

    LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services by Lazuka, Malgorzata, Anghel, Andreea, Parnell, Thomas

    Published: IEEE, 17 November 2024
    “… As Large Language Models (LLMs) are rapidly growing in popularity, LLM inference services must be able to serve requests from thousands of users while …”
    Full text
    Conference proceedings