Search results - General AND reference Cross-computing tools AND techniques*
1
OptiWISE: Combining Sampling and Instrumentation for Granular CPI Analysis
ISSN: 2643-2838
Published: IEEE, 2 March 2024
Published in: Proceedings / International Symposium on Code Generation and Optimization (2 March 2024)
“… Existing profiling tools typically either sample hardware performance counters or instrument the program with extra instructions to analyze its execution …”
Conference proceeding
2
MLPerf Inference Benchmark
Published: IEEE, 1 May 2020
Published in: 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) (1 May 2020)
“… Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded …”
Conference proceeding
3
GPA: A GPU Performance Advisor Based on Instruction Sampling
Published: IEEE, 27 February 2021
Published in: 2021 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (27 February 2021)
“… Existing performance tools only provide coarse-grained tuning advice at the kernel level …”
Conference proceeding
4
Revamping Sampling-Based PGO with Context-Sensitivity and Pseudo-instrumentation
ISSN: 2643-2838
Published: IEEE, 2 March 2024
Published in: Proceedings / International Symposium on Code Generation and Optimization (2 March 2024)
“… The ever increasing scale of modern data center demands more effective optimizations, as even a small percentage of performance improvement can result in a …”
Conference proceeding
5
Performance Analysis with Bayesian Inference
ISBN: 9798350300406, 9798350300390
ISSN: 2832-7632
Published: IEEE, 1 May 2023
Published in: 2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER) (1 May 2023)
“… However, for non-statisticians, picking the right statistical tool to answer a research question can be challenging …”
Conference proceeding, book chapter
6
Performance Characterization of Popular DNN Models on Out-of-Order CPUs
Published: IEEE, 21 October 2023
Published in: 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21 October 2023)
“… However, the ubiquity of DNN models is rapidly extending the presence of this software to general-purpose CPUs …”
Conference proceeding
7
White-Box Performance-Influence Models: A Profiling and Learning Approach (Replication Package)
ISBN: 1665412194, 9781665412193
Published: IEEE, 1 May 2021
Published in: 2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion) (1 May 2021)
“… Specifically, we describe the general steps and tools that we have used to implement our approach, the data we have obtained, and the evaluation setup …”
Conference proceeding
8
The Architecture value engine: Measuring and delivering sustainable SoC improvement
ISSN: 1558-2434
Published: ACM, 1 November 2016
Published in: Digest of technical papers - IEEE/ACM International Conference on Computer-Aided Design (1 November 2016)
“… The value of semiconductor-based systems continues to increase rapidly especially when considering the cost associated with building it. As such, Moore's Law …”
Conference proceeding
9
Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations
Published: ACM, 1 December 2011
Published in: MICRO 44 : Proceedings of the 44th Annual IEEE/ACM Symposium on Microarchitecture, December 4 - 7, 2011 Porto Alegre, RS - Brazil (1 December 2011)
“… As much of the world's computing continues to move into the cloud, the overprovisioning of computing resources to ensure the performance isolation of …”
Conference proceeding
10
Top-Down Microarchitecture Analysis Approximation Based on Performance Counter Architecture for SiFive RISC-V Processors
Published: IEEE, 17 November 2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17 November 2024)
“… Modern Out-of-Order RISC-V CPUs have complex mechanisms, making microarchitecture-level performance analysis challenging. Despite increasing Performance …”
Conference proceeding
11
Multi-level Memory-Centric Profiling on ARM Processors with ARM SPE
Published: IEEE, 17 November 2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17 November 2024)
“… Many existing memory profiling tools leverage hardware performance counters and precise event sampling, such as Intel PEBS and AMD IBS, to achieve high accuracy and low overhead …”
Conference proceeding
12
Probing for Requirements Knowledge to Stimulate Architectural Thinking
ISSN: 1558-1225
Published: ACM, 1 May 2016
Published in: Proceedings / International Conference on Software Engineering (1 May 2016)
“… Software requirements specifications (SRSs) often lack the detail needed to make informed architectural decisions. Architects therefore either make …”
Conference proceeding
13
Ponte Vecchio Across the Atlantic: Single-Node Benchmarking of Two Intel GPU Systems
Published: IEEE, 17 November 2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17 November 2024)
“… Intel Data Center GPU Max 1550, known as Ponte Vecchio (PVC), is a new Intel GPU architecture for high-performance computing. It is the basis of two systems on …”
Conference proceeding
14
SLEMI: Equivalence Modulo Input (EMI) Based Mutation of CPS Models for Finding Compiler Bugs in Simulink
ISSN: 1558-1225
Published: ACM, 1 October 2020
Published in: 2020 IEEE/ACM 42nd International Conference on Software Engineering (ICSE) (1 October 2020)
“… Finding bugs in commercial cyber-physical system development tools (or "model-based design" tools …”
Conference proceeding
15
Exploiting Input Sanitization for Regex Denial of Service
ISSN: 1558-1225
Published: ACM, 1 May 2022
Published in: 2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE) (1 May 2022)
“… Web services use server-side input sanitization to guard against harmful input. Some web services publish their sanitization logic to make their client …”
Conference proceeding
16
Student research poster: A low complexity cache sharing mechanism to address system fairness
Published: ACM, 1 September 2016
Published in: 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (1 September 2016)
“… Shared caches have become, de facto, the common design choice in current multi-cores, ranging from embedded devices to high-performance processors. In these …”
Conference proceeding
17
White-Box Performance-Influence Models: A Profiling and Learning Approach
ISBN: 1665402962, 9781665402965
ISSN: 1558-1225
Published: IEEE, 1 May 2021
Published in: Proceedings / International Conference on Software Engineering (1 May 2021)
“… Many modern software systems are highly configurable, allowing the user to tune them for performance and more. Current performance modeling approaches aim at …”
Conference proceeding
18
Analyzing and Improving Resilience and Robustness of Autonomous Systems (Invited Paper)
ISSN: 1558-2434
Published: ACM, 29 October 2022
Published in: 2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD) (29 October 2022)
“… Autonomous systems have reached a tipping point, with a myriad of self-driving cars, unmanned aerial vehicles (UAVs), and robots being widely applied and …”
Conference proceeding
19
MemSeer: Leverage Memory Failure Distinctions and Multi-Grained Prediction in Ultra-Scale Heterogeneous X86/ARM Clusters
Published: IEEE, 22 June 2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22 June 2025)
“… We introduce MemSeer, an AIOps-integrated tool that utilizes a multi-grained memory failure prediction approach for x86/ARM heterogeneous clusters …”
Conference proceeding
20
LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services
Published: IEEE, 17 November 2024
Published in: SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17 November 2024)
“… As Large Language Models (LLMs) are rapidly growing in popularity, LLM inference services must be able to serve requests from thousands of users while …”
Conference proceeding

