Search Results - "Process Management- Multiprocessing/Multiprogramming"
-
1
nvshare: Practical GPU Sharing Without Memory Size Constraints
ISSN: 2574-1934Published: ACM 14.04.2024Published in Proceedings (IEEE/ACM International Conference on Software Engineering Companion. Online) (14.04.2024)“…GPUs are essential for accelerating Machine Learning (ML) work-loads. A common practice is deploying ML jobs as containers managed by an orchestrator such as…”
Get full text
Conference Proceeding -
2
VISTA: Optimizing GPU Scheduling through Versatile Locality-Aware Data Sharing
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…Graphics Processing Units (GPUs) play a pivotal role in high-performance computing by utilizing the massive parallelism of concurrent thread execution to…”
Get full text
Conference Proceeding -
3
A Universal Method for Task Allocation on FP-FPS Multiprocessor Systems with Spin Locks
Published: IEEE 09.07.2023Published in 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)“…Many complex real-time systems, such as increasingly automated vehicles and 5G wireless base stations, contain a large amount of shared resources that must be…”
Get full text
Conference Proceeding -
4
SMiTe: Precise QoS Prediction on Real-System SMT Processors to Improve Utilization in Warehouse Scale Computers
ISSN: 1072-4451Published: IEEE 01.12.2014Published in 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture (01.12.2014)“…One of the key challenges for improving efficiency in warehouse scale computers (WSCs) is to improve server utilization while guaranteeing the quality of…”
Get full text
Conference Proceeding -
5
Analysis and approximation of optimal co-scheduling on Chip Multiprocessors
Published: ACM 25.10.2008Published in PACT'08 : proceedings of the Seventeenth International Conference on Parallel Architectures and Compilation Techniques : Toronto, Ontario, Canada, October 25-29, 2008 (25.10.2008)“…Cache sharing among processors is important for Chip Multiprocessors to reduce inter-thread latency, but also brings cache contention, degrading program…”
Get full text
Conference Proceeding -
6
Automatic Thread Extraction with Decoupled Software Pipelining
ISBN: 9780769524405, 0769524400ISSN: 1072-4451Published: Washington, DC, USA IEEE Computer Society 12.11.2005Published in 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'05) (12.11.2005)“…Until recently, a steadily rising clock rate and other uniprocessor microarchitectural improvements could be relied upon to consistently deliver increasing…”
Get full text
Conference Proceeding -
7
Rinnegan: Efficient resource use in heterogeneous architectures
Published: ACM 01.09.2016Published in 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (01.09.2016)“…Current processors provide a variety of different processing units to improve performance and power efficiency. For example, ARM's big.LITTLE, AMD's APUs, and…”
Get full text
Conference Proceeding -
8
Reducing cache coherence traffic with hierarchical directory cache and NUMA-aware runtime scheduling
Published: ACM 01.09.2016Published in 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (01.09.2016)“…Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provide for scaling core count and memory capacity. Also, the…”
Get full text
Conference Proceeding -
9
Scalability-based manycore partitioning
Published: ACM 19.09.2012Published in PACT'12 : proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques, September 19-23, Minneapolis, Minnesota, USA (19.09.2012)“…Multicore processors have been popular for years, and the industry is gradually shifting towards the era of manycore processors. Single-thread performance of…”
Get full text
Conference Proceeding -
10
Online scalability characterization of data-parallel programs on many cores
Published: ACM 01.09.2016Published in 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (01.09.2016)“…We present an accurate online scalability prediction model for data-parallel programs on NUMA many-core systems. Memory contention is considered to be the…”
Get full text
Conference Proceeding -
11
Energy efficient task scheduling on a multi-core platform using real-time energy measurements
Published: ACM 01.08.2014Published in 2014 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED) (01.08.2014)“…This paper presents a large advance in energy-efficient operating system multiprocessor task scheduling with experimentally proven benefits for standard Linux…”
Get full text
Conference Proceeding -
12
A Modular Simulation Framework for Spatial and Temporal Task Mapping onto Multi-Processor SoC Platforms
ISBN: 9780769522883, 0769522882ISSN: 1530-1591Published: Washington, DC, USA IEEE Computer Society 07.03.2005Published in Design, Automation and Test in Europe (07.03.2005)“…Heterogeneous Multi-Processor SoC platforms bear the potential to optimize conflicting performance, flexibility and energy efficiency constraints as imposed by…”
Get full text
Conference Proceeding -
13
Fairness and Throughput in Switch on Event Multithreading
ISBN: 0769527329, 9780769527321ISSN: 1072-4451Published: Washington, DC, USA IEEE Computer Society 09.12.2006Published in 2006 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06) (09.12.2006)“…The need to reduce power and complexity will increase the interest in Switch on Event multithreading (coarse grained multithreading). Switch On Event…”
Get full text
Conference Proceeding -
14
Spread-n-Share: Improving Application Performance and Cluster Throughput with Resource-aware Job Placement
ISSN: 2167-4337Published: ACM 17.11.2019Published in SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)“…Traditional batch job schedulers adopt the Compact-n-Exclusive (CE) strategy, packing processes of a parallel job into as few com-pute nodes as possible. While…”
Get full text
Conference Proceeding -
15
Multitasking workload scheduling on flexible-core chip multiprocessors
Published: ACM 01.10.2008Published in PACT'08 : proceedings of the Seventeenth International Conference on Parallel Architectures and Compilation Techniques : Toronto, Ontario, Canada, October 25-29, 2008 (01.10.2008)“…While technology trends have ushered in the age of chip multiprocessors (CMP), a fundamental question is what size to make each core. Most current commercial…”
Get full text
Conference Proceeding -
16
Continuous Path and Edge Profiling
ISBN: 9780769524405, 0769524400ISSN: 1072-4451Published: Washington, DC, USA IEEE Computer Society 12.11.2005Published in 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'05) (12.11.2005)“…Microarchitectures increasingly rely on dynamic optimization to improve performance in ways that are dif- ficult or impossible for ahead-of-time compilers…”
Get full text
Conference Proceeding -
17
Pinot: Speculative Multi-threading Processor Architecture Exploiting Parallelism over a Wide Range of Granularities
ISBN: 9780769524405, 0769524400ISSN: 1072-4451Published: Washington, DC, USA IEEE Computer Society 01.01.2005Published in 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'05) (01.01.2005)“…We propose a speculative multi-threading processor architecture called Pinot. Pinot exploits parallelism over a wide range of granularities without modifying…”
Get full text
Conference Proceeding -
18
RTOS Modeling for System Level Design
ISBN: 0769518702, 9780769518701ISSN: 1530-1591Published: Washington, DC, USA IEEE Computer Society 03.03.2003Published in Design, Automation, and Test in Europe: Proceedings of the conference on Design, Automation and Test in Europe - Volume 1; 03-07 Mar. 2003 (03.03.2003)“…System level synthesis is widely seen as the solution for closing the productivity gap in system design. High level system models are used in system level…”
Get full text
Conference Proceeding -
19
Low Cost Task Migration Initiation in a Heterogeneous MP-SoC
ISBN: 9780769522883, 0769522882ISSN: 1530-1591Published: Washington, DC, USA IEEE Computer Society 07.03.2005Published in Design, Automation and Test in Europe (07.03.2005)“…Run-time task migration in a heterogeneous multiprocessor System-on-Chip (MP-SoC) is a challenge that requires cooperation between the task and the operating…”
Get full text
Conference Proceeding -
20
Scheduling Hard-Real-Time Tasks with Backup Phasing Delay
ISBN: 9780769526973, 0769526977ISSN: 1550-6525Published: Washington, DC, USA IEEE Computer Society 02.10.2006Published in 2006 Tenth IEEE International Symposium on Distributed Simulation and Real-Time Applications (02.10.2006)“…This paper presents several fault-tolerant extensions of the Rate-Monotonic First-Fit multiprocessor scheduling algorithm handling both active and passive task…”
Get full text
Conference Proceeding

