Search Results - "Theory of computation Models of computation Concurrency Parallel computing models"
1
On Thin Air Reads Towards an Event Structures Model of Relaxed Memory
ISBN: 9781450343916, 1450343910
Published: New York, NY, USA ACM 05.07.2016
Published in Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)
“…This is the first paper to propose a pure event structures model of relaxed memory. We propose confusion-free event structures over an alphabet with a…”
Conference Proceeding -
2
Managing GPU Concurrency in Heterogeneous Architectures
ISSN: 1072-4451
Published: IEEE 01.12.2014
Published in 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture (01.12.2014)
“…Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPUs are projected to be the dominant computing platforms for many…”
Conference Proceeding -
3
In search of lost time: Axiomatising parallel composition in process algebras
Published: IEEE 29.06.2021
Published in Proceedings of the 36th Annual ACM/IEEE Symposium on Logic in Computer Science (29.06.2021)
“…This survey reviews some of the most recent achievements in the saga of the axiomatisation of parallel composition, along with some classic results. We focus…”
Conference Proceeding -
4
Optimizing indirect memory references with milk
Published: ACM 01.09.2016
Published in 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (01.09.2016)
“…Modern applications such as graph and data analytics, when operating on real world data, have working sets much larger than cache capacity and are bottlenecked…”
Conference Proceeding -
5
Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme
ISSN: 2167-4337
Published: ACM 14.11.2021
Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“…Computing derivatives is key to many algorithms in scientific computing and machine learning such as optimization, uncertainty quantification, and stability…”
Conference Proceeding -
6
Virtualizing Transactional Memory
ISBN: 076952270X, 9780769522708
ISSN: 1063-6897
Published: Washington, DC, USA IEEE Computer Society 01.05.2005
Published in 32nd International Symposium on Computer Architecture (ISCA'05) (01.05.2005)
“…Writing concurrent programs is difficult because of the complexity of ensuring proper synchronization. Conventional lock-based synchronization suffers from…”
Conference Proceeding -
7
Flexible Decoupled Transactional Memory Support
ISBN: 9780769531748, 0769531741
ISSN: 1063-6897
Published: Washington, DC, USA IEEE Computer Society 01.06.2008
Published in 2008 International Symposium on Computer Architecture (01.06.2008)
“…A high-concurrency transactional memory (TM) implementation needs to track concurrent accesses, buffer speculative updates, and manage conflicts. We present a…”
Conference Proceeding -
8
Bandwidth Bandit: Quantitative characterization of memory contention
ISBN: 9781467355247, 1467355240
Published: Washington, DC, USA IEEE Computer Society 23.02.2013
Published in 2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (23.02.2013)
“…On multicore processors, co-executing applications compete for shared resources, such as cache capacity and memory bandwidth. This leads to suboptimal resource…”
Conference Proceeding -
9
Fast Track: A Software System for Speculative Program Optimization
ISBN: 9780769535760, 0769535763
Published: Washington, DC, USA IEEE Computer Society 22.03.2009
Published in Code Generation and Optimization, Proceedings (22.03.2009)
“…Fast track is a software speculation system that enables unsafe optimization of sequential code. It speculatively runs optimized code to improve performance…”
Conference Proceeding -
10
Quantitative Reasoning for Proving Lock-Freedom
ISBN: 1479904139, 9781479904136
ISSN: 1043-6871
Published: IEEE 01.06.2013
Published in 2013 28th Annual ACM/IEEE Symposium on Logic in Computer Science (01.06.2013)
“…This article describes a novel quantitative proof technique for the modular and local verification of lock-freedom. In contrast to proofs based on temporal…”
Conference Proceeding -
11
Alchemist: A Transparent Dependence Distance Profiling Infrastructure
ISBN: 9780769535760, 0769535763
Published: Washington, DC, USA IEEE Computer Society 22.03.2009
Published in Code Generation and Optimization, Proceedings (22.03.2009)
“…Effectively migrating sequential applications to take advantage of parallelism available on multicore platforms is a well-recognized challenge. This paper…”
Conference Proceeding -
12
Parallel random numbers: as easy as 1, 2, 3
ISBN: 145030771X, 9781450307710
ISSN: 2167-4329
Published: New York, NY, USA ACM 12.11.2011
Published in 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)
“…Most pseudorandom number generators (PRNGs) scale poorly to massively parallel high-performance computation because they are designed as sequentially dependent…”
Conference Proceeding -
13
ILP and TLP in shared memory applications: A limit study
Published: ACM 01.08.2014
Published in PACT '14 : proceedings of the 23rd International Conference on Parallel Architectures and Compilation Techniques : August 24-27, 2014, Edmonton, AB, Canada (01.08.2014)
“…With the breakdown of Dennard scaling, future processor designs will be at the mercy of power limits as Chip MultiProcessor (CMP) designs scale out to…”
Conference Proceeding -
14
Using Hardware Memory Protection to Build a High-Performance, Strongly-Atomic Hybrid Transactional Memory
ISBN: 9780769531748, 0769531741
ISSN: 1063-6897
Published: Washington, DC, USA IEEE Computer Society 01.06.2008
Published in 2008 International Symposium on Computer Architecture (01.06.2008)
“…We demonstrate how fine-grained memory protection can be used in support of transactional memory systems: first showing how a software transactional memory…”
Conference Proceeding -
15
Decidability and Complexity for Quiescent Consistency
ISBN: 9781450343916, 1450343910
Published: New York, NY, USA ACM 05.07.2016
Published in Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)
“…Quiescent consistency is a notion of correctness for a concurrent object that gives meaning to the object's behaviours in quiescent states, i.e., states in…”
Conference Proceeding -
16
Communication-Sensitive Static Dataflow for Parallel Message Passing Applications
ISBN: 9780769535760, 0769535763
Published: Washington, DC, USA IEEE Computer Society 22.03.2009
Published in Code Generation and Optimization, Proceedings (22.03.2009)
“…Message passing is a very popular style of parallel programming, used in a wide variety of applications and supported by many APIs, such as BSD sockets, MPI…”
Conference Proceeding -
17
Shadow Profiling: Hiding Instrumentation Costs with Parallelism
ISBN: 9780769527642, 0769527647
Published: Washington, DC, USA IEEE Computer Society 11.03.2007
Published in Proceedings of the International Symposium on Code Generation and Optimization (11.03.2007)
“…In profiling, a tradeoff exists between information and overhead. For example, hardware-sampling profilers incur negligible overhead, but the information they…”
Conference Proceeding -
18
Complexity-effective superscalar processors
ISBN: 9780897919012, 0897919017
ISSN: 1063-6897
Published: New York, NY, USA ACM 01.01.1997
Published in Conference Proceedings. The 24th Annual International Symposium on Computer Architecture (01.01.1997)
“…The performance tradeoff between hardware complexity and clock speed is studied. First, a generic superscalar pipeline is defined. Then the specific areas of…”
Conference Proceeding -
19
PGX.D: a fast distributed graph processing engine
ISBN: 1450337236, 9781450337236
ISSN: 2167-4337
Published: New York, NY, USA ACM 15.11.2015
Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (15.11.2015)
“…Graph analysis is a powerful method in data analysis. Although several frameworks have been proposed for processing large graph instances in distributed…”
Conference Proceeding -
20
A high-level synthesis flow for the implementation of iterative stencil loop algorithms on FPGA devices
ISBN: 1450320716, 9781450320719
ISSN: 0738-100X
Published: New York, NY, USA ACM 29.05.2013
Published in 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC) (29.05.2013)
“…The automatic generation of hardware implementations for a given algorithm is generally a difficult task, especially when data dependencies span across…”
Conference Proceeding

