Search Results - "Theory of computation Models of computation Concurrency Parallel computing models"

1

Loading…

On Thin Air Reads Towards an Event Structures Model of Relaxed Memory by Jeffrey, Alan, Riely, James

ISBN: 9781450343916, 1450343910

Published: New York, NY, USA ACM 05.07.2016

Published in Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)
“…This is the first paper to propose a pure event structures model of relaxed memory. We propose confusion-free event structures over an alphabet with a…”

Get full text

Conference Proceeding

Save to List

Saved in:
2

Loading…

Managing GPU Concurrency in Heterogeneous Architectures by Kayiran, Onur, Nachiappan, Nachiappan Chidambaram, Jog, Adwait, Ausavarungnirun, Rachata, Kandemir, Mahmut T., Loh, Gabriel H., Mutlu, Onur, Das, Chita R.

ISSN: 1072-4451

Published: IEEE 01.12.2014

Published in 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture (01.12.2014)
“…Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPUs are projected to be the dominant computing platforms for many…”

Get full text

Conference Proceeding

Save to List

Saved in:
3

Loading…

In search of lost time: Axiomatising parallel composition in process algebras by Aceto, Luca, Anastasiadi, Elli, Castiglioni, Valentina, Ingolfsdottir, Anna, Luttik, Bas

Published: IEEE 29.06.2021

Published in Proceedings of the 36th Annual ACM/IEEE Symposium on Logic in Computer Science (29.06.2021)
“…This survey reviews some of the most recent achievements in the saga of the axiomatisation of parallel composition, along with some classic results. We focus…”

Get full text

Conference Proceeding

Save to List

Saved in:
4

Loading…

Optimizing indirect memory references with milk by Kiriansky, Vladimir, Zhang, Yunming, Amarasinghe, Saman

Published: ACM 01.09.2016

Published in 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (01.09.2016)
“…Modern applications such as graph and data analytics, when operating on real world data, have working sets much larger than cache capacity and are bottlenecked…”

Get full text

Conference Proceeding

Save to List

Saved in:
5

Loading…

Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme by Moses, William S., Churavy, Valentin, Paehler, Ludger, Huckelheim, Jan, Narayanan, Sri Hari Krishna, Schanen, Michel, Doerfert, Johannes

ISSN: 2167-4337

Published: ACM 14.11.2021

Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“…Computing derivatives is key to many algorithms in scientific computing and machine learning such as optimization, uncertainty quantification, and stability…”

Get full text

Conference Proceeding

Save to List

Saved in:
6

Loading…

Virtualizing Transactional Memory by Rajwar, Ravi, Herlihy, Maurice, Lai, Konrad

ISBN: 076952270X, 9780769522708

ISSN: 1063-6897

Published: Washington, DC, USA IEEE Computer Society 01.05.2005

Published in 32nd International Symposium on Computer Architecture (ISCA'05) (01.05.2005)
“…Writing concurrent programs is difficult because of the complexity of ensuring proper synchronization. Conventional lock-based synchronization suffers from…”

Get full text

Conference Proceeding

Save to List

Saved in:
7

Loading…

Flexible Decoupled Transactional Memory Support by Shriraman, Arrvindh, Dwarkadas, Sandhya, Scott, Michael L.

ISBN: 9780769531748, 0769531741

ISSN: 1063-6897

Published: Washington, DC, USA IEEE Computer Society 01.06.2008

Published in 2008 International Symposium on Computer Architecture (01.06.2008)
“…A high-concurrency transactional memory (TM) implementation needs to track concurrent accesses, buffer speculative updates, and manage conflicts. We present a…”

Get full text

Conference Proceeding

Save to List

Saved in:
8

Loading…

Bandwidth Bandit: Quantitative characterization of memory contention by Black-Schaffer, David, Nikoleris, Nikos, Hagersten, Erik, Eklov, David

ISBN: 9781467355247, 1467355240

Published: Washington, DC, USA IEEE Computer Society 23.02.2013

Published in 2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (23.02.2013)
“…On multicore processors, co-executing applications compete for shared resources, such as cache capacity and memory bandwidth. This leads to suboptimal resource…”

Get full text

Conference Proceeding

Save to List

Saved in:
9

Loading…

Fast Track: A Software System for Speculative Program Optimization by Kelsey, Kirk, Bai, Tongxin, Ding, Chen, Zhang, Chengliang

ISBN: 9780769535760, 0769535763

Published: Washington, DC, USA IEEE Computer Society 22.03.2009

Published in Code Generation and Optimization, Proceedings (22.03.2009)
“…Fast track is a software speculation system that enables unsafe optimization of sequential code. It speculatively runs optimized code to improve performance…”

Get full text

Conference Proceeding

Save to List

Saved in:
10

Loading…

Quantitative Reasoning for Proving Lock-Freedom by Hoffmann, Jan, Marmar, Michael, Zhong Shao

ISBN: 1479904139, 9781479904136

ISSN: 1043-6871

Published: IEEE 01.06.2013

Published in 2013 28th Annual ACM/IEEE Symposium on Logic in Computer Science (01.06.2013)
“…This article describes a novel quantitative proof technique for the modular and local verification of lock-freedom. In contrast to proofs based on temporal…”

Get full text

Conference Proceeding

Save to List

Saved in:
11

Loading…

Alchemist: A Transparent Dependence Distance Profiling Infrastructure by Zhang, Xiangyu, Navabi, Armand, Jagannathan, Suresh

ISBN: 9780769535760, 0769535763

Published: Washington, DC, USA IEEE Computer Society 22.03.2009

Published in Code Generation and Optimization, Proceedings (22.03.2009)
“…Effectively migrating sequential applications to take advantage of parallelism available on multicore platforms is a well-recognized challenge. This paper…”

Get full text

Conference Proceeding

Save to List

Saved in:
12

Loading…

Parallel random numbers: as easy as 1, 2, 3 by Salmon, John K., Moraes, Mark A., Dror, Ron O., Shaw, David E.

ISBN: 145030771X, 9781450307710

ISSN: 2167-4329

Published: New York, NY, USA ACM 12.11.2011

Published in 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)
“…Most pseudorandom number generators (PRNGs) scale poorly to massively parallel high-performance computation because they are designed as sequentially dependent…”

Get full text

Conference Proceeding

Save to List

Saved in:
13

Loading…

ILP and TLP in shared memory applications: A limit study by Fatehi, Ehsan, Gratz, Paul V.

Published: ACM 01.08.2014

Published in PACT '14 : proceedings of the 23rd International Conference on Parallel Architectures and Compilation Techniques : August 24-27, 2014, Edmonton, AB, Canada (01.08.2014)
“…With the breakdown of Dennard scaling, future processor designs will be at the mercy of power limits as Chip MultiProcessor (CMP) designs scale out to…”

Get full text

Conference Proceeding

Save to List

Saved in:
14

Loading…

Using Hardware Memory Protection to Build a High-Performance, Strongly-Atomic Hybrid Transactional Memory by Baugh, Lee, Neelakantam, Naveen, Zilles, Craig

ISBN: 9780769531748, 0769531741

ISSN: 1063-6897

Published: Washington, DC, USA IEEE Computer Society 01.06.2008

Published in 2008 International Symposium on Computer Architecture (01.06.2008)
“…We demonstrate how fine-grained memory protection can be used in support of transactional memory systems: first showing how a software transactional memory…”

Get full text

Conference Proceeding

Save to List

Saved in:
15

Loading…

Decidability and Complexity for Quiescent Consistency by Dongol, Brijesh, Hierons, Robert M.

ISBN: 9781450343916, 1450343910

Published: New York, NY, USA ACM 05.07.2016

Published in Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)
“…Quiescent consistency is a notion of correctness for a concurrent object that gives meaning to the object's behaviours in quiescent states, i.e., states in…”

Get full text

Conference Proceeding

Save to List

Saved in:
16

Loading…

Communication-Sensitive Static Dataflow for Parallel Message Passing Applications by Bronevetsky, Greg

ISBN: 9780769535760, 0769535763

Published: Washington, DC, USA IEEE Computer Society 22.03.2009

Published in Code Generation and Optimization, Proceedings (22.03.2009)
“…Message passing is a very popular style of parallel programming, used in a wide variety of applications and supported by many APIs, such as BSD sockets, MPI…”

Get full text

Conference Proceeding

Save to List

Saved in:
17

Loading…

Shadow Profiling: Hiding Instrumentation Costs with Parallelism by Moseley, Tipp, Shye, Alex, Reddi, Vijay Janapa, Grunwald, Dirk, Peri, Ramesh

ISBN: 9780769527642, 0769527647

Published: Washington, DC, USA IEEE Computer Society 11.03.2007

Published in Proceedings of the International Symposium on Code Generation and Optimization (11.03.2007)
“…In profiling, a tradeoff exists between information and overhead. For example, hardware-sampling profilers incur negligible overhead, but the information they…”

Get full text

Conference Proceeding

Save to List

Saved in:
18

Loading…

Complexity-effective superscalar processors by Palacharla, Subbarao, Jouppi, Norman P., Smith, J. E.

ISBN: 9780897919012, 0897919017

ISSN: 1063-6897

Published: New York, NY, USA ACM 01.01.1997

Published in Conference Proceedings. The 24th Annual International Symposium on Computer Architecture (01.01.1997)
“…The performance tradeoff between hardware complexity and clock speed is studied. First, a generic superscalar pipeline is defined. Then the specific areas of…”

Get full text

Conference Proceeding

Save to List

Saved in:
19

Loading…

PGX.D: a fast distributed graph processing engine by Hong, Sungpack, Depner, Siegfried, Manhardt, Thomas, Van Der Lugt, Jan, Verstraaten, Merijn, Chafi, Hassan

ISBN: 1450337236, 9781450337236

ISSN: 2167-4337

Published: New York, NY, USA ACM 15.11.2015

Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (15.11.2015)
“…Graph analysis is a powerful method in data analysis. Although several frameworks have been proposed for processing large graph instances in distributed…”

Get full text

Conference Proceeding

Save to List

Saved in:
20

Loading…

A high-level synthesis flow for the implementation of iterative stencil loop algorithms on FPGA devices by Nacci, Alessandro Antonio, Rana, Vincenzo, Bruschi, Francesco, Sciuto, Donatella, Beretta, Ivan, Atienza, David

ISBN: 1450320716, 9781450320719

ISSN: 0738-100X

Published: New York, NY, USA ACM 29.05.2013

Published in 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC) (29.05.2013)
“…The automatic generation of hardware implementations for a given algorithm is generally a difficult task, especially when data dependencies span across…”

Get full text

Conference Proceeding

Save to List

Saved in:

Search Results - "Theory of computation Models of computation Concurrency Parallel computing models"

On Thin Air Reads Towards an Event Structures Model of Relaxed Memory by Jeffrey, Alan, Riely, James

Managing GPU Concurrency in Heterogeneous Architectures by Kayiran, Onur, Nachiappan, Nachiappan Chidambaram, Jog, Adwait, Ausavarungnirun, Rachata, Kandemir, Mahmut T., Loh, Gabriel H., Mutlu, Onur, Das, Chita R.

In search of lost time: Axiomatising parallel composition in process algebras by Aceto, Luca, Anastasiadi, Elli, Castiglioni, Valentina, Ingolfsdottir, Anna, Luttik, Bas

Optimizing indirect memory references with milk by Kiriansky, Vladimir, Zhang, Yunming, Amarasinghe, Saman

Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme by Moses, William S., Churavy, Valentin, Paehler, Ludger, Huckelheim, Jan, Narayanan, Sri Hari Krishna, Schanen, Michel, Doerfert, Johannes

Virtualizing Transactional Memory by Rajwar, Ravi, Herlihy, Maurice, Lai, Konrad

Flexible Decoupled Transactional Memory Support by Shriraman, Arrvindh, Dwarkadas, Sandhya, Scott, Michael L.

Bandwidth Bandit: Quantitative characterization of memory contention by Black-Schaffer, David, Nikoleris, Nikos, Hagersten, Erik, Eklov, David

Fast Track: A Software System for Speculative Program Optimization by Kelsey, Kirk, Bai, Tongxin, Ding, Chen, Zhang, Chengliang

Quantitative Reasoning for Proving Lock-Freedom by Hoffmann, Jan, Marmar, Michael, Zhong Shao

Alchemist: A Transparent Dependence Distance Profiling Infrastructure by Zhang, Xiangyu, Navabi, Armand, Jagannathan, Suresh

Parallel random numbers: as easy as 1, 2, 3 by Salmon, John K., Moraes, Mark A., Dror, Ron O., Shaw, David E.

ILP and TLP in shared memory applications: A limit study by Fatehi, Ehsan, Gratz, Paul V.

Using Hardware Memory Protection to Build a High-Performance, Strongly-Atomic Hybrid Transactional Memory by Baugh, Lee, Neelakantam, Naveen, Zilles, Craig

Decidability and Complexity for Quiescent Consistency by Dongol, Brijesh, Hierons, Robert M.

Communication-Sensitive Static Dataflow for Parallel Message Passing Applications by Bronevetsky, Greg

Shadow Profiling: Hiding Instrumentation Costs with Parallelism by Moseley, Tipp, Shye, Alex, Reddi, Vijay Janapa, Grunwald, Dirk, Peri, Ramesh

Complexity-effective superscalar processors by Palacharla, Subbarao, Jouppi, Norman P., Smith, J. E.

PGX.D: a fast distributed graph processing engine by Hong, Sungpack, Depner, Siegfried, Manhardt, Thomas, Van Der Lugt, Jan, Verstraaten, Merijn, Chafi, Hassan

A high-level synthesis flow for the implementation of iterative stencil loop algorithms on FPGA devices by Nacci, Alessandro Antonio, Rana, Vincenzo, Bruschi, Francesco, Sciuto, Donatella, Beretta, Ivan, Atienza, David

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication