Výsledky vyhledávání - single‐instruction‐multiple‐threads pattern

1

Načítá se…

Fully GPU-based electromagnetic transient simulation considering large-scale control systems for system-level studies Autor Song, Yankan, Chen, Ying, Huang, Shaowei, Xu, Yin, Yu, Zhitong, Marti, Jose R

ISSN: 1751-8687, 1751-8695

Vydáno: The Institution of Engineering and Technology 03.08.2017

Vydáno v IET generation, transmission & distribution (03.08.2017)
“…As more generators and loads are integrated by power electronic converters with complicated controls, electromagnetic transients (EMTs) simulation becomes an…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
2

Načítá se…

A Quantitative Method to Data Reuse Patterns of SIMT Applications Autor Lai, Bo-Cheng Charles, Garrido Platero, Luis, Hsien-Kai Kuo

ISSN: 1556-6056, 1556-6064

Vydáno: New York IEEE 01.07.2016

Vydáno v IEEE computer architecture letters (01.07.2016)
“… The emerging Single Instruction Multiple Threads (SIMT) processor adopts a programming model that is fundamentally disparate from conventional scalar processors…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
3

Načítá se…

Massively parallel differential evolution—pattern search optimization with graphics hardware acceleration: an investigation on bound constrained optimization problems Autor Zhu, Weihang

ISSN: 0925-5001, 1573-2916

Vydáno: Boston Springer US 01.07.2011

Vydáno v Journal of global optimization (01.07.2011)
“… In this paper, the classical DE was adapted in the data-parallel CPU-GPU heterogeneous computing platform featuring Single Instruction-Multiple Thread (SIMT) execution…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
4

Načítá se…

Cluster-based approach for improving graphics processing unit performance by inter streaming multiprocessors locality Autor Keshtegar, Mohammad Mahdi, Falahati, Hajar, Hessabi, Shaahin

ISSN: 1751-8601, 1751-861X, 2095-882X, 1751-861X, 2589-0514

Vydáno: Beijing The Institution of Engineering and Technology 01.09.2015

Vydáno v Chronic diseases and translational medicine (01.09.2015)
“… As GPUs employ multithreading to hide latency, there is a small private data cache in each single instruction multiple thread (SIMT) core…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
5

Načítá se…

Parallel ant colony for nonlinear function optimization with graphics hardware acceleration Autor Weihang Zhu, Curry, J.

ISBN: 9781424427932, 1424427932

ISSN: 1062-922X

Vydáno: IEEE 01.10.2009

Vydáno v 2009 IEEE International Conference on Systems, Man and Cybernetics (01.10.2009)
“… `single instruction - multiple thread' (SIMT). The global optimal search of the ACO is enhanced by the classical local pattern search (PS) method…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
6

Načítá se…

Contention-Aware Selective Caching to Mitigate Intra-Warp Contention on GPUs Autor Kyoshin Choo, Troendle, David, Gad, Esraa A., Byunghyun Jang

Vydáno: IEEE 01.07.2017

Vydáno v 2017 16th International Symposium on Parallel and Distributed Computing (ISPDC) (01.07.2017)
“… However, the behavior and effect of the cache on GPUs are different from those on conventional processors due to the Single Instruction Multiple Thread (SIMT…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
7

Načítá se…

GPU accelerated multilevel fast physical optics algorithm for radiation from non-planar apertures Autor Milo, Matan, Galanti, Barak, Boag, Amir

Vydáno: IEEE 01.11.2015

Vydáno v 2015 IEEE International Conference on Microwaves, Communications, Antennas and Electronic Systems (COMCAS) (01.11.2015)
“…Acceleration of the multilevel physical optics (MLPO) algorithm using the single instruction multiple threads (SIMT…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
8

Načítá se…

Nonlinear optimization with a massively parallel Evolution Strategy–Pattern Search algorithm on graphics hardware Autor Zhu, Weihang

ISSN: 1568-4946, 1872-9681

Vydáno: Elsevier B.V 01.03.2011

Vydáno v Applied soft computing (01.03.2011)
“… ‘Single Instruction Multiple Thread’ (SIMT). Evolution Strategy is a population-based evolutionary algorithm for solving complex optimization problems…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
9

Načítá se…

Accelerated parametric chamfer alignment using a parallel, pipelined GPU realization Autor Elliethy, Ahmed, Sharma, Gaurav

ISSN: 1861-8200, 1861-8219

Vydáno: Berlin/Heidelberg Springer Berlin Heidelberg 01.10.2019

Vydáno v Journal of real-time image processing (01.10.2019)
“…Parametric chamfer alignment (PChA) is commonly employed for aligning an observed set of points with a corresponding set of reference points. PChA estimates…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
10

Načítá se…

gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs Autor Gaihre, Anil, Li, Xiaoye Sherry, Liu, Hang

ISSN: 1045-9219, 1558-2183

Vydáno: New York IEEE 01.04.2022

Vydáno v IEEE transactions on parallel and distributed systems (01.04.2022)
“…Decomposing a matrix <inline-formula><tex-math notation="LaTeX">\mathbf {A}</tex-math> <mml:math><mml:mi…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
11

Načítá se…

cuTensor-Tubal: Efficient Primitives for Tubal-Rank Tensor Learning Operations on GPUs Autor Zhang, Tao, Liu, Xiao-Yang, Wang, Xiaodong, Walid, Anwar

ISSN: 1045-9219, 1558-2183

Vydáno: New York IEEE 01.03.2020

Vydáno v IEEE transactions on parallel and distributed systems (01.03.2020)
“…Tensors are the cornerstone data structures in high-performance computing, big data analysis and machine learning. However, tensor computations are…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
12

Načítá se…

Hardware Accelerators for Real-Time Face Recognition: A Survey Autor Baobaid, Asma, Meribout, Mahmoud, Tiwari, Varun Kumar, Pena, Juan Pablo

ISSN: 2169-3536, 2169-3536

Vydáno: Piscataway IEEE 2022

Vydáno v IEEE access (2022)
“…Real-time face recognition has been of great interest in the last decade due to its wide and varied critical applications which include biometrics, security in…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
13

Načítá se…

Using SIMD and SIMT vectorization to evaluate sparse chemical kinetic Jacobian matrices and thermochemical source terms Autor Curtis, Nicholas J., Niemeyer, Kyle E., Sung, Chih-Jen

ISSN: 0010-2180, 1556-2921

Vydáno: New York Elsevier Inc 01.12.2018

Vydáno v Combustion and flame (01.12.2018)
“…) and single-instruction, multiple thread (SIMT) paradigms. These are implemented in pyJac, an open-source, reproducible code generation…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
14

Načítá se…

Simultaneous branch and warp interweaving for sustained GPU performance Autor Brunie, Nicolas, Collange, Caroline, Diamos, Gregory

ISBN: 9781467304757, 1467304751

ISSN: 1063-6897

Vydáno: IEEE 01.06.2012

Vydáno v 2012 39th Annual International Symposium on Computer Architecture (ISCA) (01.06.2012)
“…Instruction Multiple-Thread (SIMT) micro-architectures implemented in Graphics Processing Units (GPUs) run fine-grained threads in lockstep by grouping them…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
15

Načítá se…

Acceleration of Bilateral Filtering Algorithm for Manycore and Multicore Architectures Autor Agarwal, D., Wilf, S., Dhungel, A., Prasad, S. K.

ISBN: 9781467325080, 1467325082

ISSN: 0190-3918

Vydáno: IEEE 01.09.2012

Vydáno v 2012 41st International Conference on Parallel Processing (01.09.2012)
“… patterns as per the computations to exploit special purpose instructions. We also propose optimizations pertinent to Nvidia's Compute Unified Device…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
16

Načítá se…

Using SIMD and SIMT vectorization to evaluate sparse chemical kinetic Jacobian matrices and thermochemical source terms Autor Curtis, Nicholas J, Niemeyer, Kyle E, Chih-Jen Sung

ISSN: 2331-8422

Vydáno: Ithaca Cornell University Library, arXiv.org 04.09.2018

Vydáno v arXiv.org (04.09.2018)
“…) and single-instruction, multiple thread (SIMT) paradigms. These are implemented in pyJac, an open-source, reproducible code generation…”

Získat plný text

Paper

Přidat do oblíbených

Uloženo v:
17

Načítá se…

An efficient STT-RAM-based register file in GPU architectures Autor Xiaoxiao Liu, Mengjie Mao, Xiuyuan Bi, Hai Li, Yiran Chen

ISSN: 2153-6961

Vydáno: IEEE 01.01.2015

Vydáno v The 20th Asia and South Pacific Design Automation Conference (01.01.2015)
“…Modern GPGPUs employ a large register file (RF) to efficiently process heavily parallel threads in single instruction multiple thread (SIMT) fashion…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
18

Načítá se…

Multigrid on GPU: Tackling Power Grid Analysis on parallel SIMT platforms Autor Zhuo Feng, Peng Li

ISBN: 142442819X, 9781424428199

ISSN: 1092-3152

Vydáno: IEEE 01.11.2008

Vydáno v 2008 IEEE/ACM International Conference on Computer-Aided Design (01.11.2008)
“… For the first time, we show how to exploit recent massively parallel single-instruction multiple-thread (SIMT…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
19

Načítá se…

Fast 1-itemset frequency count using CUDA Autor Uy, Roger Luis, Marcos, Nelson

ISSN: 2159-3450

Vydáno: IEEE 01.11.2016

Vydáno v TENCON ... IEEE Region Ten Conference (01.11.2016)
“… Thus there is a need to speed-up this process. One of the techniques to speed-up the process is using the Single Instruction Multiple Thread (SIMT) architecture…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
20

Načítá se…

GSoFa: Scalable Sparse Symbolic LU Factorization on GPUs Autor Gaihre, Anil, Li, Xiaoye S, Liu, Hang

ISSN: 2331-8422

Vydáno: Ithaca Cornell University Library, arXiv.org 09.05.2021

Vydáno v arXiv.org (09.05.2021)
“…Decomposing matrix A into a lower matrix L and an upper matrix U, which is also known as LU decomposition, is an essential operation in numerical linear…”

Získat plný text

Paper

Přidat do oblíbených

Uloženo v:

Výsledky vyhledávání - single‐instruction‐multiple‐threads pattern

Fully GPU-based electromagnetic transient simulation considering large-scale control systems for system-level studies Autor Song, Yankan, Chen, Ying, Huang, Shaowei, Xu, Yin, Yu, Zhitong, Marti, Jose R

A Quantitative Method to Data Reuse Patterns of SIMT Applications Autor Lai, Bo-Cheng Charles, Garrido Platero, Luis, Hsien-Kai Kuo

Massively parallel differential evolution—pattern search optimization with graphics hardware acceleration: an investigation on bound constrained optimization problems Autor Zhu, Weihang

Cluster-based approach for improving graphics processing unit performance by inter streaming multiprocessors locality Autor Keshtegar, Mohammad Mahdi, Falahati, Hajar, Hessabi, Shaahin

Parallel ant colony for nonlinear function optimization with graphics hardware acceleration Autor Weihang Zhu, Curry, J.

Contention-Aware Selective Caching to Mitigate Intra-Warp Contention on GPUs Autor Kyoshin Choo, Troendle, David, Gad, Esraa A., Byunghyun Jang

GPU accelerated multilevel fast physical optics algorithm for radiation from non-planar apertures Autor Milo, Matan, Galanti, Barak, Boag, Amir

Nonlinear optimization with a massively parallel Evolution Strategy–Pattern Search algorithm on graphics hardware Autor Zhu, Weihang

Accelerated parametric chamfer alignment using a parallel, pipelined GPU realization Autor Elliethy, Ahmed, Sharma, Gaurav

gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs Autor Gaihre, Anil, Li, Xiaoye Sherry, Liu, Hang

cuTensor-Tubal: Efficient Primitives for Tubal-Rank Tensor Learning Operations on GPUs Autor Zhang, Tao, Liu, Xiao-Yang, Wang, Xiaodong, Walid, Anwar

Hardware Accelerators for Real-Time Face Recognition: A Survey Autor Baobaid, Asma, Meribout, Mahmoud, Tiwari, Varun Kumar, Pena, Juan Pablo

Using SIMD and SIMT vectorization to evaluate sparse chemical kinetic Jacobian matrices and thermochemical source terms Autor Curtis, Nicholas J., Niemeyer, Kyle E., Sung, Chih-Jen

Simultaneous branch and warp interweaving for sustained GPU performance Autor Brunie, Nicolas, Collange, Caroline, Diamos, Gregory

Acceleration of Bilateral Filtering Algorithm for Manycore and Multicore Architectures Autor Agarwal, D., Wilf, S., Dhungel, A., Prasad, S. K.

Using SIMD and SIMT vectorization to evaluate sparse chemical kinetic Jacobian matrices and thermochemical source terms Autor Curtis, Nicholas J, Niemeyer, Kyle E, Chih-Jen Sung

An efficient STT-RAM-based register file in GPU architectures Autor Xiaoxiao Liu, Mengjie Mao, Xiuyuan Bi, Hai Li, Yiran Chen

Multigrid on GPU: Tackling Power Grid Analysis on parallel SIMT platforms Autor Zhuo Feng, Peng Li

Fast 1-itemset frequency count using CUDA Autor Uy, Roger Luis, Marcos, Nelson

GSoFa: Scalable Sparse Symbolic LU Factorization on GPUs Autor Gaihre, Anil, Li, Xiaoye S, Liu, Hang

Vyhledávací nástroje:

Upřesnit hledání

Médium

Předmětová oblast

Téma

Jazyk

Rok vydání