Výsledky vyhledávání - single‐instruction‐multiple‐threads pattern

  1. 1

    Fully GPU-based electromagnetic transient simulation considering large-scale control systems for system-level studies Autor Song, Yankan, Chen, Ying, Huang, Shaowei, Xu, Yin, Yu, Zhitong, Marti, Jose R

    ISSN: 1751-8687, 1751-8695
    Vydáno: The Institution of Engineering and Technology 03.08.2017
    “…As more generators and loads are integrated by power electronic converters with complicated controls, electromagnetic transients (EMTs) simulation becomes an…”
    Získat plný text
    Journal Article
  2. 2

    A Quantitative Method to Data Reuse Patterns of SIMT Applications Autor Lai, Bo-Cheng Charles, Garrido Platero, Luis, Hsien-Kai Kuo

    ISSN: 1556-6056, 1556-6064
    Vydáno: New York IEEE 01.07.2016
    Vydáno v IEEE computer architecture letters (01.07.2016)
    “… The emerging Single Instruction Multiple Threads (SIMT) processor adopts a programming model that is fundamentally disparate from conventional scalar processors…”
    Získat plný text
    Journal Article
  3. 3

    Massively parallel differential evolution—pattern search optimization with graphics hardware acceleration: an investigation on bound constrained optimization problems Autor Zhu, Weihang

    ISSN: 0925-5001, 1573-2916
    Vydáno: Boston Springer US 01.07.2011
    Vydáno v Journal of global optimization (01.07.2011)
    “… In this paper, the classical DE was adapted in the data-parallel CPU-GPU heterogeneous computing platform featuring Single Instruction-Multiple Thread (SIMT) execution…”
    Získat plný text
    Journal Article
  4. 4

    Cluster-based approach for improving graphics processing unit performance by inter streaming multiprocessors locality Autor Keshtegar, Mohammad Mahdi, Falahati, Hajar, Hessabi, Shaahin

    ISSN: 1751-8601, 1751-861X, 2095-882X, 1751-861X, 2589-0514
    Vydáno: Beijing The Institution of Engineering and Technology 01.09.2015
    “… As GPUs employ multithreading to hide latency, there is a small private data cache in each single instruction multiple thread (SIMT) core…”
    Získat plný text
    Journal Article
  5. 5

    Parallel ant colony for nonlinear function optimization with graphics hardware acceleration Autor Weihang Zhu, Curry, J.

    ISBN: 9781424427932, 1424427932
    ISSN: 1062-922X
    Vydáno: IEEE 01.10.2009
    “… `single instruction - multiple thread' (SIMT). The global optimal search of the ACO is enhanced by the classical local pattern search (PS) method…”
    Získat plný text
    Konferenční příspěvek
  6. 6

    Contention-Aware Selective Caching to Mitigate Intra-Warp Contention on GPUs Autor Kyoshin Choo, Troendle, David, Gad, Esraa A., Byunghyun Jang

    Vydáno: IEEE 01.07.2017
    “… However, the behavior and effect of the cache on GPUs are different from those on conventional processors due to the Single Instruction Multiple Thread (SIMT…”
    Získat plný text
    Konferenční příspěvek
  7. 7

    GPU accelerated multilevel fast physical optics algorithm for radiation from non-planar apertures Autor Milo, Matan, Galanti, Barak, Boag, Amir

    Vydáno: IEEE 01.11.2015
    “…Acceleration of the multilevel physical optics (MLPO) algorithm using the single instruction multiple threads (SIMT…”
    Získat plný text
    Konferenční příspěvek
  8. 8

    Nonlinear optimization with a massively parallel Evolution Strategy–Pattern Search algorithm on graphics hardware Autor Zhu, Weihang

    ISSN: 1568-4946, 1872-9681
    Vydáno: Elsevier B.V 01.03.2011
    Vydáno v Applied soft computing (01.03.2011)
    “… ‘Single Instruction Multiple Thread’ (SIMT). Evolution Strategy is a population-based evolutionary algorithm for solving complex optimization problems…”
    Získat plný text
    Journal Article
  9. 9

    Accelerated parametric chamfer alignment using a parallel, pipelined GPU realization Autor Elliethy, Ahmed, Sharma, Gaurav

    ISSN: 1861-8200, 1861-8219
    Vydáno: Berlin/Heidelberg Springer Berlin Heidelberg 01.10.2019
    Vydáno v Journal of real-time image processing (01.10.2019)
    “…Parametric chamfer alignment (PChA) is commonly employed for aligning an observed set of points with a corresponding set of reference points. PChA estimates…”
    Získat plný text
    Journal Article
  10. 10

    gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs Autor Gaihre, Anil, Li, Xiaoye Sherry, Liu, Hang

    ISSN: 1045-9219, 1558-2183
    Vydáno: New York IEEE 01.04.2022
    “…Decomposing a matrix <inline-formula><tex-math notation="LaTeX">\mathbf {A}</tex-math> <mml:math><mml:mi…”
    Získat plný text
    Journal Article
  11. 11

    cuTensor-Tubal: Efficient Primitives for Tubal-Rank Tensor Learning Operations on GPUs Autor Zhang, Tao, Liu, Xiao-Yang, Wang, Xiaodong, Walid, Anwar

    ISSN: 1045-9219, 1558-2183
    Vydáno: New York IEEE 01.03.2020
    “…Tensors are the cornerstone data structures in high-performance computing, big data analysis and machine learning. However, tensor computations are…”
    Získat plný text
    Journal Article
  12. 12

    Hardware Accelerators for Real-Time Face Recognition: A Survey Autor Baobaid, Asma, Meribout, Mahmoud, Tiwari, Varun Kumar, Pena, Juan Pablo

    ISSN: 2169-3536, 2169-3536
    Vydáno: Piscataway IEEE 2022
    Vydáno v IEEE access (2022)
    “…Real-time face recognition has been of great interest in the last decade due to its wide and varied critical applications which include biometrics, security in…”
    Získat plný text
    Journal Article
  13. 13

    Using SIMD and SIMT vectorization to evaluate sparse chemical kinetic Jacobian matrices and thermochemical source terms Autor Curtis, Nicholas J., Niemeyer, Kyle E., Sung, Chih-Jen

    ISSN: 0010-2180, 1556-2921
    Vydáno: New York Elsevier Inc 01.12.2018
    Vydáno v Combustion and flame (01.12.2018)
    “…) and single-instruction, multiple thread (SIMT) paradigms. These are implemented in pyJac, an open-source, reproducible code generation…”
    Získat plný text
    Journal Article
  14. 14

    Simultaneous branch and warp interweaving for sustained GPU performance Autor Brunie, Nicolas, Collange, Caroline, Diamos, Gregory

    ISBN: 9781467304757, 1467304751
    ISSN: 1063-6897
    Vydáno: IEEE 01.06.2012
    “…Instruction Multiple-Thread (SIMT) micro-architectures implemented in Graphics Processing Units (GPUs) run fine-grained threads in lockstep by grouping them…”
    Získat plný text
    Konferenční příspěvek
  15. 15

    Acceleration of Bilateral Filtering Algorithm for Manycore and Multicore Architectures Autor Agarwal, D., Wilf, S., Dhungel, A., Prasad, S. K.

    ISBN: 9781467325080, 1467325082
    ISSN: 0190-3918
    Vydáno: IEEE 01.09.2012
    “… patterns as per the computations to exploit special purpose instructions. We also propose optimizations pertinent to Nvidia's Compute Unified Device…”
    Získat plný text
    Konferenční příspěvek
  16. 16

    Using SIMD and SIMT vectorization to evaluate sparse chemical kinetic Jacobian matrices and thermochemical source terms Autor Curtis, Nicholas J, Niemeyer, Kyle E, Chih-Jen Sung

    ISSN: 2331-8422
    Vydáno: Ithaca Cornell University Library, arXiv.org 04.09.2018
    Vydáno v arXiv.org (04.09.2018)
    “…) and single-instruction, multiple thread (SIMT) paradigms. These are implemented in pyJac, an open-source, reproducible code generation…”
    Získat plný text
    Paper
  17. 17

    An efficient STT-RAM-based register file in GPU architectures Autor Xiaoxiao Liu, Mengjie Mao, Xiuyuan Bi, Hai Li, Yiran Chen

    ISSN: 2153-6961
    Vydáno: IEEE 01.01.2015
    “…Modern GPGPUs employ a large register file (RF) to efficiently process heavily parallel threads in single instruction multiple thread (SIMT) fashion…”
    Získat plný text
    Konferenční příspěvek
  18. 18

    Multigrid on GPU: Tackling Power Grid Analysis on parallel SIMT platforms Autor Zhuo Feng, Peng Li

    ISBN: 142442819X, 9781424428199
    ISSN: 1092-3152
    Vydáno: IEEE 01.11.2008
    “… For the first time, we show how to exploit recent massively parallel single-instruction multiple-thread (SIMT…”
    Získat plný text
    Konferenční příspěvek
  19. 19

    Fast 1-itemset frequency count using CUDA Autor Uy, Roger Luis, Marcos, Nelson

    ISSN: 2159-3450
    Vydáno: IEEE 01.11.2016
    Vydáno v TENCON ... IEEE Region Ten Conference (01.11.2016)
    “… Thus there is a need to speed-up this process. One of the techniques to speed-up the process is using the Single Instruction Multiple Thread (SIMT) architecture…”
    Získat plný text
    Konferenční příspěvek
  20. 20

    GSoFa: Scalable Sparse Symbolic LU Factorization on GPUs Autor Gaihre, Anil, Li, Xiaoye S, Liu, Hang

    ISSN: 2331-8422
    Vydáno: Ithaca Cornell University Library, arXiv.org 09.05.2021
    Vydáno v arXiv.org (09.05.2021)
    “…Decomposing matrix A into a lower matrix L and an upper matrix U, which is also known as LU decomposition, is an essential operation in numerical linear…”
    Získat plný text
    Paper