Search results - algorithms and (architectural OR architecture) for parallel processing

  1. An architectural framework for accelerating dynamic parallel algorithms on reconfigurable hardware by Chen, Tao, Srinath, Shreesha, Batten, Christopher, Suh, G. Edward

    ISBN: 9781538662403, 153866240X
    Published: Piscataway, NJ, USA IEEE Press 20.10.2018
    “… In this paper, we propose ParallelXL, an architectural framework for building application-specific parallel accelerators with low manual effort …”
    Full text
    Conference proceeding
  2. A Fast CTU-level SAO Algorithm and Its Hardware Architecture for AVS3 Video Coding by Lin, Hao, Wen, Yingbo, Xiang, Guoqing, Qu, Xinyu, Zhang, Peng, Yan, Wei

    ISSN: 2158-4001
    Published: IEEE 06.01.2024
    “… To address this problem, a fast Coding Tree Unit (CTU)-level SAO algorithm and its hardware architecture is proposed …”
    Full text
    Conference proceeding
  3. Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design by Zhang, Xingyao, Song, Shuaiwen Leon, Xie, Chenhao, Wang, Jing, Zhang, Weigong, Fu, Xin

    ISSN: 2378-203X
    Published: IEEE 01.02.2020
    “… In recent years, the CNNs have achieved great successes in the image processing tasks, e.g …”
    Full text
    Conference proceeding
  4. Implementing a Parallel Image Edge Detection Algorithm Based on the Otsu-Canny Operator on the Hadoop Platform by Tian, Yun, Wang, Min, Chen, Lichao, Cao, Jianfang

    ISSN: 1687-5265, 1687-5273
    Published: Cairo, Egypt Hindawi Publishing Corporation 01.01.2018
    Published in Computational intelligence and neuroscience (01.01.2018)
    “… parallel programming model that runs on the Hadoop platform. The Otsu algorithm is used to optimize the Canny …”
    Full text
    Journal Article
  5. Scalable Parallel Processing: Architectural Models, Real-Time Programming, and Performance Evaluation by Mirela Sino, Ervin Domazet

    ISSN: 2673-4591
    Published: MDPI AG 01.08.2025
    Published in Engineering proceedings (01.08.2025)
    “… This research paper analyzes and highlights the benefits of parallel processing to enhance performance and computational efficiency in modern computing systems …”
    Full text
    Journal Article
  6. Evaluation of parallel particle swarm optimization algorithms within the CUDA™ architecture by Mussi, Luca, Daolio, Fabio, Cagnoni, Stefano

    ISSN: 0020-0255, 1872-6291
    Published: Elsevier Inc 15.10.2011
    Published in Information sciences (15.10.2011)
    “… ), which are, in fact, massively parallel processing architectures. In this paper we discuss possible approaches to parallelizing PSO on graphics hardware within the Compute Unified Device Architecture (CUDA …”
    Full text
    Journal Article
  7. Resampling algorithms and architectures for distributed particle filters by Bolic, M., Djuric, P.M., Sangjin Hong

    ISSN: 1053-587X, 1941-0476
    Published: New York, NY IEEE 01.07.2005
    Published in IEEE transactions on signal processing (01.07.2005)
    “… In this paper, we propose novel resampling algorithms with architectures for efficient distributed implementation of particle filters …”
    Full text
    Journal Article
  8. Algorithm and Architecture of a Low-Complexity and High-Parallelism Preprocessing-Based K-Best Detector for Large-Scale MIMO Systems by Peng, Guiqiang, Liu, Leibo, Zhou, Sheng, Xue, Yang, Yin, Shouyi, Wei, Shaojun

    ISSN: 1053-587X, 1941-0476
    Published: IEEE 01.04.2018
    Published in IEEE transactions on signal processing (01.04.2018)
    “… To address this problem, this paper proposes a preprocessing algorithm combining Cholesky sorted QR decomposition and partial iterative lattice reduction (CHOSLAR …”
    Full text
    Journal Article
  9. IOPA: I/O-aware parallelism adaption for parallel programs by Liu, Tao, Liu, Yi, Qian, Chen, Qian, Depei

    ISSN: 1932-6203
    Published: United States Public Library of Science 09.03.2017
    Published in PloS one (09.03.2017)
    “… With the development of multi-/many-core processors, applications need to be written as parallel programs to improve execution efficiency …”
    Full text
    Journal Article
  10. An Alpha-Tree Algorithm for Massively Parallel Architectures by Carlinet, Edwin, Kaci, Quentin, Blin, Nicolas

    ISSN: 1057-7149, 1941-0042
    Published: United States IEEE 01.01.2025
    Published in IEEE transactions on image processing (01.01.2025)
    “… In this paper, we propose the first massively parallel alpha-tree algorithm that leverages concurrent union-find data structures to exploit the SIMT …”
    Full text
    Journal Article
  11. Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles by Durrani, Sultan, Chughtai, Muhammad Saad, Hidayetoglu, Mert, Tahir, Rashid, Dakkak, Abdul, Rauchwerger, Lawrence, Zaffar, Fareed, Hwu, Wen-mei

    Published: IEEE 01.09.2021
    “… To speed things up, fast Fourier transform (FFT) algorithms, which are reduced-complexity formulations for computing the DFT of a sequence, have been proposed and implemented for traditional processors and their corresponding instruction sets …”
    Full text
    Conference proceeding
  12. An Efficient Implementation of the Bellman-Ford Algorithm for Kepler GPU Architectures by Busato, Federico, Bombieri, Nicola

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.08.2016
    “… ) involves much more redundant work and a consequent waste of power consumption. This article presents a parallel implementation of the Bellman-Ford algorithm that exploits the architectural characteristics of recent GPU architectures (i.e …”
    Full text
    Journal Article
  13. STREAMINGGS: Voxel-Based Streaming 3D Gaussian Splatting with Memory Optimization and Architectural Support by Zhang, Chenqi, Feng, Yu, Zhao, Jieru, Liu, Guangda, Ding, Wenchao, Wu, Chentao, Guo, Minyi

    Published: IEEE 22.06.2025
    “… We introduce STREAMINGGS, a fully streaming 3DGS algorithm-architecture co-design that achieves fine-grained pipelining and reduces DRAM traffic by transforming from a tile-centric rendering …”
    Full text
    Conference proceeding
  14. Parallel Branch-And-Bound Algorithms: Survey and Synthesis by Gendron, Bernard, Crainic, Teodor Gabriel

    ISSN: 0030-364X, 1526-5463
    Published: Linthicum, MD Operations Research Society of America 01.11.1994
    Published in Operations research (01.11.1994)
    “… We present a detailed and up-to-date survey of the literature on parallel branch-and-bound algorithms …”
    Full text
    Journal Article
  15. Parallel and accurate k‐means algorithm on CPU‐GPU architectures for spectral clustering by He, Guanlin, Vialle, Stephane, Baboulin, Marc

    ISSN: 1532-0626, 1532-0634
    Published: Hoboken Wiley Subscription Services, Inc 25.06.2022
    Published in Concurrency and computation (25.06.2022)
    “… We propose parallel optimization techniques for the k‐means algorithm on CPU and GPU. Particularly we use a two …”
    Full text
    Journal Article
  16. An efficient heterogeneous parallel password recovery system on MT-3000 by Luo, Yongtao, Liu, Jie, Gong, Chunye, Li, Tun

    ISSN: 0920-8542, 1573-0484
    Published: New York Springer US 01.01.2025
    Published in The Journal of supercomputing (01.01.2025)
    “… However, as encryption algorithms and complex passwords become more prevalent for security purposes, traditional CPU-based and GPU-based password recovery systems are struggling to meet the time …”
    Full text
    Journal Article
  17. Hardware-Efficient and High-Throughput LLRC Segregation Based Binary QC-LDPC Decoding Algorithm and Architecture by Verma, Anuj, Shrestha, Rahul

    ISSN: 1549-7747, 1558-3791
    Published: New York IEEE 01.08.2021
    “… ) segregation technique. Subsequently, we present hardware-efficient QC-LDPC decoder-architecture based on the proposed algorithm and additional architectural optimizations …”
    Full text
    Journal Article
  18. An efficient hardware supported and parallelization architecture for intelligent systems to overcome speculative overheads by Kumar, Sudhakar, Singh, Sunil K., Aggarwal, Naveen, Gupta, Brij B., Alhalabi, Wadee, Band, Shahab S.

    ISSN: 0884-8173, 1098-111X
    Published: New York John Wiley & Sons, Inc 01.12.2022
    Published in International journal of intelligent systems (01.12.2022)
    “… time‐consuming and central processing unit intensive. As a consequence, parallel processing systems are gaining popularity to enhance overall computer performance …”
    Full text
    Journal Article
  19. Auto-GNAS: A Parallel Graph Neural Architecture Search Framework by Chen, Jiamin, Gao, Jianliang, Chen, Yibo, Oloulade, Babatounde Moctard, Lyu, Tengfei, Li, Zhao

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.11.2022
    “… Graph neural architecture search effectively constructs the GNNs that achieve the expected model performance with the rise of automatic machine learning …”
    Full text
    Journal Article
  20. Max-PIM: Fast and Efficient Max/Min Searching in DRAM by Zhang, Fan, Angizi, Shaahin, Fan, Deliang

    Published: IEEE 05.12.2021
    “… In this work, for the first time, we propose a novel 'Min/Max-in-memory' algorithm based on iterative XNOR bit-wise comparison, which supports parallel in-memory searching for minimum and maximum …”
    Full text
    Conference proceeding