Search Results - algorithms and (architekture OR architecture) for parallel processing

Refine Results
  1. 1

    Evaluation of parallel particle swarm optimization algorithms within the CUDA™ architecture by Mussi, Luca, Daolio, Fabio, Cagnoni, Stefano

    ISSN: 0020-0255, 1872-6291
    Published: Elsevier Inc 15.10.2011
    Published in Information sciences (15.10.2011)
    “…), which are, in fact, massively parallel processing architectures. In this paper we discuss possible approaches to parallelizing PSO on graphics hardware within the Compute Unified Device Architecture (CUDA…”
    Get full text
    Journal Article
  2. 2

    An Alpha-Tree Algorithm for Massively Parallel Architectures by Carlinet, Edwin, Kaci, Quentin, Blin, Nicolas

    ISSN: 1057-7149, 1941-0042, 1941-0042
    Published: United States IEEE 01.01.2025
    Published in IEEE transactions on image processing (01.01.2025)
    “… In this paper, we propose the first massively parallel alpha-tree algorithm that leverages concurrent union-find data structures to exploit the SIMT…”
    Get full text
    Journal Article
  3. 3

    Parallel and accurate k‐means algorithm on CPU‐GPU architectures for spectral clustering by He, Guanlin, Vialle, Stephane, Baboulin, Marc

    ISSN: 1532-0626, 1532-0634
    Published: Hoboken Wiley Subscription Services, Inc 25.06.2022
    Published in Concurrency and computation (25.06.2022)
    “… We propose parallel optimization techniques for the k‐means algorithm on CPU and GPU. Particularly we use a two…”
    Get full text
    Journal Article
  4. 4

    Auto-GNAS: A Parallel Graph Neural Architecture Search Framework by Chen, Jiamin, Gao, Jianliang, Chen, Yibo, Oloulade, Babatounde Moctard, Lyu, Tengfei, Li, Zhao

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.11.2022
    “… Graph neural architecture search effectively constructs the GNNs that achieve the expected model performance with the rise of automatic machine learning…”
    Get full text
    Journal Article
  5. 5

    Max-PIM: Fast and Efficient Max/Min Searching in DRAM by Zhang, Fan, Angizi, Shaahin, Fan, Deliang

    Published: IEEE 05.12.2021
    “… In this work, for the first time, we propose a novel 'Min/Max-in-memory' algorithm based on iterative XNOR bit-wise comparison, which supports parallel inmemory searching for minimum and maximum…”
    Get full text
    Conference Proceeding
  6. 6

    PV to Virtual Bus Parallel Differential Power Processing Architecture for Photovoltaic Systems by Nazer, Afshin, Isabella, Olindo, Manganiello, Patrizio

    ISSN: 0278-0046, 1557-9948
    Published: New York IEEE 01.05.2025
    “…This article introduces an innovative parallel differential power processing (PDPP) architecture designed to mitigate the effect of mismatch among photovoltaic…”
    Get full text
    Journal Article
  7. 7

    A flexible algorithm for calculating pair interactions on SIMD architectures by Páll, Szilárd, Hess, Berk

    ISSN: 0010-4655, 1879-2944, 1879-2944
    Published: Elsevier B.V 01.12.2013
    Published in Computer physics communications (01.12.2013)
    “… In order to reach high performance on modern CPU and accelerator architectures, single-instruction multiple-data (SIMD…”
    Get full text
    Journal Article
  8. 8

    Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs by Wang, Pengyu, Li, Chao, Wang, Jing, Wang, Taolei, Zhang, Lu, Leng, Jingwen, Chen, Quan, Guo, Minyi

    Published: IEEE 01.09.2021
    “…Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt computing-intensive algorithms on large-scale graphs…”
    Get full text
    Conference Proceeding
  9. 9

    Practical Implementation of Multichannel Filtered-x Least Mean Square Algorithm Based on the Multiple-Parallel-Branch With Folding Architecture for Large-Scale Active Noise Control by Shi, Dongyuan, Gan, Woon-Seng, He, Jianjun, Lam, Bhan

    ISSN: 1063-8210, 1557-9999
    Published: New York IEEE 01.04.2020
    “… The feedforward multichannel filtered-x least mean square (FFMCFxLMS) algorithm is commonly used to dynamically adjust the transfer function of the multichannel controllers for different noise environments…”
    Get full text
    Journal Article
  10. 10

    Parallel Pipelined Architecture and Algorithm for Matrix Transposition Using Registers by Zhang, Bo, Ma, Zhenguo, Luo, Wei

    ISSN: 1549-7747, 1558-3791
    Published: New York IEEE 01.03.2022
    “…In this brief, we present a new algorithm and architecture for continuous-flow matrix transposition using registers…”
    Get full text
    Journal Article
  11. 11

    Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses by Ai, Yang, Ling, Zhen-Hua

    ISSN: 2379-190X
    Published: IEEE 04.06.2023
    “… The proposed model is a cascade of a residual convolutional network and a parallel estimation architecture…”
    Get full text
    Conference Proceeding
  12. 12

    cuFSDAF: An Enhanced Flexible Spatiotemporal Data Fusion Algorithm Parallelized Using Graphics Processing Units by Gao, Huan, Zhu, Xiaolin, Guan, Qingfeng, Yang, Xue, Yao, Yao, Zeng, Wen, Peng, Xuantong

    ISSN: 0196-2892, 1558-0644
    Published: New York IEEE 2022
    “…) algorithm is suitable for heterogeneous landscapes and capable of capturing abrupt land-cover changes…”
    Get full text
    Journal Article
  13. 13

    InnerSP: A Memory Efficient Sparse Matrix Multiplication Accelerator with Locality-Aware Inner Product Processing by Baek, Daehyeon, Hwang, Soojin, Heo, Taekyung, Kim, Daehoon, Huh, Jaehyuk

    Published: IEEE 01.09.2021
    “… To mitigate the memory access overheads, recent accelerator designs advocated the outer product processing which minimizes input accesses but generates intermediate products to be merged to the final output matrix…”
    Get full text
    Conference Proceeding
  14. 14

    New parallel computing algorithm of molecular dynamics for extremely huge scale biological systems by Jung, Jaewoon, Kobayashi, Chigusa, Kasahara, Kento, Tan, Cheng, Kuroda, Akiyoshi, Minami, Kazuo, Ishiduki, Shigeru, Nishiki, Tatsuo, Inoue, Hikaru, Ishikawa, Yutaka, Feig, Michael, Sugita, Yuji

    ISSN: 0192-8651, 1096-987X, 1096-987X
    Published: Hoboken, USA John Wiley & Sons, Inc 05.02.2021
    Published in Journal of computational chemistry (05.02.2021)
    “…In this paper, we address high performance extreme‐scale molecular dynamics (MD) algorithm in the GENESIS software to perform cellular…”
    Get full text
    Journal Article
  15. 15

    MoDL: Model-Based Deep Learning Architecture for Inverse Problems by Aggarwal, Hemant K., Mani, Merry P., Jacob, Mathews

    ISSN: 0278-0062, 1558-254X, 1558-254X
    Published: United States IEEE 01.02.2019
    Published in IEEE transactions on medical imaging (01.02.2019)
    “…)-based regularization prior. The proposed formulation provides a systematic approach for deriving deep architectures for inverse problems with the arbitrary structure…”
    Get full text
    Journal Article
  16. 16

    Parallel Implementation of Key Algorithms for Intelligent Processing of Graphic Signal Data of Consumer Digital Equipment by Huang, Changbing, Li, Ruibo, Li, Aiping

    ISSN: 1383-469X, 1572-8153
    Published: New York Springer US 01.10.2024
    Published in Mobile networks and applications (01.10.2024)
    “… The purpose of this paper is to process the graphic signal data in enterprise digital equipment and improve the efficiency of algorithm task processing in the system…”
    Get full text
    Journal Article
  17. 17

    IMAGING: In-Memory AlGorithms for Image processiNG by Haj-Ali, Ameer, Ben-Hur, Rotem, Wald, Nimrod, Ronen, Ronny, Kvatinsky, Shahar

    ISSN: 1549-8328, 1558-0806
    Published: New York IEEE 01.12.2018
    “…Data-intensive applications such as image processing suffer from massive data movement between memory and processing units…”
    Get full text
    Journal Article
  18. 18

    An Ising Model-Based Parallel Tempering Processing Architecture for Combinatorial Optimization by Zhang, Yang, Wang, Xiangrui, Fan, Gaopeng, Cao, Yuan, Liu, Yiqiu, Yang, Yongkui, Yao, Enyi

    ISSN: 0278-0070, 1937-4151
    Published: New York IEEE 01.12.2024
    “… This article presents a novel parallel tempering processing architecture (PTPA) based on the fully connected Ising model to address these issues…”
    Get full text
    Journal Article
  19. 19

    Splitwise: Efficient Generative LLM Inference Using Phase Splitting by Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Shah, Aashaka, Goiri, Inigo, Maleki, Saeed, Bianchini, Ricardo

    Published: IEEE 29.06.2024
    “…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”
    Get full text
    Conference Proceeding
  20. 20

    High-Throughput Non-Binary LDPC Decoder Architecture Using Parallel EMS Algorithm by Choe, Jeongwon, Lee, Youngjoo

    ISSN: 0018-9200, 1558-173X
    Published: New York IEEE 01.10.2022
    Published in IEEE journal of solid-state circuits (01.10.2022)
    “…) decoding algorithm that reduces the processing latency of each iteration by managing multiple message entries at a time…”
    Get full text
    Journal Article