Search Results - algorithms and (architekture OR architecture) for parallel processing
-
1
Evaluation of parallel particle swarm optimization algorithms within the CUDA™ architecture
ISSN: 0020-0255, 1872-6291Published: Elsevier Inc 15.10.2011Published in Information sciences (15.10.2011)“…), which are, in fact, massively parallel processing architectures. In this paper we discuss possible approaches to parallelizing PSO on graphics hardware within the Compute Unified Device Architecture (CUDA…”
Get full text
Journal Article -
2
An Alpha-Tree Algorithm for Massively Parallel Architectures
ISSN: 1057-7149, 1941-0042, 1941-0042Published: United States IEEE 01.01.2025Published in IEEE transactions on image processing (01.01.2025)“… In this paper, we propose the first massively parallel alpha-tree algorithm that leverages concurrent union-find data structures to exploit the SIMT…”
Get full text
Journal Article -
3
Parallel and accurate k‐means algorithm on CPU‐GPU architectures for spectral clustering
ISSN: 1532-0626, 1532-0634Published: Hoboken Wiley Subscription Services, Inc 25.06.2022Published in Concurrency and computation (25.06.2022)“… We propose parallel optimization techniques for the k‐means algorithm on CPU and GPU. Particularly we use a two…”
Get full text
Journal Article -
4
Auto-GNAS: A Parallel Graph Neural Architecture Search Framework
ISSN: 1045-9219, 1558-2183Published: New York IEEE 01.11.2022Published in IEEE transactions on parallel and distributed systems (01.11.2022)“… Graph neural architecture search effectively constructs the GNNs that achieve the expected model performance with the rise of automatic machine learning…”
Get full text
Journal Article -
5
Max-PIM: Fast and Efficient Max/Min Searching in DRAM
Published: IEEE 05.12.2021Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“… In this work, for the first time, we propose a novel 'Min/Max-in-memory' algorithm based on iterative XNOR bit-wise comparison, which supports parallel inmemory searching for minimum and maximum…”
Get full text
Conference Proceeding -
6
PV to Virtual Bus Parallel Differential Power Processing Architecture for Photovoltaic Systems
ISSN: 0278-0046, 1557-9948Published: New York IEEE 01.05.2025Published in IEEE transactions on industrial electronics (1982) (01.05.2025)“…This article introduces an innovative parallel differential power processing (PDPP) architecture designed to mitigate the effect of mismatch among photovoltaic…”
Get full text
Journal Article -
7
A flexible algorithm for calculating pair interactions on SIMD architectures
ISSN: 0010-4655, 1879-2944, 1879-2944Published: Elsevier B.V 01.12.2013Published in Computer physics communications (01.12.2013)“… In order to reach high performance on modern CPU and accelerator architectures, single-instruction multiple-data (SIMD…”
Get full text
Journal Article -
8
Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs
Published: IEEE 01.09.2021Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“…Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt computing-intensive algorithms on large-scale graphs…”
Get full text
Conference Proceeding -
9
Practical Implementation of Multichannel Filtered-x Least Mean Square Algorithm Based on the Multiple-Parallel-Branch With Folding Architecture for Large-Scale Active Noise Control
ISSN: 1063-8210, 1557-9999Published: New York IEEE 01.04.2020Published in IEEE transactions on very large scale integration (VLSI) systems (01.04.2020)“… The feedforward multichannel filtered-x least mean square (FFMCFxLMS) algorithm is commonly used to dynamically adjust the transfer function of the multichannel controllers for different noise environments…”
Get full text
Journal Article -
10
Parallel Pipelined Architecture and Algorithm for Matrix Transposition Using Registers
ISSN: 1549-7747, 1558-3791Published: New York IEEE 01.03.2022Published in IEEE transactions on circuits and systems. II, Express briefs (01.03.2022)“…In this brief, we present a new algorithm and architecture for continuous-flow matrix transposition using registers…”
Get full text
Journal Article -
11
Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses
ISSN: 2379-190XPublished: IEEE 04.06.2023Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (04.06.2023)“… The proposed model is a cascade of a residual convolutional network and a parallel estimation architecture…”
Get full text
Conference Proceeding -
12
cuFSDAF: An Enhanced Flexible Spatiotemporal Data Fusion Algorithm Parallelized Using Graphics Processing Units
ISSN: 0196-2892, 1558-0644Published: New York IEEE 2022Published in IEEE transactions on geoscience and remote sensing (2022)“…) algorithm is suitable for heterogeneous landscapes and capable of capturing abrupt land-cover changes…”
Get full text
Journal Article -
13
InnerSP: A Memory Efficient Sparse Matrix Multiplication Accelerator with Locality-Aware Inner Product Processing
Published: IEEE 01.09.2021Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“… To mitigate the memory access overheads, recent accelerator designs advocated the outer product processing which minimizes input accesses but generates intermediate products to be merged to the final output matrix…”
Get full text
Conference Proceeding -
14
New parallel computing algorithm of molecular dynamics for extremely huge scale biological systems
ISSN: 0192-8651, 1096-987X, 1096-987XPublished: Hoboken, USA John Wiley & Sons, Inc 05.02.2021Published in Journal of computational chemistry (05.02.2021)“…In this paper, we address high performance extreme‐scale molecular dynamics (MD) algorithm in the GENESIS software to perform cellular…”
Get full text
Journal Article -
15
MoDL: Model-Based Deep Learning Architecture for Inverse Problems
ISSN: 0278-0062, 1558-254X, 1558-254XPublished: United States IEEE 01.02.2019Published in IEEE transactions on medical imaging (01.02.2019)“…)-based regularization prior. The proposed formulation provides a systematic approach for deriving deep architectures for inverse problems with the arbitrary structure…”
Get full text
Journal Article -
16
Parallel Implementation of Key Algorithms for Intelligent Processing of Graphic Signal Data of Consumer Digital Equipment
ISSN: 1383-469X, 1572-8153Published: New York Springer US 01.10.2024Published in Mobile networks and applications (01.10.2024)“… The purpose of this paper is to process the graphic signal data in enterprise digital equipment and improve the efficiency of algorithm task processing in the system…”
Get full text
Journal Article -
17
IMAGING: In-Memory AlGorithms for Image processiNG
ISSN: 1549-8328, 1558-0806Published: New York IEEE 01.12.2018Published in IEEE transactions on circuits and systems. I, Regular papers (01.12.2018)“…Data-intensive applications such as image processing suffer from massive data movement between memory and processing units…”
Get full text
Journal Article -
18
An Ising Model-Based Parallel Tempering Processing Architecture for Combinatorial Optimization
ISSN: 0278-0070, 1937-4151Published: New York IEEE 01.12.2024Published in IEEE transactions on computer-aided design of integrated circuits and systems (01.12.2024)“… This article presents a novel parallel tempering processing architecture (PTPA) based on the fully connected Ising model to address these issues…”
Get full text
Journal Article -
19
Splitwise: Efficient Generative LLM Inference Using Phase Splitting
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”
Get full text
Conference Proceeding -
20
High-Throughput Non-Binary LDPC Decoder Architecture Using Parallel EMS Algorithm
ISSN: 0018-9200, 1558-173XPublished: New York IEEE 01.10.2022Published in IEEE journal of solid-state circuits (01.10.2022)“…) decoding algorithm that reduces the processing latency of each iteration by managing multiple message entries at a time…”
Get full text
Journal Article