Search results - Parallel and vector implementations

1

Adaptive Particle Swarm Optimization with Heterogeneous Multicore Parallelism and GPU Acceleration by Wachowiak, Mark P., Timson, Mitchell C., DuVal, David J.

ISSN: 1045-9219, 1558-2183

Published: New York IEEE 01.10.2017

Published in IEEE transactions on parallel and distributed systems (01.10.2017)
“… ), which is analyzed for parallelization on readily-available heterogeneous parallel computational hardware …”

Journal Article

2

SLEEF: A Portable Vectorized Library of C Standard Mathematical Functions by Shibata, Naoki, Petrogalli, Francesco

ISSN: 1045-9219, 1558-2183

Published: New York IEEE 01.06.2020

Published in IEEE transactions on parallel and distributed systems (01.06.2020)
“… In order to make the library portable while maintaining good performance, intrinsic functions of vector extensions are abstracted by inline functions or preprocessor macros …”

Journal Article

3

Parallel Implementation of the 2D Discrete Wavelet Transform on Graphics Processing Units: Filter Bank versus Lifting by Tenllado, C., Setoain, J., Prieto, M., Pinuel, L., Tirado, F.

ISSN: 1045-9219, 1558-2183

Published: New York IEEE 01.03.2008

Published in IEEE transactions on parallel and distributed systems (01.03.2008)
“… The widespread usage of the discrete wavelet transform (DWT) has motivated the development of fast DWT algorithms and their tuning on all sorts of computer …”

Journal Article

4

Parallel Implementation of the Ensemble Empirical Mode Decomposition (PEEMD) and Its Application for Earth Science Data Analysis by Shen, Bo-Wen, Cheung, Samson, Wu, Yu-ling, Li, Jui-Lin, Kao, David

ISSN: 1521-9615

Published: IEEE 08.06.2017

Published in Computing in science & engineering (08.06.2017)
“… ), achieving a parallel speedup of 720x using 200 eight-core processors. In this study, we discuss the implementation of the PEEMD and its application for the analysis …”

Journal Article

5

Accelerating Matrix Operations with Improved Deeply Pipelined Vector Reduction by Yi-Gang Tai, Chia-Tien Dan Lo, Psarris, K.

ISSN: 1045-9219, 1558-2183

Published: New York IEEE 01.02.2012

Published in IEEE transactions on parallel and distributed systems (01.02.2012)
“… Many scientific or engineering applications involve matrix operations, in which reduction of vectors is a common operation …”

Journal Article

6

An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs by Jing Wu, JaJa, Joseph, Balaras, Elias

ISSN: 1045-9219, 1558-2183

Published: New York IEEE 01.03.2014

Published in IEEE transactions on parallel and distributed systems (01.03.2014)
“… A highly multithreaded FFT-based direct Poisson solver that makes effective use of the capabilities of the current NVIDIA graphics processing units (GPUs) is …”

Journal Article

7

Optimized FFT computations on heterogeneous platforms with application to the Poisson equation by Wu, Jing, JaJa, Joseph

ISSN: 0743-7315, 1096-0848

Published: Amsterdam Elsevier Inc 01.08.2014

Published in Journal of parallel and distributed computing (01.08.2014)
“… We develop optimized multi-dimensional FFT implementations on CPU–GPU heterogeneous platforms for the case when the input is too large to fit on the GPU global memory, and use the resulting techniques to develop a fast Poisson solver …”

Journal Article

8

Three Applications of GPU Computing in Neuroscience by Baladron Pezoa, Javier, Fasoli, Diego, Faugeras, Olivier

ISSN: 1521-9615, 1558-366X

Published: IEEE 01.05.2012

Published in Computing in science & engineering (01.05.2012)
“… Three scenarios outlined here show the benefits of using a computer system with multiple GPUs in theoretical neuroscience. In each instance, it's clear that …”

Journal Article

9

Performance evaluation and comparison of parallel conjugate gradient on modern multi-core accelerator and massively parallel systems by Sibai, Fadi N., El-Moursy, Ali

ISSN: 1744-5760, 1744-5779

Published: Abingdon Taylor & Francis 02.01.2014

Published in International journal of parallel, emergent and distributed systems (02.01.2014)
“… Two parallel computer paradigms available today are multi-core accelerators such as the Sony, Toshiba and IBM Cell or Graphics Processing Unit (GPUs …”

Journal Article

10

An Optimized Cell BE Special Function Library Generated by Coconut by Anand, C.K., Kahl, W.

ISSN: 0018-9340, 1557-9956

Published: New York IEEE 01.08.2009

Published in IEEE transactions on computers (01.08.2009)
“… ) embedded in Haskell. The DSL supports interactive prototyping and unit testing, simplifying the process of designing efficient implementations of common patterns …”

Journal Article

11

Parallel cryptographic arithmetic using a redundant Montgomery representation by Page, D., Smart, N.P.

ISSN: 0018-9340, 1557-9956

Published: New York IEEE 01.11.2004

Published in IEEE transactions on computers (01.11.2004)
“… representation. We present some preliminary implementation timings using the SSE2 instruction set on a Pentium 4 processor and show that an SIMD parallel implementation of RSA can be around twice as fast as traditional sequential code …”

Journal Article

12

Parallel implementations of randomized vector algorithm for solving large systems of linear equations by Sabelfeld, Karl K., Kireev, Sergey, Kireeva, Anastasiya

ISSN: 0920-8542, 1573-0484

Published: New York Springer US 01.07.2023

Published in The Journal of supercomputing (01.07.2023)
“… The results of a parallel implementation of a randomized vector algorithm for solving systems of linear equations are presented in the paper …”

Journal Article

13

GPU Parallel Implementation of Support Vector Machines for Hyperspectral Image Classification by Tan, Kun, Zhang, Junpeng, Du, Qian, Wang, Xuesong

ISSN: 1939-1404, 2151-1535

Published: Piscataway IEEE 01.10.2015

Published in IEEE journal of selected topics in applied earth observations and remote sensing (01.10.2015)
“… Support vector machine (SVM) is considered as one of the most powerful classifiers for hyperspectral remote sensing images …”

Journal Article

14

Parallel Implementation of MOEA/D with Parallel Weight Vectors for Feature Selection by Liao, Weiduo, Ishibuchi, Hisao, Meng Pang, Lie, Shang, Ke

ISSN: 2577-1655

Published: IEEE 11.10.2020

Published in Conference proceedings - IEEE International Conference on Systems, Man, and Cybernetics (11.10.2020)
“… In machine learning field, feature selection can be treated as a bi-objective optimization problem. It is reported that a decomposition-based evolutionary …”

Conference Proceeding

15

High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous Platform by Jing Wu, JaJa, Joseph

ISBN: 146736066X, 9781467360661

ISSN: 1530-2075

Published: IEEE 01.05.2013

Published in 2013 IEEE 27th International Symposium on Parallel and Distributed Processing (01.05.2013)
“… We develop an optimized FFT based Poisson solver on a CPU-GPU heterogeneous platform for the case when the input is too large to fit on the GPU global memory …”

Conference Proceeding

16

Numerical engineering: design of PDE black-box solvers by Schönauer, Willi

ISSN: 0378-4754, 1872-7166

Published: Amsterdam Elsevier B.V 15.12.2000

Published in Mathematics and computers in simulation (15.12.2000)
“… The design of PDE black-box solvers (for nonlinear systems of elliptic and parabolic PDEs) needs many compromises between efficiency and robustness which we …”

Journal Article, Conference Proceeding

17

Parallel Implementation on FPGA of Support Vector Machines Using Stochastic Gradient Descent by Lopes, Felipe F., Ferreira, João Canas, Fernandes, Marcelo A. C.

ISSN: 2079-9292

Published: Basel MDPI AG 01.06.2019

Published in Electronics (Basel) (01.06.2019)
“… For this reason, accelerators such as Field-programmable Gate Arrays (FPGAs) are used. This work describes an implementation in hardware, using FPGA, of a fully parallel SVM using Stochastic Gradient Descent …”

Journal Article

18

Exploring Parallel Implementation of SPHINCS+ Using Advanced Vector Extensions (AVX) Sets by Zhou, Yaoyun, Rajasekaran, Kavin, Wang, Qian

ISSN: 1948-3295

Published: IEEE 23.04.2025

Published in Proceedings / IEEE International Symposium on Quality Electronic Design (23.04.2025)
“… In this work, we investigate and profile different parallel hash function implementations for SPHINCS …”

Conference Proceeding

19

Compensated summation and dot product algorithms for floating-point vectors on parallel architectures: Error bounds, implementation and application in the Krylov subspace methods by Evstigneev, N.M., Ryabkov, O.I., Bocharov, A.N., Petrovskiy, V.P., Teplyakov, I.O.

ISSN: 0377-0427, 1879-1778

Published: Elsevier B.V 01.11.2022

Published in Journal of computational and applied mathematics (01.11.2022)
“… The compensated parallel variants of summation and dot product operations for floating point vectors are considered …”

Journal Article

20

Parallel implementation of an efficient preconditioned linear solver for grid-based applications in chemical physics. III: Improved parallel scalability for sparse matrix–vector products by Chen, Wenwu, Poirier, Bill

ISSN: 0743-7315, 1096-0848

Published: Amsterdam Elsevier Inc 01.07.2010

Published in Journal of parallel and distributed computing (01.07.2010)
“… In two previous papers [W. Chen, B. Poirier, Parallel implementation of efficient preconditioned linear solver for grid-based applications in chemical physics …”

Journal Article
