Search Results - parallel and vector implementations

Refine Results
  1. 1

    Adaptive Particle Swarm Optimization with Heterogeneous Multicore Parallelism and GPU Acceleration by Wachowiak, Mark P., Timson, Mitchell C., DuVal, David J.

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.10.2017
    “…), which is analyzed for parallelization on readily-available heterogeneous parallel computational hardware…”
    Get full text
    Journal Article
  2. 2

    SLEEF: A Portable Vectorized Library of C Standard Mathematical Functions by Shibata, Naoki, Petrogalli, Francesco

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.06.2020
    “… In order to make the library portable while maintaining good performance, intrinsic functions of vector extensions are abstracted by inline functions or preprocessor macros…”
    Get full text
    Journal Article
  3. 3

    Parallel Implementation of the 2D Discrete Wavelet Transform on Graphics Processing Units: Filter Bank versus Lifting by Tenllado, C., Setoain, J., Prieto, M., Pinuel, L., Tirado, F.

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.03.2008
    “…The widespread usage of the discrete wavelet transform (DWT) has motivated the development of fast DWT algorithms and their tuning on all sorts of computer…”
    Get full text
    Journal Article
  4. 4

    Parallel Implementation of the Ensemble Empirical Mode Decomposition (PEEMD) and Its Application for Earth Science Data Analysis by Shen, Bo-Wen, Cheung, Samson, Wu, Yu-ling, Li, Jui-Lin, Kao, David

    ISSN: 1521-9615
    Published: IEEE 08.06.2017
    Published in Computing in science & engineering (08.06.2017)
    “…), achieving a parallel speedup of 720x using 200 eight-core processors. In this study, we discuss the implementation of the PEEMD and its application for the analysis…”
    Get full text
    Journal Article
  5. 5

    Accelerating Matrix Operations with Improved Deeply Pipelined Vector Reduction by Yi-Gang Tai, Chia-Tien Dan Lo, Psarris, K.

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.02.2012
    “…Many scientific or engineering applications involve matrix operations, in which reduction of vectors is a common operation…”
    Get full text
    Journal Article
  6. 6

    An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs by Jing Wu, JaJa, Joseph, Balaras, Elias

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.03.2014
    “…A highly multithreaded FFT-based direct Poisson solver that makes effective use of the capabilities of the current NVIDIA graphics processing units (GPUs) is…”
    Get full text
    Journal Article
  7. 7

    Optimized FFT computations on heterogeneous platforms with application to the Poisson equation by Wu, Jing, JaJa, Joseph

    ISSN: 0743-7315, 1096-0848
    Published: Amsterdam Elsevier Inc 01.08.2014
    “…We develop optimized multi-dimensional FFT implementations on CPU–GPU heterogeneous platforms for the case when the input is too large to fit on the GPU global memory, and use the resulting techniques to develop a fast Poisson solver…”
    Get full text
    Journal Article
  8. 8

    Three Applications of GPU Computing in Neuroscience by Baladron Pezoa, Javier, Fasoli, Diego, Faugeras, Olivier

    ISSN: 1521-9615, 1558-366X
    Published: IEEE 01.05.2012
    Published in Computing in science & engineering (01.05.2012)
    “…Three scenarios outlined here show the benefits of using a computer system with multiple GPUs in theoretical neuroscience. In each instance, it's clear that…”
    Get full text
    Journal Article
  9. 9

    Performance evaluation and comparison of parallel conjugate gradient on modern multi-core accelerator and massively parallel systems by Sibai, Fadi N., El-Moursy, Ali

    ISSN: 1744-5760, 1744-5779
    Published: Abingdon Taylor & Francis 02.01.2014
    “…Two parallel computer paradigms available today are multi-core accelerators such as the Sony, Toshiba and IBM Cell or Graphics Processing Unit (GPUs…”
    Get full text
    Journal Article
  10. 10

    An Optimized Cell BE Special Function Library Generated by Coconut by Anand, C.K., Kahl, W.

    ISSN: 0018-9340, 1557-9956
    Published: New York IEEE 01.08.2009
    Published in IEEE transactions on computers (01.08.2009)
    “…) embedded in Haskell. The DSL supports interactive prototyping and unit testing, simplifying the process of designing efficient implementations of common patterns…”
    Get full text
    Journal Article
  11. 11

    Parallel cryptographic arithmetic using a redundant Montgomery representation by Page, D., Smart, N.P.

    ISSN: 0018-9340, 1557-9956
    Published: New York IEEE 01.11.2004
    Published in IEEE transactions on computers (01.11.2004)
    “… representation. We present some preliminary implementation timings using the SSE2 instruction set on a Pentium 4 processor and show that an SIMD parallel implementation of RSA can be around twice as fast as traditional sequential code…”
    Get full text
    Journal Article
  12. 12

    Parallel implementations of randomized vector algorithm for solving large systems of linear equations by Sabelfeld, Karl K., Kireev, Sergey, Kireeva, Anastasiya

    ISSN: 0920-8542, 1573-0484
    Published: New York Springer US 01.07.2023
    Published in The Journal of supercomputing (01.07.2023)
    “…The results of a parallel implementation of a randomized vector algorithm for solving systems of linear equations are presented in the paper…”
    Get full text
    Journal Article
  13. 13

    GPU Parallel Implementation of Support Vector Machines for Hyperspectral Image Classification by Tan, Kun, Zhang, Junpeng, Du, Qian, Wang, Xuesong

    ISSN: 1939-1404, 2151-1535
    Published: Piscataway IEEE 01.10.2015
    “…Support vector machine (SVM) is considered as one of the most powerful classifiers for hyperspectral remote sensing images…”
    Get full text
    Journal Article
  14. 14

    Parallel Implementation of MOEA/D with Parallel Weight Vectors for Feature Selection by Liao, Weiduo, Ishibuchi, Hisao, Meng Pang, Lie, Shang, Ke

    ISSN: 2577-1655
    Published: IEEE 11.10.2020
    “…In machine learning field, feature selection can be treated as a bi-objective optimization problem. It is reported that a decomposition-based evolutionary…”
    Get full text
    Conference Proceeding
  15. 15

    High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous Platform by Jing Wu, JaJa, Joseph

    ISBN: 146736066X, 9781467360661
    ISSN: 1530-2075
    Published: IEEE 01.05.2013
    “…We develop an optimized FFT based Poisson solver on a CPU-GPU heterogeneous platform for the case when the input is too large to fit on the GPU global memory…”
    Get full text
    Conference Proceeding
  16. 16

    Numerical engineering: design of PDE black-box solvers by Schönauer, Willi

    ISSN: 0378-4754, 1872-7166
    Published: Amsterdam Elsevier B.V 15.12.2000
    Published in Mathematics and computers in simulation (15.12.2000)
    “…The design of PDE black-box solvers (for nonlinear systems of elliptic and parabolic PDEs) needs many compromises between efficiency and robustness which we…”
    Get full text
    Journal Article Conference Proceeding
  17. 17

    Parallel Implementation on FPGA of Support Vector Machines Using Stochastic Gradient Descent by Lopes, Felipe F., Ferreira, João Canas, Fernandes, Marcelo A. C.

    ISSN: 2079-9292, 2079-9292
    Published: Basel MDPI AG 01.06.2019
    Published in Electronics (Basel) (01.06.2019)
    “… For this reason, accelerators such as Field-programmable Gate Arrays (FPGAs) are used. This work describes an implementation in hardware, using FPGA, of a fully parallel SVM using Stochastic Gradient Descent…”
    Get full text
    Journal Article
  18. 18

    Exploring Parallel Implementation of SPHINCS+ Using Advanced Vector Extensions (AVX) Sets by Zhou, Yaoyun, Rajasekaran, Kavin, Wang, Qian

    ISSN: 1948-3295
    Published: IEEE 23.04.2025
    “… In this work, we investigate and profile different parallel hash function implementations for SPHINCS…”
    Get full text
    Conference Proceeding
  19. 19

    Compensated summation and dot product algorithms for floating-point vectors on parallel architectures: Error bounds, implementation and application in the Krylov subspace methods by Evstigneev, N.M., Ryabkov, O.I., Bocharov, A.N., Petrovskiy, V.P., Teplyakov, I.O.

    ISSN: 0377-0427, 1879-1778
    Published: Elsevier B.V 01.11.2022
    “… The compensated parallel variants of summation and dot product operations for floating point vectors are considered…”
    Get full text
    Journal Article
  20. 20

    Parallel implementation of an efficient preconditioned linear solver for grid-based applications in chemical physics. III: Improved parallel scalability for sparse matrix–vector products by Chen, Wenwu, Poirier, Bill

    ISSN: 0743-7315, 1096-0848
    Published: Amsterdam Elsevier Inc 01.07.2010
    “… In two previous papers [W. Chen, B. Poirier, Parallel implementation of efficient preconditioned linear solver for grid-based applications in chemical physics…”
    Get full text
    Journal Article