Search results - Parallel and vector implementation

  1.

    Adaptive Particle Swarm Optimization with Heterogeneous Multicore Parallelism and GPU Acceleration by Wachowiak, Mark P., Timson, Mitchell C., DuVal, David J.

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.10.2017
    “… ), which is analyzed for parallelization on readily-available heterogeneous parallel computational hardware …”
    Full text
    Journal Article
  2.

    SLEEF: A Portable Vectorized Library of C Standard Mathematical Functions by Shibata, Naoki, Petrogalli, Francesco

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.06.2020
    “… In order to make the library portable while maintaining good performance, intrinsic functions of vector extensions are abstracted by inline functions or preprocessor macros …”
    Full text
    Journal Article
  3.

    Parallel Implementation of the 2D Discrete Wavelet Transform on Graphics Processing Units: Filter Bank versus Lifting by Tenllado, C., Setoain, J., Prieto, M., Pinuel, L., Tirado, F.

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.03.2008
    “… The widespread usage of the discrete wavelet transform (DWT) has motivated the development of fast DWT algorithms and their tuning on all sorts of computer …”
    Full text
    Journal Article
  4.

    Parallel Implementation of the Ensemble Empirical Mode Decomposition (PEEMD) and Its Application for Earth Science Data Analysis by Shen, Bo-Wen, Cheung, Samson, Wu, Yu-ling, Li, Jui-Lin, Kao, David

    ISSN: 1521-9615
    Published: IEEE 08.06.2017
    Published in Computing in science & engineering (08.06.2017)
    “… ), achieving a parallel speedup of 720x using 200 eight-core processors. In this study, we discuss the implementation of the PEEMD and its application for the analysis …”
    Full text
    Journal Article
  5.

    GPU Parallel Implementation of Support Vector Machines for Hyperspectral Image Classification by Tan, Kun, Zhang, Junpeng, Du, Qian, Wang, Xuesong

    ISSN: 1939-1404, 2151-1535
    Published: Piscataway IEEE 01.10.2015
    “… Support vector machine (SVM) is considered as one of the most powerful classifiers for hyperspectral remote sensing images …”
    Full text
    Journal Article
  6.

    Parallel implementations of randomized vector algorithm for solving large systems of linear equations by Sabelfeld, Karl K., Kireev, Sergey, Kireeva, Anastasiya

    ISSN: 0920-8542, 1573-0484
    Published: New York Springer US 01.07.2023
    Published in The Journal of supercomputing (01.07.2023)
    “… The results of a parallel implementation of a randomized vector algorithm for solving systems of linear equations are presented in the paper …”
    Full text
    Journal Article
  7.

    Accelerating Matrix Operations with Improved Deeply Pipelined Vector Reduction by Yi-Gang Tai, Chia-Tien Dan Lo, Psarris, K.

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.02.2012
    “… Many scientific or engineering applications involve matrix operations, in which reduction of vectors is a common operation …”
    Full text
    Journal Article
  8.

    Parallel Implementation of MOEA/D with Parallel Weight Vectors for Feature Selection by Liao, Weiduo, Ishibuchi, Hisao, Meng Pang, Lie, Shang, Ke

    ISSN: 2577-1655
    Published: IEEE 11.10.2020
    “… In machine learning field, feature selection can be treated as a bi-objective optimization problem. It is reported that a decomposition-based evolutionary …”
    Full text
    Conference Proceeding
  9.

    A Parallel and Vector Implementation of Circuit Simulation on Cray Supercomputers by Bataineh, Abdulla, Aamodt, Mike, Thomas, Kevin

    ISSN: 1063-7192
    Published: Taylor & Francis Group 01.07.1999
    Published in Parallel algorithms and applications (01.07.1999)
    “… A speedup of 40 times on 16 vector processors was achieved for MOSFET transistor model evaluation component …”
    Full text
    Journal Article
  10.

    An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs by Jing Wu, JaJa, Joseph, Balaras, Elias

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.03.2014
    “… A highly multithreaded FFT-based direct Poisson solver that makes effective use of the capabilities of the current NVIDIA graphics processing units (GPUs) is …”
    Full text
    Journal Article
  11.

    Optimized FFT computations on heterogeneous platforms with application to the Poisson equation by Wu, Jing, JaJa, Joseph

    ISSN: 0743-7315, 1096-0848
    Published: Amsterdam Elsevier Inc 01.08.2014
    Published in Journal of parallel and distributed computing (01.08.2014)
    “… We develop optimized multi-dimensional FFT implementations on CPU–GPU heterogeneous platforms for the case when the input is too large to fit on the GPU global memory, and use the resulting techniques to develop a fast Poisson solver …”
    Full text
    Journal Article
  12.

    Three Applications of GPU Computing in Neuroscience by Baladron Pezoa, Javier, Fasoli, Diego, Faugeras, Olivier

    ISSN: 1521-9615, 1558-366X
    Published: IEEE 01.05.2012
    Published in Computing in science & engineering (01.05.2012)
    “… Three scenarios outlined here show the benefits of using a computer system with multiple GPUs in theoretical neuroscience. In each instance, it's clear that …”
    Full text
    Journal Article
  13.

    Parallel Implementation on FPGA of Support Vector Machines Using Stochastic Gradient Descent by Lopes, Felipe F., Ferreira, João Canas, Fernandes, Marcelo A. C.

    ISSN: 2079-9292
    Published: Basel MDPI AG 01.06.2019
    Published in Electronics (Basel) (01.06.2019)
    “… For this reason, accelerators such as Field-programmable Gate Arrays (FPGAs) are used. This work describes an implementation in hardware, using FPGA, of a fully parallel SVM using Stochastic Gradient Descent …”
    Full text
    Journal Article
  14.

    Exploring Parallel Implementation of SPHINCS+ Using Advanced Vector Extensions (AVX) Sets by Zhou, Yaoyun, Rajasekaran, Kavin, Wang, Qian

    ISSN: 1948-3295
    Published: IEEE 23.04.2025
    “… In this work, we investigate and profile different parallel hash function implementations for SPHINCS …”
    Full text
    Conference Proceeding
  15.

    Compensated summation and dot product algorithms for floating-point vectors on parallel architectures: Error bounds, implementation and application in the Krylov subspace methods by Evstigneev, N.M., Ryabkov, O.I., Bocharov, A.N., Petrovskiy, V.P., Teplyakov, I.O.

    ISSN: 0377-0427, 1879-1778
    Published: Elsevier B.V 01.11.2022
    Published in Journal of computational and applied mathematics (01.11.2022)
    “… The compensated parallel variants of summation and dot product operations for floating point vectors are considered …”
    Full text
    Journal Article
  16.

    Performance evaluation and comparison of parallel conjugate gradient on modern multi-core accelerator and massively parallel systems by Sibai, Fadi N., El-Moursy, Ali

    ISSN: 1744-5760, 1744-5779
    Published: Abingdon Taylor & Francis 02.01.2014
    “… Two parallel computer paradigms available today are multi-core accelerators such as the Sony, Toshiba and IBM Cell or Graphics Processing Unit (GPUs …”
    Full text
    Journal Article
  17.

    Parallel-and-vector implementation of the event-driven logic simulation algorithm on the Cray Y-MP supercomputer by Bataineh, A., Ozguner, F.

    ISBN: 9780818626302, 0818626305
    Published: IEEE Comput. Soc. Press 1992
    Published in Supercomputing '92 (1992)
    “… The authors propose logic simulation techniques using parallel and vector machines to reduce the simulation time of large digital circuits …”
    Full text
    Conference Proceeding
  18.

    Parallel-and-vector implementation of the event-driven logic simulation algorithm on the Cray Y-MP supercomputer by Bataineh, A., Özgüner, F.

    ISBN: 9780818626302, 0818626305
    Published: Los Alamitos, CA, USA IEEE Computer Society Press 01.12.1992
    Full text
    Conference Proceeding
  19.

    Graphics Card Computing for Cosmology: Cholesky Factorization by Gratton, Steven

    ISBN: 1424475473, 9781424475476
    Published: IEEE 01.06.2010
    “… Cosmological data sets are becoming so large as to make optimal statistical analyses of them impossible. Even with approximations made, the computational …”
    Full text
    Conference Proceeding
  20.

    Parallel cryptographic arithmetic using a redundant Montgomery representation by Page, D., Smart, N.P.

    ISSN: 0018-9340, 1557-9956
    Published: New York IEEE 01.11.2004
    Published in IEEE transactions on computers (01.11.2004)
    “… representation. We present some preliminary implementation timings using the SSE2 instruction set on a Pentium 4 processor and show that an SIMD parallel implementation of RSA can be around twice as fast as traditional sequential code …”
    Full text
    Journal Article