Search results - software tools for parallel programming

  1.

    From Design Patterns to Parallel Architectural Skeletons by Goswami, Dhrubajyoti, Singh, Ajit, Preiss, Bruno R.

    ISSN: 0743-7315, 1096-0848
    Published: San Diego, CA Elsevier Inc 01.04.2002
    Published in Journal of parallel and distributed computing (01.04.2002)
    “… The concept of design patterns has been extensively studied and applied in the context of object-oriented software design …”
    Full Text
    Journal Article
  2.

    The Direction of Development Parallel Programming of Software Tools Software Complexes Military Purpose by Aksenov, Michael A, Lutikovov, Igor V

    ISSN: 1999-494X, 2313-6057
    Published: Krasnoyarsk Siberian Federal University 01.01.2022
    “… The article considers the classification-based selection of preferred parallelization algorithms (with minimal execution time) implemented in …”
    Full Text
    Journal Article
  3.

    The Direction of Development Parallel Programming of Software Tools Software Complexes Military Purpose by Lutikovov, Igor V.

    ISSN: 1999-494X, 2313-6057
    Published: 01.02.2022
    “… (with minimal execution time) implemented in parallel software development tools for multi-core (multiprocessor …”
    Full Text
    Journal Article
  4.

    Splitwise: Efficient Generative LLM Inference Using Phase Splitting by Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Shah, Aashaka, Goiri, Inigo, Maleki, Saeed, Bianchini, Ricardo

    Published: IEEE 29.06.2024
    “… Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our …”
    Full Text
    Conference Proceeding
  5.

    Polygeist: Raising C to Polyhedral MLIR by Moses, William S., Chelini, Lorenzo, Zhao, Ruizhe, Zinenko, Oleksandr

    Published: IEEE 01.09.2021
    “… We present Polygeist, a new compilation flow that connects the MLIR compiler infrastructure to cutting edge polyhedral optimization tools …”
    Full Text
    Conference Proceeding
  6.

    SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory by Chen, Jinfan, Gomez-Luna, Juan, El Hajj, Izzat, Guo, Yuxin, Mutlu, Onur

    Published: IEEE 21.10.2023
    “… This paper presents a new software framework, SimplePIM, to aid programming real PIM systems …”
    Full Text
    Conference Proceeding
  7.

    UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules by Agrawal, Aditya, Nandivada, V. Krishna

    Published: IEEE 21.10.2023
    “… (for example, iterations of a parallel-for-loop). While OpenMP allows synchronization among these threads, many classes of computations can be conveniently expressed by specifying synchronization among the parallel activities …”
    Full Text
    Conference Proceeding
  8.

    LLM-Based Java Concurrent Program to ArkTS Converter by Liu, Runlin, Lin, Yuhang, Hu, Yunge, Zhang, Zhe, Gao, Xiang

    ISSN: 2643-1572
    Published: ACM 27.10.2024
    “… However, HarmonyOS utilizes ArkTS, a superset of TypeScript, as the programming language for application development …”
    Full Text
    Conference Proceeding
  9.

    Efficient Execution of OpenMP on GPUs by Huber, Joseph, Cornelius, Melanie, Georgakoudis, Giorgis, Tian, Shilei, Diaz, Jose M Monsalve, Dinel, Kuter, Chapman, Barbara, Doerfert, Johannes

    Published: IEEE 02.04.2022
    “… OpenMP is the preferred choice for CPU parallelism in High-Performance-Computing (HPC) applications written in C, C++, or Fortran. As HPC systems became …”
    Full Text
    Conference Proceeding
  10.

    PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices by Noh, Si Ung, Hong, Junguk, Lim, Chaemin, Park, Seongyeon, Kim, Jeehyun, Kim, Hanjun, Kim, Youngsok, Lee, Jinho

    Published: IEEE 29.06.2024
    “… Many highly parallel applications have been shown to benefit from these PIM-enabled DIMMs, but further speedup is often limited by the huge overhead of inter-PE collective communication …”
    Full Text
    Conference Proceeding
  11.

    Cognitive Correlative Encoding for Genome Sequence Matching in Hyperdimensional System by Poduval, Prathyush, Zou, Zhuowen, Yin, Xunzhao, Sadredini, Elaheh, Imani, Mohsen

    Published: IEEE 05.12.2021
    “… In this paper, we propose HYPERS, a novel framework supporting highly efficient and parallel pattern matching based on HyperDimensional computing (HDC …”
    Full Text
    Conference Proceeding
  12.

    Architecture-Aware Currying by Kandemir, Mahmut Taylan, Akbulut, Gulsum Gudukbay, Choi, Wonil, Karakoy, Mustafa

    Published: IEEE 21.10.2023
    “… In near-data computing (NDC), computation is brought into data, as opposed to bringing data to computation. While there is prior work focusing on different NDC …”
    Full Text
    Conference Proceeding
  13.
  14.

    BOLT: Optimizing OpenMP Parallel Regions with User-Level Threads by Iwasaki, Shintaro, Amer, Abdelhalim, Taura, Kenjiro, Seo, Sangmin, Balaji, Pavan

    ISSN: 2641-7936
    Published: IEEE 01.09.2019
    “… As a result, multiple levels of the software stack use OpenMP independently of one another, often leading to nested parallel regions …”
    Full Text
    Conference Proceeding
  15.

    A Finer-Grained Blocking Analysis for Parallel Real-Time Tasks with Spin-Locks by Chen, Zewei, Lei, Hang, Yang, Maolin, Liao, Yong, Qiao, Lei

    Published: IEEE 05.12.2021
    “… Real-time synchronization is one of the essential theories in real-time systems, and the recent booming of parallel real-time tasks has brought new challenges to the synchronization analysis …”
    Full Text
    Conference Proceeding
  16.

    HiSpTRSV: Exploring Tile-Level Parallelism for SpTRSV Acceleration on FPGAs by Sun, Fan, Dong, Fang, Shen, Dian

    Published: IEEE 22.06.2025
    “… HiSpTRSV addresses these challenges through dependency graph parsing, tile-based highly parallel algorithm, filtering mechanisms, and bidirectional matching with modular indexing …”
    Full Text
    Conference Proceeding
  17.

    One Automaton to Rule Them All: Beyond Multiple Regular Expressions Execution by Cicolini, Luisa, Carloni, Filippo, Santambrogio, Marco D., Conficconi, Davide

    ISSN: 2643-2838
    Published: IEEE 02.03.2024
    “… Regular Expressions (REs) matching is crucial to identify strings exhibiting certain morphological properties in a data stream, resulting paramount in contexts …”
    Full Text
    Conference Proceeding
  18.

    A Framework for Fine-Grained Synchronization of Dependent GPU Kernels by Jangda, Abhinav, Maleki, Saeed, Dehnavi, Maryam Mehri, Musuvathi, Madan, Saarikivi, Olli

    ISSN: 2643-2838
    Published: IEEE 02.03.2024
    “… Machine Learning (ML) models execute several parallel computations including Generalized Matrix Multiplication, Convolution, Dropout, etc …”
    Full Text
    Conference Proceeding
  19.

    Synergically Rebalancing Parallel Execution via DCT and Turbo Boosting by Marques, Sandro M., Medeiros, Thiarles S., Rossi, Fabio D., Luizelli, Marcelo C., Beck, Antonio Carlos S., Lorenzon, Arthur F.

    Published: IEEE 05.12.2021
    “… Many dynamic concurrency throttling (DCT) techniques have successfully used to tune the number of executing threads to better balance a parallel application according to its available scalability …”
    Full Text
    Conference Proceeding
  20.

    CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions by Singh, Sawan, Feliu, Josue, Acacio, Manuel E., Jimborean, Alexandra, Ros, Alberto

    Published: IEEE 21.10.2023
    “… ) property imposed by modern programming languages. To leverage this observation …”
    Full Text
    Conference Proceeding