Search Results - software tools for parallel programming

Refine Results
  1. 1

    From Design Patterns to Parallel Architectural Skeletons by Goswami, Dhrubajyoti, Singh, Ajit, Preiss, Bruno R.

    ISSN: 0743-7315, 1096-0848
    Published: San Diego, CA Elsevier Inc 01.04.2002
    “…The concept of design patterns has been extensively studied and applied in the context of object-oriented software design…”
    Get full text
    Journal Article
  2. 2

    The Direction of Development Parallel Programming of Software Tools Software Complexes Military Purpose by Lutikovov, Igor V.

    ISSN: 1999-494X, 2313-6057
    Published: 01.02.2022
    “… (with minimal execution time) implemented in parallel software development tools for multi-core (multiprocessor…”
    Get full text
    Journal Article
  3. 3

    Splitwise: Efficient Generative LLM Inference Using Phase Splitting by Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Shah, Aashaka, Goiri, Inigo, Maleki, Saeed, Bianchini, Ricardo

    Published: IEEE 29.06.2024
    “…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”
    Get full text
    Conference Proceeding
  4. 4

    Polygeist: Raising C to Polyhedral MLIR by Moses, William S., Chelini, Lorenzo, Zhao, Ruizhe, Zinenko, Oleksandr

    Published: IEEE 01.09.2021
    “…We present Polygeist, a new compilation flow that connects the MLIR compiler infrastructure to cutting edge polyhedral optimization tools…”
    Get full text
    Conference Proceeding
  5. 5

    SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory by Chen, Jinfan, Gomez-Luna, Juan, El Hajj, Izzat, Guo, Yuxin, Mutlu, Onur

    Published: IEEE 21.10.2023
    “… This paper presents a new software framework, SimplePIM, to aid programming real PIM systems…”
    Get full text
    Conference Proceeding
  6. 6

    The Direction of Development Parallel Programming of Software Tools Software Complexes Military Purpose by Aksenov, Michael A, Lutikovov, Igor V

    ISSN: 1999-494X, 2313-6057
    Published: Krasnoyarsk Siberian Federal University 01.01.2022
    “…В статье рассмотрены вопросы классификационного выбора предпочтительных алгоритмов распараллеливания (с минимальным временем выполнения), реализованных в…”
    Get full text
    Journal Article
  7. 7

    UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules by Agrawal, Aditya, Nandivada, V. Krishna

    Published: IEEE 21.10.2023
    “… (for example, iterations of a parallel-for-loop). While OpenMP allows synchronization among these threads, many classes of computations can be conveniently expressed by specifying synchronization among the parallel activities…”
    Get full text
    Conference Proceeding
  8. 8

    LLM-Based Java Concurrent Program to ArkTS Converter by Liu, Runlin, Lin, Yuhang, Hu, Yunge, Zhang, Zhe, Gao, Xiang

    ISSN: 2643-1572
    Published: ACM 27.10.2024
    “… However, HarmonyOS utilizes ArkTS, a superset of TypeScript, as the programming language for application development…”
    Get full text
    Conference Proceeding
  9. 9

    Efficient Execution of OpenMP on GPUs by Huber, Joseph, Cornelius, Melanie, Georgakoudis, Giorgis, Tian, Shilei, Diaz, Jose M Monsalve, Dinel, Kuter, Chapman, Barbara, Doerfert, Johannes

    Published: IEEE 02.04.2022
    “…OpenMP is the preferred choice for CPU parallelism in High-Performance-Computing (HPC) applications written in C, C++, or Fortran. As HPC systems became…”
    Get full text
    Conference Proceeding
  10. 10

    PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices by Noh, Si Ung, Hong, Junguk, Lim, Chaemin, Park, Seongyeon, Kim, Jeehyun, Kim, Hanjun, Kim, Youngsok, Lee, Jinho

    Published: IEEE 29.06.2024
    “… Many highly parallel applications have been shown to benefit from these PIM-enabled DIMMs, but further speedup is often limited by the huge overhead of inter-PE collective communication…”
    Get full text
    Conference Proceeding
  11. 11

    Cognitive Correlative Encoding for Genome Sequence Matching in Hyperdimensional System by Poduval, Prathyush, Zou, Zhuowen, Yin, Xunzhao, Sadredini, Elaheh, Imani, Mohsen

    Published: IEEE 05.12.2021
    “… In this paper, we propose HYPERS, a novel framework supporting highly efficient and parallel pattern matching based on HyperDimensional computing (HDC…”
    Get full text
    Conference Proceeding
  12. 12

    Architecture-Aware Currying by Kandemir, Mahmut Taylan, Akbulut, Gulsum Gudukbay, Choi, Wonil, Karakoy, Mustafa

    Published: IEEE 21.10.2023
    “…In near-data computing (NDC), computation is brought into data, as opposed to bringing data to computation. While there is prior work focusing on different NDC…”
    Get full text
    Conference Proceeding
  13. 13
  14. 14

    BOLT: Optimizing OpenMP Parallel Regions with User-Level Threads by Iwasaki, Shintaro, Amer, Abdelhalim, Taura, Kenjiro, Seo, Sangmin, Balaji, Pavan

    ISSN: 2641-7936
    Published: IEEE 01.09.2019
    “… As a result, multiple levels of the software stack use OpenMP independently of one another, often leading to nested parallel regions…”
    Get full text
    Conference Proceeding
  15. 15

    A Finer-Grained Blocking Analysis for Parallel Real-Time Tasks with Spin-Locks by Chen, Zewei, Lei, Hang, Yang, Maolin, Liao, Yong, Qiao, Lei

    Published: IEEE 05.12.2021
    “…Real-time synchronization is one of the essential theories in real-time systems, and the recent booming of parallel real-time tasks has brought new challenges to the synchronization analysis…”
    Get full text
    Conference Proceeding
  16. 16

    HiSpTRSV: Exploring Tile-Level Parallelism for SpTRSV Acceleration on FPGAs by Sun, Fan, Dong, Fang, Shen, Dian

    Published: IEEE 22.06.2025
    “… HiSpTRSV addresses these challenges through dependency graph parsing, tile-based highly parallel algorithm, filtering mechanisms, and bidirectional matching with modular indexing…”
    Get full text
    Conference Proceeding
  17. 17

    One Automaton to Rule Them All: Beyond Multiple Regular Expressions Execution by Cicolini, Luisa, Carloni, Filippo, Santambrogio, Marco D., Conficconi, Davide

    ISSN: 2643-2838
    Published: IEEE 02.03.2024
    “…Regular Expressions (REs) matching is crucial to identify strings exhibiting certain morphological properties in a data stream, resulting paramount in contexts…”
    Get full text
    Conference Proceeding
  18. 18

    A Framework for Fine-Grained Synchronization of Dependent GPU Kernels by Jangda, Abhinav, Maleki, Saeed, Dehnavi, Maryam Mehri, Musuvathi, Madan, Saarikivi, Olli

    ISSN: 2643-2838
    Published: IEEE 02.03.2024
    “…Machine Learning (ML) models execute several parallel computations including Generalized Matrix Multiplication, Convolution, Dropout, etc…”
    Get full text
    Conference Proceeding
  19. 19

    Synergically Rebalancing Parallel Execution via DCT and Turbo Boosting by Marques, Sandro M., Medeiros, Thiarles S., Rossi, Fabio D., Luizelli, Marcelo C., Beck, Antonio Carlos S., Lorenzon, Arthur F.

    Published: IEEE 05.12.2021
    “… Many dynamic concurrency throttling (DCT) techniques have successfully used to tune the number of executing threads to better balance a parallel application according to its available scalability…”
    Get full text
    Conference Proceeding
  20. 20