SPMD OpenMP versus MPI on a IBM SMP for 3 Kernels of the NAS Benchmarks.

Saved in:
Bibliographic Details
Title: SPMD OpenMP versus MPI on a IBM SMP for 3 Kernels of the NAS Benchmarks.
Authors: Krawezik, Géraud, Alléon, Guillaume, Cappello, Franck
Source: High Performance Computing (9783540436744); 2006, p425-436, 12p
Abstract: Shared Memory Multiprocessors are becoming more popular since they are used to deploy large parallel computers. The current trend is to enlarge the number of processors inside such multiprocessor nodes. However a lot of existing applications are using the message passing paradigm even when running on shared memory machines. This is due to three main factors: 1) the legacy of previous versions written for distributed memory computers, 2) the difficulty to obtain high performances with OpenMP when using loop level parallelization and 3) the complexity of writing multithreaded programs using a low level thread library. In this paper we demonstrate that OpenMP can provide better performance than MPI on SMP machines. We use a coarse grain parallelization approach, also known as the SPMD programming style with OpenMP. The performance evaluation considers the IBM SP3 NH2 and three kernels of the NAS benchmark: FT, CG and MG. We compare three implementations of them: the NAS 2.3 MPI, a fine grain (loop level) OpenMP version and our SPMD OpenMP version. A breakdown of the execution times provides an explanation of the performance results. [ABSTRACT FROM AUTHOR]
Copyright of High Performance Computing (9783540436744) is the property of Springer Nature / Books and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
FullText Text:
  Availability: 0
Header DbId: edb
DbLabel: Complementary Index
An: 76719148
RelevancyScore: 848
AccessLevel: 6
PubType: Book
PubTypeId: book
PreciseRelevancyScore: 847.6142578125
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: SPMD OpenMP versus MPI on a IBM SMP for 3 Kernels of the NAS Benchmarks.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Krawezik%2C+Géraud%22">Krawezik, Géraud</searchLink><br /><searchLink fieldCode="AR" term="%22Alléon%2C+Guillaume%22">Alléon, Guillaume</searchLink><br /><searchLink fieldCode="AR" term="%22Cappello%2C+Franck%22">Cappello, Franck</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: High Performance Computing (9783540436744); 2006, p425-436, 12p
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Shared Memory Multiprocessors are becoming more popular since they are used to deploy large parallel computers. The current trend is to enlarge the number of processors inside such multiprocessor nodes. However a lot of existing applications are using the message passing paradigm even when running on shared memory machines. This is due to three main factors: 1) the legacy of previous versions written for distributed memory computers, 2) the difficulty to obtain high performances with OpenMP when using loop level parallelization and 3) the complexity of writing multithreaded programs using a low level thread library. In this paper we demonstrate that OpenMP can provide better performance than MPI on SMP machines. We use a coarse grain parallelization approach, also known as the SPMD programming style with OpenMP. The performance evaluation considers the IBM SP3 NH2 and three kernels of the NAS benchmark: FT, CG and MG. We compare three implementations of them: the NAS 2.3 MPI, a fine grain (loop level) OpenMP version and our SPMD OpenMP version. A breakdown of the execution times provides an explanation of the performance results. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of High Performance Computing (9783540436744) is the property of Springer Nature / Books and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=76719148
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1007/3-540-47847-7_39
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 12
        StartPage: 425
    Titles:
      – TitleFull: SPMD OpenMP versus MPI on a IBM SMP for 3 Kernels of the NAS Benchmarks.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Krawezik, Géraud
      – PersonEntity:
          Name:
            NameFull: Alléon, Guillaume
      – PersonEntity:
          Name:
            NameFull: Cappello, Franck
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Text: 2006
              Type: published
              Y: 2006
          Identifiers:
            – Type: isbn-print
              Value: 9783540436744
          Titles:
            – TitleFull: High Performance Computing (9783540436744)
              Type: main
ResultId 1