Sorting on the SGI Origin 2000: comparing MPI and shared memory implementations

Analyses the C/sup 3/-Radix (Communication- and Cache-Conscious Radix) sort algorithm, using the distributed and the shared memory parallel programming models. C/sup 3/-Radix was originally proposed based on the idea of the classic Radix sort to exploit the memory hierarchy locality and to reduce th...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings. SCCC'99 XIX International Conference of the Chilean Computer Science Society pp. 209 - 215
Main Authors: Jimenez-Gonzalez, D., Guinovart, E., Larriba-Pey, J.-L., Navarro, J.J.
Format: Conference Proceeding
Language:English
Published: IEEE 1999
Subjects:
ISBN:0769502962, 9780769502960
ISSN:1522-4902
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Analyses the C/sup 3/-Radix (Communication- and Cache-Conscious Radix) sort algorithm, using the distributed and the shared memory parallel programming models. C/sup 3/-Radix was originally proposed based on the idea of the classic Radix sort to exploit the memory hierarchy locality and to reduce the amount of communication for distributed memory computers. We implement C/sup 3/-Radix on the SGI Origin 2000 NUMA multiprocessor and make use of the Message Passing Interface (MPI) and the native shared memory directives of that computer to implement the two programming models that we want to analyse. We give results for up to 16 processors and 64 million 32-bit keys. The results show that for data sets that are small compared to the number of processors, the MPI implementation is faster, while for data sets that are large, the shared memory implementation is faster. In this paper, we explain the reasons for the different behaviours depending on the size of the data sets.
AbstractList Analyses the C/sup 3/-Radix (Communication- and Cache-Conscious Radix) sort algorithm, using the distributed and the shared memory parallel programming models. C/sup 3/-Radix was originally proposed based on the idea of the classic Radix sort to exploit the memory hierarchy locality and to reduce the amount of communication for distributed memory computers. We implement C/sup 3/-Radix on the SGI Origin 2000 NUMA multiprocessor and make use of the Message Passing Interface (MPI) and the native shared memory directives of that computer to implement the two programming models that we want to analyse. We give results for up to 16 processors and 64 million 32-bit keys. The results show that for data sets that are small compared to the number of processors, the MPI implementation is faster, while for data sets that are large, the shared memory implementation is faster. In this paper, we explain the reasons for the different behaviours depending on the size of the data sets.
Author Guinovart, E.
Navarro, J.J.
Jimenez-Gonzalez, D.
Larriba-Pey, J.-L.
Author_xml – sequence: 1
  givenname: D.
  surname: Jimenez-Gonzalez
  fullname: Jimenez-Gonzalez, D.
  organization: Dept. d'Arquitectura de Comput., Univ. Politecnica de Catalunya, Barcelona, Spain
– sequence: 2
  givenname: E.
  surname: Guinovart
  fullname: Guinovart, E.
– sequence: 3
  givenname: J.-L.
  surname: Larriba-Pey
  fullname: Larriba-Pey, J.-L.
– sequence: 4
  givenname: J.J.
  surname: Navarro
  fullname: Navarro, J.J.
BookMark eNotj0FLwzAYhgNOcJ3exVP-QOuXtGnyeZOiszCp0N1H23zdImta0l7271Xm6T08Dw-8EVv50RNjjwISIQCf66IoEoGIiREgEG5YBDpHBRJzuWJroaSMMwR5x6J5_gaQv0SsWVWPYXH-yEfPlxPxelvyKrij81wCwAvvxmFqwp_x-VXyxls-n5pAlg80jOHC3TCdaSC_NIsb_XzPbvvmPNPD_27Y_v1tX3zEu2pbFq-72BlcYqWktag6CZhRpowxrbSdxUxrkirV1Btt807qHiyQ6KzQqW7bLhW9zrBN0w17umYdER2m4IYmXA7X6-kPNKZNyA
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SCCC.1999.810190
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EndPage 215
ExternalDocumentID 810190
GroupedDBID 23M
29O
29R
6IE
6IF
6IH
6IK
6IL
6IM
6IN
AAJGR
AAWTH
ABLEC
ACGFS
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
M43
OCL
RIE
RIL
RNS
ID FETCH-LOGICAL-i89t-552dd95c2094e45888b2dcd9477e2537ef87d6c27f0d0e1cd1737bbc31f749b33
IEDL.DBID RIE
ISBN 0769502962
9780769502960
ISSN 1522-4902
IngestDate Tue Aug 26 16:57:22 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i89t-552dd95c2094e45888b2dcd9477e2537ef87d6c27f0d0e1cd1737bbc31f749b33
PageCount 7
ParticipantIDs ieee_primary_810190
PublicationCentury 1900
PublicationDate 19990000
PublicationDateYYYYMMDD 1999-01-01
PublicationDate_xml – year: 1999
  text: 19990000
PublicationDecade 1990
PublicationTitle Proceedings. SCCC'99 XIX International Conference of the Chilean Computer Science Society
PublicationTitleAbbrev SCCC
PublicationYear 1999
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0020291
ssj0000443293
Score 1.4193976
Snippet Analyses the C/sup 3/-Radix (Communication- and Cache-Conscious Radix) sort algorithm, using the distributed and the shared memory parallel programming models....
SourceID ieee
SourceType Publisher
StartPage 209
SubjectTerms Computer architecture
Contracts
Graphics
Memory architecture
Message passing
Multiprocessor interconnection networks
Postal services
Programming profession
Silicon
Sorting
Title Sorting on the SGI Origin 2000: comparing MPI and shared memory implementations
URI https://ieeexplore.ieee.org/document/810190
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NS8MwGA46PHiazonf5OC1Ls3HkngtTge6DTZkt9F8VHdYO_Yh-O9N0nZD8OKtDYGUJO3b5837PA8A98y4MJZ6-3amdUQdYI6ExCLSipnYge0sC5Ys7698MBDTqRxVOtuBC2OtDcVn9sFfhrN8U-itT5V1vBiVdPj8kPNuSdXapVMQpQTvKZFuFFlKpTqsRSWqELtkrr2LK-Gd-h7Vx5dIdsZJkngGn9tBYbBfpish5vSa_3raE9Dec_fgaBeVTsGBzVugWZs3wOpdPgPDceEFBD5gkUP3EwjHz304DC5Z0PNqHmFZne57vI36MM0NXH_6anW48MW533C-qEvPw9Ztg0nvaZK8RJW7QjQXchMxho1xC4QdvrOerioUNtpIyrnFjHCbCW66GvMMGWRjbWJOuFKaxBmnUhFyDhp5kdsLADnNGFOZRYb5726qlEXUCKElS7Ul-hK0_OzMlqV-xqycmKs_W6_BcSmM4JMcN6CxWW3tLTjSX5v5enUX1vwH5_CkUQ
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1bS8MwGA0yBX2azol38-BrXZomS-JrcTrcDTZkb6O5VPewVnYR_Pcmabsh-OJbGwIpSdqv58t3zgHgnmobxhJn306VCogFzAEXmAdKUh1asJ2m3pLlrccGAz6dilGps-25MMYYX3xmHtylP8vXudq4VFnLiVEJi8_3KSEYFWStbUIFERLhHSnSjiMKsVSLtohAJWYX1La3cSm9U92j6gATidY4jmPH4bN7yA_3y3bFR51O_V_PewyaO_YeHG3j0gnYM1kD1Cv7Bli-zadgOM6dhMA7zDNofwPh-LkLh94nCzpmzSMs6tNdj_6oC5NMw9WHq1eHC1ee-w3ni6r43G_eJph0nibxS1D6KwRzLtYBpVhru0TYIjzjCKtcYq20IIwZTCNmUs50W2GWIo1MqHTIIialisKUESGj6AzUsjwz5wAyklIqU4M0dV_eREqDiOZcCZooE6kL0HCzM_ssFDRmxcRc_tl6Bw5fJv3erNcdvF6Bo0ImwaU8rkFtvdyYG3Cgvtbz1fLWr_8PqcWnmA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings.+SCCC%2799+XIX+International+Conference+of+the+Chilean+Computer+Science+Society&rft.atitle=Sorting+on+the+SGI+Origin+2000%3A+comparing+MPI+and+shared+memory+implementations&rft.au=Jimenez-Gonzalez%2C+D.&rft.au=Guinovart%2C+E.&rft.au=Larriba-Pey%2C+J.-L.&rft.au=Navarro%2C+J.J.&rft.date=1999-01-01&rft.pub=IEEE&rft.isbn=9780769502960&rft.issn=1522-4902&rft.spage=209&rft.epage=215&rft_id=info:doi/10.1109%2FSCCC.1999.810190&rft.externalDocID=810190
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1522-4902&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1522-4902&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1522-4902&client=summon