An efficient tree cache coherence protocol for distributed shared memory multiprocessors

Directory schemes have long been used to solve the cache coherence problem for large scale shared memory multiprocessors. In addition, tree-based protocols have been employed to reduce the directory size and the invalidation latency for a large degree of data sharing in the system. However, the exis...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE transactions on computers Ročník 48; číslo 3; s. 352 - 360
Hlavní autoři: Yeimkuan Chang, Bhuyan, L.N.
Médium: Journal Article
Jazyk:angličtina
Vydáno: New York, NY IEEE 01.03.1999
Institute of Electrical and Electronics Engineers
Témata:
ISSN:0018-9340
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Directory schemes have long been used to solve the cache coherence problem for large scale shared memory multiprocessors. In addition, tree-based protocols have been employed to reduce the directory size and the invalidation latency for a large degree of data sharing in the system. However, the existing tree-based protocols involve a very high communication overhead for maintaining a balanced tree, especially when the degree of data sharing is low. This paper presents a new tree-based cache coherence protocol which is a hybrid of the limited directory and the linked list schemes. By utilizing a limited number of pointers in the directory, the proposed protocol connects the nodes caching a shared block in a tree fashion without incurring any communication overhead. In addition to the low communication overhead, the proposed scheme also possesses the advantages of the existing bit-map and tree-based linked list protocols, namely, scalable memory requirement and logarithmic invalidation latency. We evaluate the performance of our protocol by running four applications on the Proteus execution-driven simulator. Our simulation results show that the performance of the proposed protocol is very close to that of the full-map protocol.
AbstractList Directory schemes have long been used to solve the cache coherence problem for large scale shared memory multiprocessors. In addition, tree-based protocols have been employed to reduce the directory size and the invalidation latency for a large degree of data sharing in the system. However, the existing tree-based protocols involve a very high communication overhead for maintaining a balanced tree, especially when the degree of data sharing is low. This paper presents a new tree-based cache coherence protocol which is a hybrid of the limited directory and the linked list schemes. By utilizing a limited number of pointers in the directory, the proposed protocol connects the nodes caching a shared block in a tree fashion without incurring any communication overhead. In addition to the low communication overhead, the proposed scheme also possesses the advantages of the existing bit-map and tree-based linked list protocols, namely, scalable memory requirement and logarithmic invalidation latency. We evaluate the performance of our protocol by running four applications on the Proteus execution-driven simulator. Our simulation results show that the performance of the proposed protocol is very close to that of the full-map protocol.
Directory schemes have long been used to solve the cache coherence problem for large scale shared memory multiprocessors. In addition, tree-based protocols have been employed to reduce the directory size and the invalidation latency for a large degree of data sharing in the system. However, the existing tree-based protocols involve a very high communication overhead for maintaining a balanced tree, especially when the degree of data sharing is low. This paper presents a new tree-based cache coherence protocol which is a hybrid of the limited directory and the linked list schemes. By utilizing a limited number of pointers in the directory, the proposed protocol connects the nodes caching a shared block in a tree fashion without incurring any communication overhead. In addition to the low communication overhead, the proposed scheme also possesses the advantages of the existing bit-map and tree-based linked list protocols, namely, scalable memory requirement and logarithmic invalidation latency. We evaluate the performance of our protocol by running four applications on the Proteus execution-driven simulator. Our simulation results show that the performance of the proposed protocol is very close to that of the full-map protocol
Author Bhuyan, L.N.
Yeimkuan Chang
Author_xml – sequence: 1
  surname: Yeimkuan Chang
  fullname: Yeimkuan Chang
  organization: Dept. of Inf. Manage., Chung-Hua Univ., Taiwan
– sequence: 2
  givenname: L.N.
  surname: Bhuyan
  fullname: Bhuyan, L.N.
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=1752004$$DView record in Pascal Francis
BookMark eNqF0TtPwzAQAGAPRaIFBlamDAjE0PZsJ3E8VhUvqRILSGyW41xUoyQutjv032PUqgNCsNwN_u58upuQ0eAGJOSSwoxSkHPKZqIoAOiIjFOsppLncEomIXwAQMlAjsn7Ysiwba2xOMQsesTMaLNO0a3R42Aw23gXnXFd1jqfNTZEb-ttxCYLa-1T6rF3fpf12y7aZA2G4Hw4Jyet7gJeHPIZeXu4f10-TVcvj8_LxWpqeCnjtGw4NkXDeSOkEYxhDbnO03RFnZecs6LUldCM1zXnGsCU2NJWNBU1ALUWnJ-R233f9PXnFkNUvQ0Gu04P6LZBSSpl2gZjSd78KVmVMyEq8T8s0yIrWSV4fYA6GN21Xg_GBrXxttd-p6goGECe2HzPjHcheGyVsVFH64bote0UBfV9L0WZ2t8rVdz9qDj2_MVe7a1FxKM7PH4BsAWhAg
CODEN ITCOB4
CitedBy_id crossref_primary_10_1109_12_954507
crossref_primary_10_1109_TPDS_2005_4
Cites_doi 10.1145/106972.106995
10.1109/ISCA.1994.288139
10.1145/161541.161544
10.1109/71.205652
10.21236/ADA241343
10.1109/2.55503
10.1145/158439.158907
10.1109/IPPS.1996.508053
10.1109/TC.1978.1675013
10.1109/ISCA.1988.5238
10.1109/2.15
10.1109/2.55504
10.1145/130823.130824
10.1109/SUPERC.1992.236667
10.1145/70082.68205
10.1109/SPDP.1992.242703
10.1109/ISCA.1993.698554
10.1109/IPPS.1993.262852
ContentType Journal Article
Copyright 1999 INIST-CNRS
Copyright_xml – notice: 1999 INIST-CNRS
DBID RIA
RIE
AAYXX
CITATION
IQODW
7SC
8FD
JQ2
L7M
L~C
L~D
7SP
F28
FR3
DOI 10.1109/12.755001
DatabaseName IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Electronic Library (IEL)
CrossRef
Pascal-Francis
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Electronics & Communications Abstracts
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
Electronics & Communications Abstracts
Engineering Research Database
ANTE: Abstracts in New Technology & Engineering
DatabaseTitleList
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
Applied Sciences
EndPage 360
ExternalDocumentID 1752004
10_1109_12_755001
755001
GroupedDBID --Z
-DZ
-~X
.55
.DC
0R~
29I
3EH
3O-
4.4
5GY
5VS
6IK
85S
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABFSI
ABQJQ
ABVLG
ACGFO
ACIWK
ACNCT
AENEX
AETEA
AETIX
AGQYO
AGSQL
AHBIQ
AI.
AIBXA
AKJIK
AKQYR
ALLEH
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
E.L
EBS
EJD
HZ~
H~9
IAAWW
IBMZZ
ICLAB
IEDLZ
IFIPE
IFJZH
IPLJI
JAVBF
LAI
M43
MS~
MVM
O9-
OCL
P2P
PQQKQ
RIA
RIE
RNI
RNS
RXW
RZB
TAE
TN5
TWZ
UHB
UKR
UPT
VH1
X7M
XJT
XOL
XZL
YXB
YYQ
YZZ
ZCG
AAYXX
ABUFD
CITATION
IQODW
RIG
7SC
8FD
JQ2
L7M
L~C
L~D
7SP
F28
FR3
ID FETCH-LOGICAL-c369t-6d3ed5d33d79c722eb04a40065b4633256a87a23bb33a00c6ef1f7d81c00ba733
IEDL.DBID RIE
ISICitedReferencesCount 6
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000079352600008&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0018-9340
IngestDate Thu Oct 02 12:12:25 EDT 2025
Sun Sep 28 11:30:05 EDT 2025
Sun Sep 28 01:30:52 EDT 2025
Mon Jul 21 09:15:16 EDT 2025
Tue Nov 18 21:43:50 EST 2025
Sat Nov 29 03:56:56 EST 2025
Wed Aug 27 02:49:02 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 3
Keywords Distributed memory multiprocessor system
Computer simulation
Cache memory
Storage management
Performance analysis
Parallel computer
Access protocol
Distributed system
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
CC BY 4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c369t-6d3ed5d33d79c722eb04a40065b4633256a87a23bb33a00c6ef1f7d81c00ba733
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
PQID 26934898
PQPubID 23500
PageCount 9
ParticipantIDs crossref_citationtrail_10_1109_12_755001
proquest_miscellaneous_26934898
proquest_miscellaneous_919911022
proquest_miscellaneous_28427787
pascalfrancis_primary_1752004
crossref_primary_10_1109_12_755001
ieee_primary_755001
PublicationCentury 1900
PublicationDate 1999-03-01
PublicationDateYYYYMMDD 1999-03-01
PublicationDate_xml – month: 03
  year: 1999
  text: 1999-03-01
  day: 01
PublicationDecade 1990
PublicationPlace New York, NY
PublicationPlace_xml – name: New York, NY
PublicationTitle IEEE transactions on computers
PublicationTitleAbbrev TC
PublicationYear 1999
Publisher IEEE
Institute of Electrical and Electronics Engineers
Publisher_xml – name: IEEE
– name: Institute of Electrical and Electronics Engineers
References bibT035215
(bibT03523) 1993
bibT035214
bibT035217
bibT03529
gupta (bibT035216) 1990
bibT035211
bibT035221
bibT035210
bibT035213
bibT035212
singh (bibT035220) 1992
brewer (bibT035219) 1991
bibT03525
bibT03526
bibT03528
bibT03521
bibT03522
bibT035218
johnson (bibT03527) 1993
bibT03524
References_xml – ident: bibT035210
  doi: 10.1145/106972.106995
– year: 1993
  ident: bibT03523
  publication-title: IEEE Std 1596-1992 IEEE Standard for Scalable Coherent Interface
– ident: bibT035211
  doi: 10.1109/ISCA.1994.288139
– ident: bibT035214
  doi: 10.1145/161541.161544
– ident: bibT035221
  doi: 10.1109/71.205652
– year: 1991
  ident: bibT035219
  article-title: proteus: a high-performance parallel architecture simulator
  doi: 10.21236/ADA241343
– ident: bibT035212
  doi: 10.1109/2.55503
– ident: bibT03522
  doi: 10.1145/158439.158907
– year: 1990
  ident: bibT035216
  article-title: reducing memory and traffic requirements for scalable directory-based cache coherence schemes
  publication-title: Proc Int l Conf Parallel Processing
– ident: bibT03528
  doi: 10.1109/IPPS.1996.508053
– ident: bibT035213
  doi: 10.1109/TC.1978.1675013
– ident: bibT03529
  doi: 10.1109/ISCA.1988.5238
– ident: bibT03521
  doi: 10.1109/2.15
– ident: bibT03525
  doi: 10.1109/2.55504
– year: 1993
  ident: bibT03527
  article-title: extending the scalable coherent interface for large-scale shared-memory multiprocessors
  publication-title: ?Extending the Scalable Coherent Interface for Large-Scale Shared-Memory Multiprocessors ?
– year: 1992
  ident: bibT035220
  article-title: splash: stanford parallel applications for shared memory
  doi: 10.1145/130823.130824
– ident: bibT035217
  doi: 10.1109/SUPERC.1992.236667
– ident: bibT035218
  doi: 10.1145/70082.68205
– ident: bibT03526
  doi: 10.1109/SPDP.1992.242703
– ident: bibT035215
  doi: 10.1109/ISCA.1993.698554
– ident: bibT03524
  doi: 10.1109/IPPS.1993.262852
SSID ssj0006209
Score 1.587364
Snippet Directory schemes have long been used to solve the cache coherence problem for large scale shared memory multiprocessors. In addition, tree-based protocols...
SourceID proquest
pascalfrancis
crossref
ieee
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 352
SubjectTerms Access methods and protocols, osi model
Applied sciences
Balances (scales)
Broadcasting
Coherence
Computer science; control theory; systems
Computer simulation
Computer systems and distributed systems. User interface
Degradation
Delay
Directories
Exact sciences and technology
Large-scale systems
Lists
Memory and file management (including protection and security)
Memory organisation. Data processing
Multiprocessor
Multiprocessor interconnection networks
Parallel processing
Parallel programming
Protocols
Running
Software
System performance
Telecommunications
Telecommunications and information theory
Teleprocessing networks. Isdn
Trees
Title An efficient tree cache coherence protocol for distributed shared memory multiprocessors
URI https://ieeexplore.ieee.org/document/755001
https://www.proquest.com/docview/26934898
https://www.proquest.com/docview/28427787
https://www.proquest.com/docview/919911022
Volume 48
WOSCitedRecordID wos000079352600008&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Electronic Library (IEL)
  issn: 0018-9340
  databaseCode: RIE
  dateStart: 19680101
  customDbUrl:
  isFulltext: true
  dateEnd: 99991231
  titleUrlDefault: https://ieeexplore.ieee.org/
  omitProxy: false
  ssIdentifier: ssj0006209
  providerName: IEEE
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Nb9QwEB2Vqgc4tHQBsaWlFuLAJa1jZzPxsaqoOFUcWmlvkT9VJEiqzS4S_x6PnSxUFCRuUTSOrPHXTPzmPYD3PobdOgRbGOtigiKbKj4JVWhhG1OjUeh1EpvA6-tmuVSfR57tVAvjvU_gM39Gj-ku3_V2Q7_KzjGG01Sr9QSxzqVa2023ntAcZVy_suIjiVDJ1XkpznLDB0dP0lIhJKQeojNCVrH4Y0NOp8zVwX_17znsj8Eku8ijfwg7vpvBwSTUwMZ1O4Nnv7EOvoDlRcd8Yo6IX2N0K80s0Toz29_l4j9G7A19nCIshrTMEbcuyWJ5x4Y7Aqyzb4TP_cEyGjGXGvSr4SXcXn28ufxUjAILhZW1Whe1k94tnJQOlUUhvOGVrigqMVUtZYyGdINaSGOk1Jzb2ocyoGtKy7nRKOUr2O36zr8GhmGBqILj2ugKQzz0LDHVlcpRjrkIc_gw-b61I_s4iWB8bVMWwlVbijb7bw7vtqb3mXLjMaMZ-X9rML09eTCev9oj0UtVczidxreNy4juRnTn-83QijrOmkY1_7BoKoFxe5sD-4uFIhgZZdBHj3buDTzNnA8EYDuG3fVq409gz35ffxlWb9N0_glfTPXa
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PaxQxFH5IK6gHq6vi2tYG8eBl2kySnUyOpbRUrIuHCnsb8pMKOiM7u0L_--YlM6vFWvA2DC9DePn13uR73wfw3sewW4dgC2NdTFB4LeITU4VmtjaVNEp6ncQm5HxeLxbqy8CznWphvPcJfOYP8THd5bvOrvFX2ZGM4TTWam3PREx7crHWZtutRjxHGVcwF3SgESqpOirZYW566_BJaiqIhdR9dEfIOhZ_bcnpnDnb-a8ePoOnQzhJjvP4P4cHvp3AzijVQIaVO4Enf_AOvoDFcUt84o6IXyN4L00sEjsT213l8j-C_A1dnCQkBrXEIbsuCmN5R_orhKyTH4jQvSYZj5iLDbpl_xK-np1enpwXg8RCYXmlVkXluHczx7mTykrGvKFCC4xLjKg4j_GQrqVm3BjONaW28qEM0tWlpdRoyfkr2Gq71r8GIsNMShUc1UYLGeKxZ5GrrlQOs8xZmMKH0feNHfjHUQbje5PyEKqakjXZf1N4tzH9mUk37jKaoP83BuPb_Vvj-bu9RIIpMYWDcXybuJDwdkS3vlv3DavirKlVfY9FLZiMG9wUyD8sFALJMId-c2fnDuDR-eXni-bi4_zTLjzODBAIZ9uDrdVy7ffhof21-tYv36apfQNFzPkh
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=An+efficient+tree+cache+coherence+protocol+for+distributed+shared+memory+multiprocessors&rft.jtitle=IEEE+transactions+on+computers&rft.au=YEIMKUAN+CHANG&rft.au=BHUYAN%2C+L.+N&rft.date=1999-03-01&rft.pub=Institute+of+Electrical+and+Electronics+Engineers&rft.issn=0018-9340&rft.volume=48&rft.issue=3&rft.spage=352&rft.epage=360&rft_id=info:doi/10.1109%2F12.755001&rft.externalDBID=n%2Fa&rft.externalDocID=1752004
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9340&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9340&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9340&client=summon