Multi-failure fault-tolerance of embedded loops on hypercubes: issues and performance study
The authors study the multi-failure fault-tolerance of hypercubes. Reconfiguration algorithms are proposed to reallocate the function of failed nodes to spare nodes so the communication structure of the interrupted parallel algorithms is preserved. Both clustered fault and concurrent fault are consi...
Uloženo v:
| Vydáno v: | Parallel and Distributed Processing, 2nd IEEE Symposium On s. 511 - 518 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
IEEE Comput. Soc. Press
1990
|
| Témata: | |
| ISBN: | 0818620870, 9780818620874 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | The authors study the multi-failure fault-tolerance of hypercubes. Reconfiguration algorithms are proposed to reallocate the function of failed nodes to spare nodes so the communication structure of the interrupted parallel algorithms is preserved. Both clustered fault and concurrent fault are considered. Loops are selected as the embedded communication structures, where a wide variety of applications have been implemented. In early work, two classes of fault-tolerant embedded loops, Mapping II and III, have been designed and proved one-step reconfigurable for any single failure. The authors derive from shortest path algorithms a distributed reconfiguration algorithm for multiple failures on these embedded loops. Proof of reconfigurability for clustered fault is conducted for Mapping III. Performance of both mappings is evaluated by simulation with parameters such as the average number of tolerable failures, the average number of job migrations, and the utilization rate of nodes.< > |
|---|---|
| AbstractList | The authors study the multi-failure fault-tolerance of hypercubes. Reconfiguration algorithms are proposed to reallocate the function of failed nodes to spare nodes so the communication structure of the interrupted parallel algorithms is preserved. Both clustered fault and concurrent fault are considered. Loops are selected as the embedded communication structures, where a wide variety of applications have been implemented. In early work, two classes of fault-tolerant embedded loops, Mapping II and III, have been designed and proved one-step reconfigurable for any single failure. The authors derive from shortest path algorithms a distributed reconfiguration algorithm for multiple failures on these embedded loops. Proof of reconfigurability for clustered fault is conducted for Mapping III. Performance of both mappings is evaluated by simulation with parameters such as the average number of tolerable failures, the average number of job migrations, and the utilization rate of nodes.< > |
| Author | Liang, C.T. Tsai, W.T. |
| Author_xml | – sequence: 1 givenname: C.T. surname: Liang fullname: Liang, C.T. organization: Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA – sequence: 2 givenname: W.T. surname: Tsai fullname: Tsai, W.T. organization: Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA |
| BookMark | eNotT81KxDAYDKigu-5dPOUFuiZt0k28yfoLKy64Nw_Ll-QLVtqmJO2hb29wncswAzPDLMh5H3ok5IazNedM333uH_drrnWWopJanJEFU1zVJVMbdklWKf2wDCmzJa7I1_vUjk3hoWmniNRDlsUYWozQW6TBU-wMOoeOtiEMiYaefs8DRjsZTPe0SWnCRKF3NJs-xO4vl8bJzdfkwkObcPXPS3J4fjpsX4vdx8vb9mFXNEqPRcksgLBcKW-MKK1VRjhmaglCliiEq_N8VVtR8Y2SHAC0495p4ByZYVW1JLen2gYRj0NsOojz8fS--gUeUlP_ |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/SPDP.1990.143594 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EndPage | 518 |
| ExternalDocumentID | 143594 |
| GroupedDBID | 6IE 6IK 6IL AAJGR AAWTH ACGHX ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK OCL RIB RIC RIE RIL |
| ID | FETCH-LOGICAL-i89t-20caa4c188fbb42cc8b4d0b65a452e44d6edd36c4317851aaa9d1fd9a11e0b033 |
| IEDL.DBID | RIE |
| ISBN | 0818620870 9780818620874 |
| IngestDate | Tue Aug 26 17:03:07 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i89t-20caa4c188fbb42cc8b4d0b65a452e44d6edd36c4317851aaa9d1fd9a11e0b033 |
| PageCount | 8 |
| ParticipantIDs | ieee_primary_143594 |
| PublicationCentury | 1900 |
| PublicationDate | 19900000 |
| PublicationDateYYYYMMDD | 1990-01-01 |
| PublicationDate_xml | – year: 1990 text: 19900000 |
| PublicationDecade | 1990 |
| PublicationTitle | Parallel and Distributed Processing, 2nd IEEE Symposium On |
| PublicationTitleAbbrev | SPDP |
| PublicationYear | 1990 |
| Publisher | IEEE Comput. Soc. Press |
| Publisher_xml | – name: IEEE Comput. Soc. Press |
| SSID | ssj0000558624 |
| Score | 1.2081757 |
| Snippet | The authors study the multi-failure fault-tolerance of hypercubes. Reconfiguration algorithms are proposed to reallocate the function of failed nodes to spare... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 511 |
| SubjectTerms | Clustering algorithms Computational modeling Computer networks Computer science Fault tolerance Hardware Hypercubes Large-scale systems Network topology Parallel algorithms |
| Title | Multi-failure fault-tolerance of embedded loops on hypercubes: issues and performance study |
| URI | https://ieeexplore.ieee.org/document/143594 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELZoxcBUKEW85YHVNEmdxGYFKqYqEh0qMVR-nEWlEldNwu_HdtJWSCxsvgxny-fzXe783SH04JNlkEhJMiYZoc4lIJI7fcypD7pp4WxG22win83YYsGLrs52wMIAQHh8Bo9-GHL52qrGh8rG3rZz2kO9PM9bqNY-nBKlqYc6hAqPsRtF7hx29XV2NN1lKSM-fi9eCg_Uc2Tg-au3SjAt08G_FnWKRgeIHi72xucMHUE5RINdjwbcqew5-ggIW2LEyr8_x0Y4ktR2Db6hBmBrMHxJcLePxmtrNxW2Jf50_6Zb1UionnCQS4VFqfHmgDHAoSrtCM2nr_PnN9I1VCArxmunEEoIqmLGjJQ0UYpJqiOZpYKmCVCqMzfdJFPep3COmBCC69hoLuIYIhlNJheoX9oSLhE2SmeRMbHkwCiwRLqz4Hh7TiBzKq7Q0O_UctOWzFi2m3T959cbdOIl0cY1blG_3jZwh47Vd72qtvdBzD_AzKeM |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT8MwDI5gIMFpMIZ4kwPXsrZLu4QrMA0xpkrsMInDlIcjJo1mWjt-P0nabULiwi3uwYniOHbtfDZCdy5ZBrEQQUoFDYh1CQLBrD72iAu6KW5tRtVsojca0cmEZXWdbY-FAQD_-Azu3dDn8pWRKxcq6zjbzsgu2ksIiaMKrLUJqIRJ4sAOvsZjZEehPYl1hZ01TdZ5ypB13rOnzEH1LOm5_uqu4o1Lv_mvZR2h9hakh7ON-TlGO5C3UHPdpQHXSnuCPjzGNtB85l6gY80tGZRmDq6lBmCjMXwJsPePwnNjFgU2Of60f6dLuRJQPGAvmQLzXOHFFmWAfV3aNhr3n8ePg6BuqRDMKCutSkjOiYwo1UKQWEoqiApFmnCSxECISu103VQ6r8K6YpxzpiKtGI8iCEXY7Z6iRm5yOENYS5WGWkeCASVAY2FPg-XtOIHoEX6OWm6npouqaMa02qSLP7_eooPB-G04Hb6MXi_RoZNKFeW4Qo1yuYJrtC-_y1mxvPEi_wGktqrT |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Parallel+and+Distributed+Processing%2C+2nd+IEEE+Symposium+On&rft.atitle=Multi-failure+fault-tolerance+of+embedded+loops+on+hypercubes%3A+issues+and+performance+study&rft.au=Liang%2C+C.T.&rft.au=Tsai%2C+W.T.&rft.date=1990-01-01&rft.pub=IEEE+Comput.+Soc.+Press&rft.isbn=9780818620874&rft.spage=511&rft.epage=518&rft_id=info:doi/10.1109%2FSPDP.1990.143594&rft.externalDocID=143594 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780818620874/lc.gif&client=summon&freeimage=true |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780818620874/mc.gif&client=summon&freeimage=true |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780818620874/sc.gif&client=summon&freeimage=true |

