Optimizing performance and reliability on heterogeneous parallel systems: Approximation algorithms and heuristics
We study the problem of scheduling tasks (with and without precedence constraints) on a set of related processors which have a probability of failure governed by an exponential law. The goal is to design approximation algorithms or heuristics that optimize both makespan and reliability. First, we sh...
Uloženo v:
| Vydáno v: | Journal of parallel and distributed computing Ročník 72; číslo 2; s. 268 - 280 |
|---|---|
| Hlavní autoři: | , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Elsevier Inc
01.02.2012
Elsevier |
| Témata: | |
| ISSN: | 0743-7315, 1096-0848 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | We study the problem of scheduling tasks (with and without precedence constraints) on a set of related processors which have a probability of failure governed by an exponential law. The goal is to design approximation algorithms or heuristics that optimize both makespan and reliability. First, we show that both objectives are contradictory and that the number of points of the Pareto-front can be exponential. This means that this problem cannot be approximated by a single schedule. Second, for independent unitary tasks, we provide an optimal scheduling algorithm where the objective is to maximize the reliability subject to makespan minimization. For the bi-objective optimization, we provide a
(
1
+
ϵ
,
1
)
-approximation algorithm of the Pareto-front. Next, for independent arbitrary tasks, we propose a
〈
2
̄
,
1
〉
-approximation algorithm (
i.e. for any fixed value of the makespan, the obtained solution is optimal on the reliability and no more than twice the given makespan) that has a much lower complexity than the other existing algorithms. This solution is used to derive a
(
2
+
ϵ
,
1
)
-approximation of the Pareto-front of the problem.
All these proposed solutions are discriminated by the value of the product {
failure rate} × {
unitary instruction execution time} of each processor, which appears to be a crucial parameter in the context of bi-objective optimization. Based on this observation, we provide a general method for converting scheduling heuristics on heterogeneous clusters into heuristics that take into account the reliability when there are precedence constraints. The average behavior is studied by extensive simulations. Finally, we discuss the specific case of scheduling a chain of tasks which leads to optimal results.
► We study the problem of scheduling task in case of failures. ► We target optimizing makespan and reliability. ► We provide approximation algorithms of the Pareto front for important subcases. ► We provide an efficient heuristic for the general case. |
|---|---|
| AbstractList | We study the problem of scheduling tasks (with and without precedence constraints) on a set of related processors which have a probability of failure governed by an exponential law. The goal is to design approximation algorithms or heuristics that optimize both makespan and reliability. First, we show that both objectives are contradictory and that the number of points of the Pareto-front can be exponential. This means that this problem cannot be approximated by a single schedule. Second, for independent unitary tasks, we provide an optimal scheduling algorithm where the objective is to maximize the reliability subject to makespan minimization. For the bi-objective optimization, we provide a (1+ ,1)-approximation algorithm of the Pareto-front. Next, for independent arbitrary tasks, we propose a 1; 2 ; -approximation algorithm (i.e. for any xed value of the makespan, the obtained solution is optimal on the reliability and no more than twice the given makespan) that has a much lower complexity than the other existing algorithms. This solution is used to derive a (2 + ; 1)-approximation of the Pareto-front of the problem. All these proposed solutions are discriminated by the value of the product ffailure rateg funitary instruction execution timeg of each processor, which appears to be a crucial parameter in the context of bi-objective optimization. Based on this observation, we provide a general method for converting scheduling heuristics on heterogeneous clusters into heuristics that take into account the reliability when there are precedence constraints. The average behaviour is studied by extensive simulations. Finally, we discuss the speci c case of scheduling a chain of tasks which leads to optimal results. We study the problem of scheduling tasks (with and without precedence constraints) on a set of related processors which have a probability of failure governed by an exponential law. The goal is to design approximation algorithms or heuristics that optimize both makespan and reliability. First, we show that both objectives are contradictory and that the number of points of the Pareto-front can be exponential. This means that this problem cannot be approximated by a single schedule. Second, for independent unitary tasks, we provide an optimal scheduling algorithm where the objective is to maximize the reliability subject to makespan minimization. For the bi-objective optimization, we provide a ( 1 + [Greek Lunate Epsilon] , 1 ) -approximation algorithm of the Pareto-front. Next, for independent arbitrary tasks, we propose a aOE[copy 2 I" , 1 aOEa -approximation algorithm (i.e. for any fixed value of the makespan, the obtained solution is optimal on the reliability and no more than twice the given makespan) that has a much lower complexity than the other existing algorithms. This solution is used to derive a ( 2 + [Greek Lunate Epsilon] , 1 ) -approximation of the Pareto-front of the problem. All these proposed solutions are discriminated by the value of the product {failure rate} {unitary instruction execution time} of each processor, which appears to be a crucial parameter in the context of bi-objective optimization. Based on this observation, we provide a general method for converting scheduling heuristics on heterogeneous clusters into heuristics that take into account the reliability when there are precedence constraints. The average behavior is studied by extensive simulations. Finally, we discuss the specific case of scheduling a chain of tasks which leads to optimal results. We study the problem of scheduling tasks (with and without precedence constraints) on a set of related processors which have a probability of failure governed by an exponential law. The goal is to design approximation algorithms or heuristics that optimize both makespan and reliability. First, we show that both objectives are contradictory and that the number of points of the Pareto-front can be exponential. This means that this problem cannot be approximated by a single schedule. Second, for independent unitary tasks, we provide an optimal scheduling algorithm where the objective is to maximize the reliability subject to makespan minimization. For the bi-objective optimization, we provide a ( 1 + ϵ , 1 ) -approximation algorithm of the Pareto-front. Next, for independent arbitrary tasks, we propose a 〈 2 ̄ , 1 〉 -approximation algorithm ( i.e. for any fixed value of the makespan, the obtained solution is optimal on the reliability and no more than twice the given makespan) that has a much lower complexity than the other existing algorithms. This solution is used to derive a ( 2 + ϵ , 1 ) -approximation of the Pareto-front of the problem. All these proposed solutions are discriminated by the value of the product { failure rate} × { unitary instruction execution time} of each processor, which appears to be a crucial parameter in the context of bi-objective optimization. Based on this observation, we provide a general method for converting scheduling heuristics on heterogeneous clusters into heuristics that take into account the reliability when there are precedence constraints. The average behavior is studied by extensive simulations. Finally, we discuss the specific case of scheduling a chain of tasks which leads to optimal results. ► We study the problem of scheduling task in case of failures. ► We target optimizing makespan and reliability. ► We provide approximation algorithms of the Pareto front for important subcases. ► We provide an efficient heuristic for the general case. |
| Author | Saule, Erik Trystram, Denis Jeannot, Emmanuel |
| Author_xml | – sequence: 1 givenname: Emmanuel surname: Jeannot fullname: Jeannot, Emmanuel email: emmanuel.jeannot@inria.fr organization: INRIA Bordeaux Sud-Ouest, Talence, France – sequence: 2 givenname: Erik surname: Saule fullname: Saule, Erik organization: BMI, Ohio State University, Columbus 43210, OH, USA – sequence: 3 givenname: Denis surname: Trystram fullname: Trystram, Denis organization: Grenoble Institute of Technology, Grenoble, France |
| BackLink | https://inria.hal.science/hal-00788219$$DView record in HAL |
| BookMark | eNp9kUFrGzEQhUVJoU7aP9DTHtvDOpLWu9KWXkxom4Ihl_YsZrWz9hjtaiPJoc6vr2yHHnoIDAyI7z00712zq8lPyNhHwZeCi-Z2v9zPvV1KLsQyD-fVG7YQvG1Krlf6ii24WlWlqkT9jl3HuOcZrJVesMeHOdFIzzRtixnD4MMIk8UCpr4I6Ag6cpSOhZ-KHSYMfosT-kMsZgjgHLoiHmPCMX4p1vMc_B8aIVGmwW19oLQb49lrh4dAMZGN79nbAVzEDy_7hv3-_u3X3X25efjx8269KW3VqFRWHVddyyVKJaDpa1i1FdTtYJUF2egOZctrLZqm0lZD3SGqoQEhO617gL6vbtjni-8OnJlD_lc4Gg9k7tcbc3rjXGktRfskMvvpwuYLHg8YkxkpWnQOzseanDHXuhVaZlReUBt8jAGHf96Cn7jG7M2pC3PqwuTJXWSR_k9kKZ1zSgHIvS79epFizuqJMJhoCXNFPQW0yfSeXpP_BW0Cqac |
| CitedBy_id | crossref_primary_10_1016_j_jpdc_2014_01_005 crossref_primary_10_1002_cpe_3179 crossref_primary_10_1007_s10723_015_9331_1 crossref_primary_10_1016_j_jnca_2018_03_028 crossref_primary_10_1002_cpe_7252 crossref_primary_10_1007_s10586_014_0372_1 crossref_primary_10_1109_TPDS_2015_2403861 crossref_primary_10_3390_sym15112047 crossref_primary_10_1109_TCC_2016_2608345 crossref_primary_10_32604_cmc_2022_021410 crossref_primary_10_1007_s10586_021_03464_4 crossref_primary_10_1007_s11227_021_03764_x crossref_primary_10_1007_s12652_018_0926_9 crossref_primary_10_1016_j_jpdc_2015_07_002 crossref_primary_10_1109_ACCESS_2020_2970475 crossref_primary_10_1155_2014_496843 crossref_primary_10_1016_j_ins_2016_08_003 |
| Cites_doi | 10.1109/HCW.1998.666545 10.1137/0206013 10.1109/TSE.1987.232562 10.1093/comjnl/bxh086 10.1109/71.207593 10.1109/TEVC.2003.810758 10.1109/TPDS.2009.84 10.1145/1248377.1248423 10.1109/HCW.1999.765092 10.1109/71.993209 10.1016/j.jpdc.2008.11.002 10.1016/j.parco.2010.03.001 10.1137/0217033 10.1109/12.165396 10.1109/SFCS.2000.892068 10.1006/jagm.2001.1184 10.1007/978-3-540-85451-7_94 10.1177/1094342006067469 |
| ContentType | Journal Article |
| Copyright | 2011 Elsevier Inc. Distributed under a Creative Commons Attribution 4.0 International License |
| Copyright_xml | – notice: 2011 Elsevier Inc. – notice: Distributed under a Creative Commons Attribution 4.0 International License |
| DBID | AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D 1XC |
| DOI | 10.1016/j.jpdc.2011.11.003 |
| DatabaseName | CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional Hyper Article en Ligne (HAL) |
| DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Computer and Information Systems Abstracts |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1096-0848 |
| EndPage | 280 |
| ExternalDocumentID | oai:HAL:hal-00788219v1 10_1016_j_jpdc_2011_11_003 S074373151100219X |
| GroupedDBID | --K --M -~X .~1 0R~ 1B1 1~. 1~5 29L 4.4 457 4G. 5GY 5VS 7-5 71M 8P~ 9JN AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN ABBOA ABEFU ABFNM ABFSI ABJNI ABMAC ABTAH ABXDB ABYKQ ACDAQ ACGFS ACNNM ACRLP ACZNC ADBBV ADEZE ADFGL ADHUB ADJOM ADMUD ADTZH AEBSH AECPX AEKER AENEX AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD ASPBG AVWKF AXJTR AZFZN BJAXD BKOJK BLXMC CAG COF CS3 DM4 DU5 E.L EBS EFBJH EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-2 G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ H~9 IHE J1W JJJVA K-O KOM LG5 LG9 LY7 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- RIG ROL RPZ SBC SDF SDG SDP SES SET SEW SPC SPCBC SST SSV SSZ T5K TN5 TWZ WUQ XJT XOL XPP ZMT ZU3 ZY4 ~G- ~G0 9DU AATTM AAXKI AAYWO AAYXX ABDPE ABWVN ACLOT ACRPL ACVFH ADCNI ADNMO ADVLN AEIPS AEUPX AFJKZ AFPUW AGQPQ AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP CITATION EFKBS ~HD 7SC 8FD JQ2 L7M L~C L~D 1XC |
| ID | FETCH-LOGICAL-c367t-3b07b902e271a6d5a493a59fc7ca268be2905816638c8a5bee7f6a12b88daadd3 |
| ISICitedReferencesCount | 18 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000299139000015&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0743-7315 |
| IngestDate | Tue Oct 14 20:52:23 EDT 2025 Sat Sep 27 22:00:36 EDT 2025 Sat Nov 29 07:10:53 EST 2025 Tue Nov 18 21:45:36 EST 2025 Fri Feb 23 02:27:54 EST 2024 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Keywords | Pareto-front approximation Scheduling Makespan Reliability Precedence task graphs Precedence Task Graphs |
| Language | English |
| License | Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0 |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c367t-3b07b902e271a6d5a493a59fc7ca268be2905816638c8a5bee7f6a12b88daadd3 |
| Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
| ORCID | 0000-0002-2623-6922 0000-0002-3956-2997 |
| PQID | 1010889182 |
| PQPubID | 23500 |
| PageCount | 13 |
| ParticipantIDs | hal_primary_oai_HAL_hal_00788219v1 proquest_miscellaneous_1010889182 crossref_primary_10_1016_j_jpdc_2011_11_003 crossref_citationtrail_10_1016_j_jpdc_2011_11_003 elsevier_sciencedirect_doi_10_1016_j_jpdc_2011_11_003 |
| PublicationCentury | 2000 |
| PublicationDate | 2012-02-01 |
| PublicationDateYYYYMMDD | 2012-02-01 |
| PublicationDate_xml | – month: 02 year: 2012 text: 2012-02-01 day: 01 |
| PublicationDecade | 2010 |
| PublicationTitle | Journal of parallel and distributed computing |
| PublicationYear | 2012 |
| Publisher | Elsevier Inc Elsevier |
| Publisher_xml | – name: Elsevier Inc – name: Elsevier |
| References | Koo, Toueg (br000095) 1987; 13 Shatz, Wang (br000135) 1992; 41 Dogan, Ozgüner (br000040) 2002; 13 Dogan, Ozgüner (br000045) 2005; 48 Sih, Lee (br000150) 1993; 4 Oh, Ha (br000115) 1996; vol. 1124 Benoit, Robert, Rosenberg, Vivien (br000010) 2011; 37 Canon, Jeannot (br000025) 2010; 21 A. Benoit, L.-C. Canon, E. Jeannot, Y. Robert, Reliability of task graph schedules with transient and fail-stop failures: complexity and algorithms, Journal of Scheduling (in press). M. Maheswaran, H.J. Siegel, A dynamic matching and scheduling algorithm for heterogeneous computing systems, in: Heterogeneous Computing Workshop, 1998, pp. 57–69. Graham, Lawler, Lenstra, Kan (br000075) 1979; 5 (br000105) 2004 Chekuri, Bender (br000030) 2001; 41 J.J. Dongarra, E. Jeannot, E. Saule, Z. Shi, Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems, in: Proc. of SPAA, 2007, pp. 280–288. Cormen, Leiserson, Rivest, Stein (br000035) 2001 Garey, Johnson (br000055) 1979 Giroudeau, König (br000065) 2007 Hochbaum, Shmoys (br000085) 1988; 17 Pinedo (br000125) 2008 Besseron, Bouguerra, Gautier, Saule, Trystram (br000015) 2009 H. Topcuoglu, S. Hariri, M.-Y. Wu, Task scheduling algorithms for heterogeneous processors, in: 8th IEEE Heterogeneous Computing Workshop (HCW’99), 1999 3–14. Zitzler, Thiele, Laumanns, Fonseca, Grunert da Fonseca (br000160) 2003; 7 Gonzalez, Ibarra, Sahni (br000070) 1977; 6 Legrand, Robert (br000100) 2005 D.B. Shmoys, E. Tardos, Scheduling unrelated machines with costs, in: Proceedings of the Fourth Annual ACM/SIGACT-SIAM Symposium on Discrete Algorithms, 1993, pp. 448–454. . E. Jeannot, E. Saule, D. Trystram, Bi-Objective Approximation Scheme for Makespan and Reliability Optimization on Uniform Parallel Machines, in: The 14th International Euro-Par Conference on Parallel and Distributed Computing (Euro-Par 2008), Las Palmas de Gran Canaria, Spain, 2008. Bouteiller, Herault, Krawezik, Lemarinier, Cappello (br000020) 2006; 20 Girault, Saule, Trystram (br000060) 2009; 69 C.H. Papadimitriou, M. Yannakakis, On the approximability of trade-offs and optimal access of web sources, in: Proc. of FOCS, 2000, pp. 86–92. Sardiña, Boeres, de A. Drummond (br000130) 2010; vol. 6043 Shatz, Wang, Goto (br000140) 1992; 41 M. Hakem, F. Butelle, A bi-objective algortithm for scheduling parallel applications on heterogeneous systems subject to failures, in: Renpar 17, 2006. Garey (10.1016/j.jpdc.2011.11.003_br000055) 1979 Shatz (10.1016/j.jpdc.2011.11.003_br000135) 1992; 41 Pinedo (10.1016/j.jpdc.2011.11.003_br000125) 2008 10.1016/j.jpdc.2011.11.003_br000145 Chekuri (10.1016/j.jpdc.2011.11.003_br000030) 2001; 41 10.1016/j.jpdc.2011.11.003_br000120 Canon (10.1016/j.jpdc.2011.11.003_br000025) 2010; 21 Girault (10.1016/j.jpdc.2011.11.003_br000060) 2009; 69 10.1016/j.jpdc.2011.11.003_br000005 Hochbaum (10.1016/j.jpdc.2011.11.003_br000085) 1988; 17 Besseron (10.1016/j.jpdc.2011.11.003_br000015) 2009 Benoit (10.1016/j.jpdc.2011.11.003_br000010) 2011; 37 Oh (10.1016/j.jpdc.2011.11.003_br000115) 1996; vol. 1124 Koo (10.1016/j.jpdc.2011.11.003_br000095) 1987; 13 Cormen (10.1016/j.jpdc.2011.11.003_br000035) 2001 10.1016/j.jpdc.2011.11.003_br000050 10.1016/j.jpdc.2011.11.003_br000090 Shatz (10.1016/j.jpdc.2011.11.003_br000140) 1992; 41 10.1016/j.jpdc.2011.11.003_br000155 Sih (10.1016/j.jpdc.2011.11.003_br000150) 1993; 4 Zitzler (10.1016/j.jpdc.2011.11.003_br000160) 2003; 7 (10.1016/j.jpdc.2011.11.003_br000105) 2004 Dogan (10.1016/j.jpdc.2011.11.003_br000040) 2002; 13 10.1016/j.jpdc.2011.11.003_br000110 Sardiña (10.1016/j.jpdc.2011.11.003_br000130) 2010; vol. 6043 Giroudeau (10.1016/j.jpdc.2011.11.003_br000065) 2007 Graham (10.1016/j.jpdc.2011.11.003_br000075) 1979; 5 Gonzalez (10.1016/j.jpdc.2011.11.003_br000070) 1977; 6 Bouteiller (10.1016/j.jpdc.2011.11.003_br000020) 2006; 20 Legrand (10.1016/j.jpdc.2011.11.003_br000100) 2005 Dogan (10.1016/j.jpdc.2011.11.003_br000045) 2005; 48 10.1016/j.jpdc.2011.11.003_br000080 |
| References_xml | – reference: D.B. Shmoys, E. Tardos, Scheduling unrelated machines with costs, in: Proceedings of the Fourth Annual ACM/SIGACT-SIAM Symposium on Discrete Algorithms, 1993, pp. 448–454. – volume: 4 start-page: 175 year: 1993 end-page: 187 ident: br000150 article-title: A compile-time scheduling heuristic for interconnection-constrained heterogenous processor architectures publication-title: IEEE Transactions on Parallel and Distributed Systems – volume: 7 start-page: 117 year: 2003 end-page: 132 ident: br000160 article-title: Performance assessment of multiobjective optimizers: an analysis and review publication-title: IEEE Transactions on Evolutionary Computation – volume: 17 start-page: 539 year: 1988 end-page: 551 ident: br000085 article-title: A polynomial approximation scheme for scheduling on uniform processors: using the dual approximation approach publication-title: SIAM Journal on Computing – reference: M. Maheswaran, H.J. Siegel, A dynamic matching and scheduling algorithm for heterogeneous computing systems, in: Heterogeneous Computing Workshop, 1998, pp. 57–69. – volume: 13 start-page: 308 year: 2002 end-page: 323 ident: br000040 article-title: Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing publication-title: IEEE Transactions on Parallel and Distributed Systems – year: 2004 ident: br000105 publication-title: Handbook of Scheduling. Algorithms, Models and Performance Analysis – volume: 21 start-page: 532 year: 2010 end-page: 546 ident: br000025 article-title: Evaluation and optimization of the robustness of dag schedules in heterogeneous environments publication-title: IEEE Transactions on Parallel and Distributed Systems – year: 2008 ident: br000125 article-title: Scheduling: Theory, Algorithms, and Systems – reference: J.J. Dongarra, E. Jeannot, E. Saule, Z. Shi, Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems, in: Proc. of SPAA, 2007, pp. 280–288. – reference: E. Jeannot, E. Saule, D. Trystram, Bi-Objective Approximation Scheme for Makespan and Reliability Optimization on Uniform Parallel Machines, in: The 14th International Euro-Par Conference on Parallel and Distributed Computing (Euro-Par 2008), Las Palmas de Gran Canaria, Spain, 2008. – volume: 20 start-page: 319 year: 2006 end-page: 333 ident: br000020 article-title: MPICH-V: a multiprotocol fault tolerant MPI publication-title: International Journal of High Performance Computing and Applications – year: 2009 ident: br000015 article-title: Fault tolerance and availability awarness in computational grids publication-title: Fundamentals of Grid Computing – volume: 6 start-page: 155 year: 1977 end-page: 166 ident: br000070 article-title: Bounds for LPT schedules on uniform processors publication-title: SIAM Journal of Computing – year: 2005 ident: br000100 article-title: Algorithmique Parallèle – year: 2001 ident: br000035 article-title: Introduction to Algorithms – year: 1979 ident: br000055 article-title: Computers and Intractability – reference: C.H. Papadimitriou, M. Yannakakis, On the approximability of trade-offs and optimal access of web sources, in: Proc. of FOCS, 2000, pp. 86–92. – volume: 37 start-page: 365 year: 2011 end-page: 378 ident: br000010 article-title: Static worksharing strategies for heterogeneous computers with unrecoverable interruptions publication-title: Parallel Computing – volume: vol. 1124 start-page: 573 year: 1996 end-page: 577 ident: br000115 article-title: A static scheduling heuristic for heterogeneous processors publication-title: Euro-Par, Vol. II – volume: 41 start-page: 1156 year: 1992 end-page: 1169 ident: br000135 article-title: Task allocation for maximizing reliability of distribued computer systems publication-title: IEEE Transactions on Computers – reference: H. Topcuoglu, S. Hariri, M.-Y. Wu, Task scheduling algorithms for heterogeneous processors, in: 8th IEEE Heterogeneous Computing Workshop (HCW’99), 1999 3–14. – reference: M. Hakem, F. Butelle, A bi-objective algortithm for scheduling parallel applications on heterogeneous systems subject to failures, in: Renpar 17, 2006. – volume: 41 start-page: 212 year: 2001 end-page: 224 ident: br000030 article-title: An efficient approximation algorithm for minimizing makespan on uniformly related machines publication-title: Journal of Algorithms – volume: 5 start-page: 287 year: 1979 end-page: 326 ident: br000075 article-title: Optimization and approximation in deterministic sequencing and scheduling : a survey, ann publication-title: Discrete Mathematics – volume: 41 start-page: 1156 year: 1992 end-page: 1168 ident: br000140 article-title: Task allocation for maximizing reliability of distributed computer systems publication-title: Computers, IEEE Transactions on – volume: 48 start-page: 300 year: 2005 end-page: 314 ident: br000045 article-title: Bi-objective scheduling algorithms for execution time-reliability trade-off in heterogeneous computing systems publication-title: Computer Journal – volume: 13 start-page: 23 year: 1987 end-page: 31 ident: br000095 publication-title: IEEE Transactions on Software Engineering – year: 2007 ident: br000065 article-title: Multiprocessor Scheduling: Theory and Applications – volume: vol. 6043 start-page: 102 year: 2010 end-page: 111 ident: br000130 article-title: An efficient weighted bi-objective scheduling algorithm for heterogeneous systems publication-title: Euro-Par 2009–Parallel Processing Workshops – reference: A. Benoit, L.-C. Canon, E. Jeannot, Y. Robert, Reliability of task graph schedules with transient and fail-stop failures: complexity and algorithms, Journal of Scheduling (in press). – volume: 69 start-page: 326 year: 2009 end-page: 336 ident: br000060 article-title: Reliability versus performance for critical applications publication-title: Journal of Parallel and Distributed Computing – reference: . – year: 2004 ident: 10.1016/j.jpdc.2011.11.003_br000105 – ident: 10.1016/j.jpdc.2011.11.003_br000110 doi: 10.1109/HCW.1998.666545 – year: 2007 ident: 10.1016/j.jpdc.2011.11.003_br000065 – volume: 6 start-page: 155 year: 1977 ident: 10.1016/j.jpdc.2011.11.003_br000070 article-title: Bounds for LPT schedules on uniform processors publication-title: SIAM Journal of Computing doi: 10.1137/0206013 – volume: 13 start-page: 23 year: 1987 ident: 10.1016/j.jpdc.2011.11.003_br000095 publication-title: IEEE Transactions on Software Engineering doi: 10.1109/TSE.1987.232562 – volume: 48 start-page: 300 issue: 3 year: 2005 ident: 10.1016/j.jpdc.2011.11.003_br000045 article-title: Bi-objective scheduling algorithms for execution time-reliability trade-off in heterogeneous computing systems publication-title: Computer Journal doi: 10.1093/comjnl/bxh086 – volume: vol. 1124 start-page: 573 year: 1996 ident: 10.1016/j.jpdc.2011.11.003_br000115 article-title: A static scheduling heuristic for heterogeneous processors – year: 2008 ident: 10.1016/j.jpdc.2011.11.003_br000125 – volume: 4 start-page: 175 issue: 2 year: 1993 ident: 10.1016/j.jpdc.2011.11.003_br000150 article-title: A compile-time scheduling heuristic for interconnection-constrained heterogenous processor architectures publication-title: IEEE Transactions on Parallel and Distributed Systems doi: 10.1109/71.207593 – volume: 7 start-page: 117 issue: 2 year: 2003 ident: 10.1016/j.jpdc.2011.11.003_br000160 article-title: Performance assessment of multiobjective optimizers: an analysis and review publication-title: IEEE Transactions on Evolutionary Computation doi: 10.1109/TEVC.2003.810758 – year: 2009 ident: 10.1016/j.jpdc.2011.11.003_br000015 article-title: Fault tolerance and availability awarness in computational grids – volume: 21 start-page: 532 issue: 4 year: 2010 ident: 10.1016/j.jpdc.2011.11.003_br000025 article-title: Evaluation and optimization of the robustness of dag schedules in heterogeneous environments publication-title: IEEE Transactions on Parallel and Distributed Systems doi: 10.1109/TPDS.2009.84 – ident: 10.1016/j.jpdc.2011.11.003_br000050 doi: 10.1145/1248377.1248423 – volume: 5 start-page: 287 year: 1979 ident: 10.1016/j.jpdc.2011.11.003_br000075 article-title: Optimization and approximation in deterministic sequencing and scheduling : a survey, ann publication-title: Discrete Mathematics – ident: 10.1016/j.jpdc.2011.11.003_br000080 – ident: 10.1016/j.jpdc.2011.11.003_br000155 doi: 10.1109/HCW.1999.765092 – volume: 13 start-page: 308 issue: 3 year: 2002 ident: 10.1016/j.jpdc.2011.11.003_br000040 article-title: Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing publication-title: IEEE Transactions on Parallel and Distributed Systems doi: 10.1109/71.993209 – volume: 69 start-page: 326 issue: 3 year: 2009 ident: 10.1016/j.jpdc.2011.11.003_br000060 article-title: Reliability versus performance for critical applications publication-title: Journal of Parallel and Distributed Computing doi: 10.1016/j.jpdc.2008.11.002 – volume: 37 start-page: 365 issue: 8 year: 2011 ident: 10.1016/j.jpdc.2011.11.003_br000010 article-title: Static worksharing strategies for heterogeneous computers with unrecoverable interruptions publication-title: Parallel Computing doi: 10.1016/j.parco.2010.03.001 – volume: 17 start-page: 539 issue: 3 year: 1988 ident: 10.1016/j.jpdc.2011.11.003_br000085 article-title: A polynomial approximation scheme for scheduling on uniform processors: using the dual approximation approach publication-title: SIAM Journal on Computing doi: 10.1137/0217033 – volume: vol. 6043 start-page: 102 year: 2010 ident: 10.1016/j.jpdc.2011.11.003_br000130 article-title: An efficient weighted bi-objective scheduling algorithm for heterogeneous systems – volume: 41 start-page: 1156 issue: 9 year: 1992 ident: 10.1016/j.jpdc.2011.11.003_br000140 article-title: Task allocation for maximizing reliability of distributed computer systems publication-title: Computers, IEEE Transactions on doi: 10.1109/12.165396 – ident: 10.1016/j.jpdc.2011.11.003_br000120 doi: 10.1109/SFCS.2000.892068 – volume: 41 start-page: 212 year: 2001 ident: 10.1016/j.jpdc.2011.11.003_br000030 article-title: An efficient approximation algorithm for minimizing makespan on uniformly related machines publication-title: Journal of Algorithms doi: 10.1006/jagm.2001.1184 – year: 1979 ident: 10.1016/j.jpdc.2011.11.003_br000055 – year: 2005 ident: 10.1016/j.jpdc.2011.11.003_br000100 – ident: 10.1016/j.jpdc.2011.11.003_br000145 – ident: 10.1016/j.jpdc.2011.11.003_br000090 doi: 10.1007/978-3-540-85451-7_94 – year: 2001 ident: 10.1016/j.jpdc.2011.11.003_br000035 – ident: 10.1016/j.jpdc.2011.11.003_br000005 – volume: 20 start-page: 319 issue: 3 year: 2006 ident: 10.1016/j.jpdc.2011.11.003_br000020 article-title: MPICH-V: a multiprotocol fault tolerant MPI publication-title: International Journal of High Performance Computing and Applications doi: 10.1177/1094342006067469 – volume: 41 start-page: 1156 issue: 9 year: 1992 ident: 10.1016/j.jpdc.2011.11.003_br000135 article-title: Task allocation for maximizing reliability of distribued computer systems publication-title: IEEE Transactions on Computers doi: 10.1109/12.165396 |
| SSID | ssj0011578 |
| Score | 2.0486946 |
| Snippet | We study the problem of scheduling tasks (with and without precedence constraints) on a set of related processors which have a probability of failure governed... |
| SourceID | hal proquest crossref elsevier |
| SourceType | Open Access Repository Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 268 |
| SubjectTerms | Algorithms Approximation Computer Science Distributed, Parallel, and Cluster Computing Heuristic Makespan Mathematical models Optimization Pareto-front approximation Precedence task graphs Processors Reliability Scheduling Tasks |
| Title | Optimizing performance and reliability on heterogeneous parallel systems: Approximation algorithms and heuristics |
| URI | https://dx.doi.org/10.1016/j.jpdc.2011.11.003 https://www.proquest.com/docview/1010889182 https://inria.hal.science/hal-00788219 |
| Volume | 72 |
| WOSCitedRecordID | wos000299139000015&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: ScienceDirect Freedom Collection 2021 customDbUrl: eissn: 1096-0848 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0011578 issn: 0743-7315 databaseCode: AIEXJ dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3fb9MwELa6jgde-I3W8UMG8TZ1SpyltnmrUNFAaCCtSH2LnMShrdKkdEnV8VfxJ3IXO0lbtAmQeImqKHHcfte7c_zdd4S8EcJzIohU_TBOsCQnTvqQ1LsACEtkAikBZ0nVbIJfXIjJRH7pdH7WtTDrlGeZ2Gzk8r9CDecAbCyd_Qu4m0HhBHwG0OEIsMPxj4D_DE5gMftRVZnvVQWsdDozutzXuEkwRSpMDgNp5MGiCHia6tSKO1dUuSEqjm9mprzxRKXf8tWsmC6MrvNUl1bm-YYMtxkRr45RoReba-mqjm5ZFnXQrOiLKsvyKh6MFjDfUjfEj0tVGsYzeOymqmi8usZXNAvjMY04SvPyAlkgO0SQpqpmh_RZyaZyz5R5nmrjmB0kSwujyll7bs62LJRtu2HTqsdGdGZ6Rf0WLMx7i_npfBlHRswV9Vwdrw2NDWHxEueEU0KFPXDykwNyyLgvRZccDj-MJh-bnSvXN9G__g62UMtwCvefdFMydDBFVu5eclBlPOMH5J4Fkg6NiT0kHZ09IvfrNiDURoXH5HtrcXTL4iigTrcsjuYZ3bE4WtsHtRb3lu7YG23trRqrtbcn5Ov70fjded928uhH3oAXfS90eCgdphl31SD21Zn0lC-TiEcKsAo1k46PO9ieiITyQ615MlAuC4WIFURg7ynpZnmmjwiVWuPedBIOYKEvEyfkEla8Z7HvYAPe0OsRt_5Jg8jK3GO3lTSo-YzzAGEIEAZY_6I4bo-cNPcsjcjLrVf7NVKBTVNN-hmAYd1632uAtXkA6rqfDz8FeA4TdQFmtXZ75FWNegCOHnfvVIUJjoqURFew43-cwDNyt_0PPifdYlXqF-ROtC5mV6uX1pB_ATby1gk |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Optimizing+performance+and+reliability+on+heterogeneous+parallel+systems%3A+Approximation+algorithms+and+heuristics&rft.jtitle=Journal+of+parallel+and+distributed+computing&rft.au=Jeannot%2C+Emmanuel&rft.au=Saule%2C+Erik&rft.au=Trystram%2C+Denis&rft.date=2012-02-01&rft.pub=Elsevier+Inc&rft.issn=0743-7315&rft.eissn=1096-0848&rft.volume=72&rft.issue=2&rft.spage=268&rft.epage=280&rft_id=info:doi/10.1016%2Fj.jpdc.2011.11.003&rft.externalDocID=S074373151100219X |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0743-7315&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0743-7315&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0743-7315&client=summon |