New results for the Longest Haplotype Reconstruction problem
The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotype...
Uložené v:
| Vydané v: | Discrete Applied Mathematics Ročník 160; číslo 9; s. 1299 - 1310 |
|---|---|
| Hlavný autor: | |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
Elsevier B.V
01.06.2012
|
| Predmet: | |
| ISSN: | 0166-218X, 1872-6771 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotypes of an individual. In this paper we consider one of the variants of the Single Individual Haplotyping problem, the Longest Haplotyping Reconstruction (LHR) problem. We prove that the LHR problem is NP-hard even in the restricted case when the input matrix is error-free. Furthermore, we investigate the approximation complexity of the problem, and we show that the problem cannot be approximated within factor 2logδnm for any constant δ<1, unless NP⊆DTIME[2polylognm]. Finally, we exhibit a fixed-parameter algorithm for the LHR problem, where the parameter is the size of the two reconstructed haplotypes. |
|---|---|
| AbstractList | The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotypes of an individual. In this paper we consider one of the variants of the Single Individual Haplotyping problem, the Longest Haplotyping Reconstruction (LHR) problem. We prove that the LHR problem is NP-hard even in the restricted case when the input matrix is error-free. Furthermore, we investigate the approximation complexity of the problem, and we show that the problem cannot be approximated within factor 2logδnm for any constant δ<1, unless NP⊆DTIME[2polylognm]. Finally, we exhibit a fixed-parameter algorithm for the LHR problem, where the parameter is the size of the two reconstructed haplotypes. The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotypes of an individual. In this paper we consider one of the variants of the Single Individual Haplotyping problem, the Longest Haplotyping Reconstruction (LHR) problem. We prove that the LHR problem is NP - hard even in the restricted case when the input matrix is error-free. Furthermore, we investigate the approximation complexity of the problem, and we show that the problem cannot be approximated within factor 2 log delta n m for any constant delta < 1 , unless NP [subE] D T I M E [ 2 p o l y log n m ] . Finally, we exhibit a fixed-parameter algorithm for the LHR problem, where the parameter is the size of the two reconstructed haplotypes. |
| Author | Dondi, Riccardo |
| Author_xml | – sequence: 1 givenname: Riccardo surname: Dondi fullname: Dondi, Riccardo email: riccardo.dondi@unibg.it organization: Dipartimento di Scienze dei Linguaggi, della Comunicazione e degli Studi Culturali, Università degli Studi di Bergamo, Via Donizetti 3, 24129 Bergamo, Italy |
| BookMark | eNp9kE9Lw0AQxRepYK1-AG85eknd2TSbBL1IUSsUBVHwtuxOppqSZOPuRum3d0s9exre8N78-Z2ySW97YuwC-Bw4yKvtvNbdXHCAqOccFkdsCmUhUlkUMGHT6JGpgPL9hJ16v-WcQ1RTdvNEP4kjP7bBJxvrkvBJydr2H-RDstJDa8NuoOSF0PY-uBFDY_tkcNa01J2x441uPZ3_1Rl7u797Xa7S9fPD4_J2naIoRUhBouSLMs_qMtNopDGmxHxRgMmq2EKqKlkXudR5JiqsctSIdVHlnLQ2Qppsxi4Pc-PerzFeprrGI7Wt7smOXgEXouIZzyFa4WBFZ713tFGDazrtdtGk9qTUVkVSak9q34qkYub6kKH4w3dDTnlsqEeqG0cYVG2bf9K_athy0g |
| Cites_doi | 10.1287/ijoc.1040.0073 10.1145/210332.210337 10.1016/S0304-3975(98)00158-3 10.1007/BF02945456 10.1016/S0166-218X(96)00062-5 10.1007/s00453-007-0029-z 10.1145/278298.278306 10.1137/S009753979223842X |
| ContentType | Journal Article |
| Copyright | 2011 Elsevier B.V. |
| Copyright_xml | – notice: 2011 Elsevier B.V. |
| DBID | 6I. AAFTH AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D |
| DOI | 10.1016/j.dam.2011.10.014 |
| DatabaseName | ScienceDirect Open Access Titles Elsevier:ScienceDirect:Open Access CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
| DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Computer and Information Systems Abstracts |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Mathematics Biology |
| EISSN | 1872-6771 |
| EndPage | 1310 |
| ExternalDocumentID | 10_1016_j_dam_2011_10_014 S0166218X11003891 |
| GroupedDBID | -~X 6I. AAFTH ADEZE AFTJW AI. ALMA_UNASSIGNED_HOLDINGS FA8 FDB OAUVE VH1 WUQ AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D |
| ID | FETCH-LOGICAL-c282t-16c604853d83acb6bbb8c5471b39d83ce996d756a5329c95caccd7950eaab26b3 |
| ISICitedReferencesCount | 1 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000303286800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0166-218X |
| IngestDate | Fri Jul 11 15:11:14 EDT 2025 Sat Nov 29 02:59:31 EST 2025 Sat Apr 29 22:45:10 EDT 2023 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 9 |
| Keywords | Haplotyping Approximation complexity Fixed-parameter algorithms Computational biology |
| Language | English |
| License | http://www.elsevier.com/open-access/userlicense/1.0 https://www.elsevier.com/tdm/userlicense/1.0 https://www.elsevier.com/open-access/userlicense/1.0 |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c282t-16c604853d83acb6bbb8c5471b39d83ce996d756a5329c95caccd7950eaab26b3 |
| Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
| OpenAccessLink | https://dx.doi.org/10.1016/j.dam.2011.10.014 |
| PQID | 1022903051 |
| PQPubID | 23500 |
| PageCount | 12 |
| ParticipantIDs | proquest_miscellaneous_1022903051 crossref_primary_10_1016_j_dam_2011_10_014 elsevier_sciencedirect_doi_10_1016_j_dam_2011_10_014 |
| PublicationCentury | 2000 |
| PublicationDate | June 2012 2012-06-00 20120601 |
| PublicationDateYYYYMMDD | 2012-06-01 |
| PublicationDate_xml | – month: 06 year: 2012 text: June 2012 |
| PublicationDecade | 2010 |
| PublicationTitle | Discrete Applied Mathematics |
| PublicationYear | 2012 |
| Publisher | Elsevier B.V |
| Publisher_xml | – name: Elsevier B.V |
| References | Bonizzoni, Della Vedova, Dondi, Li (br000025) 2003; 18 Chen, Zhao, Zhang, Zhu (br000030) 2010 Karger, Motwani, Ramkumar (br000060) 1995; 24 Hein, Jiang, Wang, Zhang (br000050) 1996; 71 Alon, Yuster, Zwick (br000010) 1995; 42 Downey, Fellows (br000040) 1999 Greenberg, Hart, Lancia (br000045) 2004; 16 Niedermeier (br000070) 2006 Alimonti, Kann (br000005) 2000; 237 Arora, Lund, Motwani, Sudan, Szegedy (br000015) 1998; 45 Ausiello, Crescenzi, Gambosi, Kann, Marchetti-Spaccamela, Protasi (br000020) 1999 Cilibrasi, van Iersel, Kelk, Tromp (br000035) 2007; 49 Jiang, Li (br000055) 1995; 24 Lancia, Bafna, Istrail, Lippert, Schwartz (br000065) 2001; vol. 2161 Schaefer (br000075) 1978 Arora (10.1016/j.dam.2011.10.014_br000015) 1998; 45 Chen (10.1016/j.dam.2011.10.014_br000030) 2010 Ausiello (10.1016/j.dam.2011.10.014_br000020) 1999 Hein (10.1016/j.dam.2011.10.014_br000050) 1996; 71 Lancia (10.1016/j.dam.2011.10.014_br000065) 2001; vol. 2161 Niedermeier (10.1016/j.dam.2011.10.014_br000070) 2006 Alimonti (10.1016/j.dam.2011.10.014_br000005) 2000; 237 Greenberg (10.1016/j.dam.2011.10.014_br000045) 2004; 16 Jiang (10.1016/j.dam.2011.10.014_br000055) 1995; 24 Alon (10.1016/j.dam.2011.10.014_br000010) 1995; 42 Schaefer (10.1016/j.dam.2011.10.014_br000075) 1978 Downey (10.1016/j.dam.2011.10.014_br000040) 1999 Cilibrasi (10.1016/j.dam.2011.10.014_br000035) 2007; 49 Bonizzoni (10.1016/j.dam.2011.10.014_br000025) 2003; 18 Karger (10.1016/j.dam.2011.10.014_br000060) 1995; 24 |
| References_xml | – volume: 24 start-page: 1122 year: 1995 end-page: 1139 ident: br000055 article-title: On the approximation of shortest common supersequences and longest common subsequences publication-title: SIAM Journal on Computing – volume: 49 start-page: 13 year: 2007 end-page: 36 ident: br000035 article-title: The complexity of the single individual SNP haplotyping problem publication-title: Algorithmica – year: 1999 ident: br000020 article-title: Complexity and Approximation: Combinatorial Optimization Problems and their Approximability Properties – start-page: 173 year: 2010 end-page: 183 ident: br000030 article-title: New aspects of haplotype inference from SNP fragments publication-title: A Practical Guide to Bioinformatics Analysis – volume: 237 start-page: 123 year: 2000 end-page: 134 ident: br000005 article-title: Some APX-completeness results for cubic graphs publication-title: Theoretical Computer Science – volume: 18 start-page: 675 year: 2003 end-page: 688 ident: br000025 article-title: The haplotyping problem: an overview of computational models and solutions publication-title: Journal of Computer Science and Technology – year: 2006 ident: br000070 article-title: Invitation to Fixed-Parameter Algorithms – volume: 42 start-page: 844 year: 1995 end-page: 856 ident: br000010 article-title: Color-coding publication-title: Journal of the ACM – start-page: 216 year: 1978 end-page: 226 ident: br000075 article-title: The complexity of satisfiability problems publication-title: Tenth Annual ACM Symposium on Theory of Computing – volume: vol. 2161 start-page: 182 year: 2001 end-page: 193 ident: br000065 article-title: SNPs problems, complexity and algorithms publication-title: ESA 2001 – year: 1999 ident: br000040 article-title: Parameterized Complexity – volume: 24 start-page: 1122 year: 1995 end-page: 1139 ident: br000060 article-title: On approximating the longest path in a graph publication-title: SIAM Journal on Computing – volume: 16 start-page: 211 year: 2004 end-page: 231 ident: br000045 article-title: Opportunities for combinatorial optimization in computational biology publication-title: INFORMS Journal on Computing – volume: 45 start-page: 501 year: 1998 end-page: 555 ident: br000015 article-title: Proof verification and the hardness of approximation problems publication-title: Journal of the ACM – volume: 71 start-page: 153 year: 1996 end-page: 169 ident: br000050 article-title: On the complexity of comparing evolutionary trees publication-title: Discrete Applied Mathematics – volume: 16 start-page: 211 issue: 3 year: 2004 ident: 10.1016/j.dam.2011.10.014_br000045 article-title: Opportunities for combinatorial optimization in computational biology publication-title: INFORMS Journal on Computing doi: 10.1287/ijoc.1040.0073 – start-page: 216 year: 1978 ident: 10.1016/j.dam.2011.10.014_br000075 article-title: The complexity of satisfiability problems – volume: 42 start-page: 844 issue: 4 year: 1995 ident: 10.1016/j.dam.2011.10.014_br000010 article-title: Color-coding publication-title: Journal of the ACM doi: 10.1145/210332.210337 – volume: 237 start-page: 123 issue: 1–2 year: 2000 ident: 10.1016/j.dam.2011.10.014_br000005 article-title: Some APX-completeness results for cubic graphs publication-title: Theoretical Computer Science doi: 10.1016/S0304-3975(98)00158-3 – year: 1999 ident: 10.1016/j.dam.2011.10.014_br000040 – volume: 18 start-page: 675 year: 2003 ident: 10.1016/j.dam.2011.10.014_br000025 article-title: The haplotyping problem: an overview of computational models and solutions publication-title: Journal of Computer Science and Technology doi: 10.1007/BF02945456 – year: 1999 ident: 10.1016/j.dam.2011.10.014_br000020 – volume: 71 start-page: 153 year: 1996 ident: 10.1016/j.dam.2011.10.014_br000050 article-title: On the complexity of comparing evolutionary trees publication-title: Discrete Applied Mathematics doi: 10.1016/S0166-218X(96)00062-5 – volume: 24 start-page: 1122 year: 1995 ident: 10.1016/j.dam.2011.10.014_br000060 article-title: On approximating the longest path in a graph publication-title: SIAM Journal on Computing – start-page: 173 year: 2010 ident: 10.1016/j.dam.2011.10.014_br000030 article-title: New aspects of haplotype inference from SNP fragments – volume: vol. 2161 start-page: 182 year: 2001 ident: 10.1016/j.dam.2011.10.014_br000065 article-title: SNPs problems, complexity and algorithms – year: 2006 ident: 10.1016/j.dam.2011.10.014_br000070 – volume: 49 start-page: 13 issue: 1 year: 2007 ident: 10.1016/j.dam.2011.10.014_br000035 article-title: The complexity of the single individual SNP haplotyping problem publication-title: Algorithmica doi: 10.1007/s00453-007-0029-z – volume: 45 start-page: 501 issue: 3 year: 1998 ident: 10.1016/j.dam.2011.10.014_br000015 article-title: Proof verification and the hardness of approximation problems publication-title: Journal of the ACM doi: 10.1145/278298.278306 – volume: 24 start-page: 1122 year: 1995 ident: 10.1016/j.dam.2011.10.014_br000055 article-title: On the approximation of shortest common supersequences and longest common subsequences publication-title: SIAM Journal on Computing doi: 10.1137/S009753979223842X |
| SSID | ssj0001218 ssj0000186 ssj0006644 |
| Score | 1.958431 |
| Snippet | The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual... |
| SourceID | proquest crossref elsevier |
| SourceType | Aggregation Database Index Database Publisher |
| StartPage | 1299 |
| SubjectTerms | Algorithms Approximation Approximation complexity Biology Computation Computational biology Fixed-parameter algorithms Haplotyping Mathematical analysis Mathematical models Reconstruction Surface hardness |
| Title | New results for the Longest Haplotype Reconstruction problem |
| URI | https://dx.doi.org/10.1016/j.dam.2011.10.014 https://www.proquest.com/docview/1022903051 |
| Volume | 160 |
| WOSCitedRecordID | wos000303286800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 customDbUrl: eissn: 1872-6771 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0001218 issn: 0166-218X databaseCode: AIEXJ dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3fS9xAEF7q6YM-FFuVqlVW8Kmy4CW3myz4crSKLVWEKtxb2Gwi3KHJ0TvFP99vsptcPKmo0JcQ9kjYm5nMfPNjZxjbN5qMsDkUJs0C0dNdKXRsYmGNUtRfndKK1bCJ6Pw8Hgz0hR8JP6nGCURFET886PF_ZTXWwGw6OvsGdjcvxQLuwXRcwXZcX8V4qliED313M500NYS_y4LySAenZnxTVlFXcjtnzWMP_FyZNlT9MYRGAaRugOpZ0-F1Noy-LLKhP59vIWtlO4hA1RiqHUR4frrFBRuVEoAAA2crnIKMo0CoyI1NaTSoGwngRUW39CHQhG7Z1m7oalif6W0XQhhhg7eurSpV3LnTpXPtsP_QrmhT1OuOkqwLbDGIpIZSXuz_PB78ajUPo854y3W4bZZdAsrq-Z7v7v_V2e6q7m9uC__CK3OWu4Ijl6vso_cjeN_x_xP7kBef2UqLRWvsCJLAvSRwSALHb9xLAm8kgT-VBO4lYZ1dnRxffj8VfliGsPCap6KrrII2lmEWh8amKk3T2EpAjzTUWLI5HNssksrIMNBWS2uszSItD3Nj0kCl4QbrFGWRf2E8k7GBl2-NtHkvktcGPj5QKz1uAIDVJvtWkyQZu54oSV0sOEpAv4ToR0ug3ybr1URLPKhzYC0B7196bK8mcAKFR1ksU-Tl3SShEIUmM9Xdet-rt9ny7BP4yjogcL7Dluz9dDj5u-ul6BG0GHcH |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=New+results+for+the+Longest+Haplotype+Reconstruction+problem&rft.jtitle=Discrete+Applied+Mathematics&rft.au=Dondi%2C+Riccardo&rft.date=2012-06-01&rft.pub=Elsevier+B.V&rft.issn=0166-218X&rft.eissn=1872-6771&rft.volume=160&rft.issue=9&rft.spage=1299&rft.epage=1310&rft_id=info:doi/10.1016%2Fj.dam.2011.10.014&rft.externalDocID=S0166218X11003891 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0166-218X&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0166-218X&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0166-218X&client=summon |