New results for the Longest Haplotype Reconstruction problem

The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotype...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Discrete Applied Mathematics Ročník 160; číslo 9; s. 1299 - 1310
Hlavný autor: Dondi, Riccardo
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Elsevier B.V 01.06.2012
Predmet:
ISSN:0166-218X, 1872-6771
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotypes of an individual. In this paper we consider one of the variants of the Single Individual Haplotyping problem, the Longest Haplotyping Reconstruction (LHR) problem. We prove that the LHR problem is NP-hard even in the restricted case when the input matrix is error-free. Furthermore, we investigate the approximation complexity of the problem, and we show that the problem cannot be approximated within factor 2logδnm for any constant δ<1, unless NP⊆DTIME[2polylognm]. Finally, we exhibit a fixed-parameter algorithm for the LHR problem, where the parameter is the size of the two reconstructed haplotypes.
AbstractList The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotypes of an individual. In this paper we consider one of the variants of the Single Individual Haplotyping problem, the Longest Haplotyping Reconstruction (LHR) problem. We prove that the LHR problem is NP-hard even in the restricted case when the input matrix is error-free. Furthermore, we investigate the approximation complexity of the problem, and we show that the problem cannot be approximated within factor 2logδnm for any constant δ<1, unless NP⊆DTIME[2polylognm]. Finally, we exhibit a fixed-parameter algorithm for the LHR problem, where the parameter is the size of the two reconstructed haplotypes.
The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual Haplotyping (SIH) problem, starting from a matrix of incomplete haplotype fragments, the goal is the reconstruction of the two complete haplotypes of an individual. In this paper we consider one of the variants of the Single Individual Haplotyping problem, the Longest Haplotyping Reconstruction (LHR) problem. We prove that the LHR problem is NP - hard even in the restricted case when the input matrix is error-free. Furthermore, we investigate the approximation complexity of the problem, and we show that the problem cannot be approximated within factor 2 log delta n m for any constant delta < 1 , unless NP [subE] D T I M E [ 2 p o l y log n m ] . Finally, we exhibit a fixed-parameter algorithm for the LHR problem, where the parameter is the size of the two reconstructed haplotypes.
Author Dondi, Riccardo
Author_xml – sequence: 1
  givenname: Riccardo
  surname: Dondi
  fullname: Dondi, Riccardo
  email: riccardo.dondi@unibg.it
  organization: Dipartimento di Scienze dei Linguaggi, della Comunicazione e degli Studi Culturali, Università degli Studi di Bergamo, Via Donizetti 3, 24129 Bergamo, Italy
BookMark eNp9kE9Lw0AQxRepYK1-AG85eknd2TSbBL1IUSsUBVHwtuxOppqSZOPuRum3d0s9exre8N78-Z2ySW97YuwC-Bw4yKvtvNbdXHCAqOccFkdsCmUhUlkUMGHT6JGpgPL9hJ16v-WcQ1RTdvNEP4kjP7bBJxvrkvBJydr2H-RDstJDa8NuoOSF0PY-uBFDY_tkcNa01J2x441uPZ3_1Rl7u797Xa7S9fPD4_J2naIoRUhBouSLMs_qMtNopDGmxHxRgMmq2EKqKlkXudR5JiqsctSIdVHlnLQ2Qppsxi4Pc-PerzFeprrGI7Wt7smOXgEXouIZzyFa4WBFZ713tFGDazrtdtGk9qTUVkVSak9q34qkYub6kKH4w3dDTnlsqEeqG0cYVG2bf9K_athy0g
Cites_doi 10.1287/ijoc.1040.0073
10.1145/210332.210337
10.1016/S0304-3975(98)00158-3
10.1007/BF02945456
10.1016/S0166-218X(96)00062-5
10.1007/s00453-007-0029-z
10.1145/278298.278306
10.1137/S009753979223842X
ContentType Journal Article
Copyright 2011 Elsevier B.V.
Copyright_xml – notice: 2011 Elsevier B.V.
DBID 6I.
AAFTH
AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1016/j.dam.2011.10.014
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Computer and Information Systems Abstracts
DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
Biology
EISSN 1872-6771
EndPage 1310
ExternalDocumentID 10_1016_j_dam_2011_10_014
S0166218X11003891
GroupedDBID -~X
6I.
AAFTH
ADEZE
AFTJW
AI.
ALMA_UNASSIGNED_HOLDINGS
FA8
FDB
OAUVE
VH1
WUQ
AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c282t-16c604853d83acb6bbb8c5471b39d83ce996d756a5329c95caccd7950eaab26b3
ISICitedReferencesCount 1
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000303286800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0166-218X
IngestDate Fri Jul 11 15:11:14 EDT 2025
Sat Nov 29 02:59:31 EST 2025
Sat Apr 29 22:45:10 EDT 2023
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 9
Keywords Haplotyping
Approximation complexity
Fixed-parameter algorithms
Computational biology
Language English
License http://www.elsevier.com/open-access/userlicense/1.0
https://www.elsevier.com/tdm/userlicense/1.0
https://www.elsevier.com/open-access/userlicense/1.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c282t-16c604853d83acb6bbb8c5471b39d83ce996d756a5329c95caccd7950eaab26b3
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
OpenAccessLink https://dx.doi.org/10.1016/j.dam.2011.10.014
PQID 1022903051
PQPubID 23500
PageCount 12
ParticipantIDs proquest_miscellaneous_1022903051
crossref_primary_10_1016_j_dam_2011_10_014
elsevier_sciencedirect_doi_10_1016_j_dam_2011_10_014
PublicationCentury 2000
PublicationDate June 2012
2012-06-00
20120601
PublicationDateYYYYMMDD 2012-06-01
PublicationDate_xml – month: 06
  year: 2012
  text: June 2012
PublicationDecade 2010
PublicationTitle Discrete Applied Mathematics
PublicationYear 2012
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
References Bonizzoni, Della Vedova, Dondi, Li (br000025) 2003; 18
Chen, Zhao, Zhang, Zhu (br000030) 2010
Karger, Motwani, Ramkumar (br000060) 1995; 24
Hein, Jiang, Wang, Zhang (br000050) 1996; 71
Alon, Yuster, Zwick (br000010) 1995; 42
Downey, Fellows (br000040) 1999
Greenberg, Hart, Lancia (br000045) 2004; 16
Niedermeier (br000070) 2006
Alimonti, Kann (br000005) 2000; 237
Arora, Lund, Motwani, Sudan, Szegedy (br000015) 1998; 45
Ausiello, Crescenzi, Gambosi, Kann, Marchetti-Spaccamela, Protasi (br000020) 1999
Cilibrasi, van Iersel, Kelk, Tromp (br000035) 2007; 49
Jiang, Li (br000055) 1995; 24
Lancia, Bafna, Istrail, Lippert, Schwartz (br000065) 2001; vol. 2161
Schaefer (br000075) 1978
Arora (10.1016/j.dam.2011.10.014_br000015) 1998; 45
Chen (10.1016/j.dam.2011.10.014_br000030) 2010
Ausiello (10.1016/j.dam.2011.10.014_br000020) 1999
Hein (10.1016/j.dam.2011.10.014_br000050) 1996; 71
Lancia (10.1016/j.dam.2011.10.014_br000065) 2001; vol. 2161
Niedermeier (10.1016/j.dam.2011.10.014_br000070) 2006
Alimonti (10.1016/j.dam.2011.10.014_br000005) 2000; 237
Greenberg (10.1016/j.dam.2011.10.014_br000045) 2004; 16
Jiang (10.1016/j.dam.2011.10.014_br000055) 1995; 24
Alon (10.1016/j.dam.2011.10.014_br000010) 1995; 42
Schaefer (10.1016/j.dam.2011.10.014_br000075) 1978
Downey (10.1016/j.dam.2011.10.014_br000040) 1999
Cilibrasi (10.1016/j.dam.2011.10.014_br000035) 2007; 49
Bonizzoni (10.1016/j.dam.2011.10.014_br000025) 2003; 18
Karger (10.1016/j.dam.2011.10.014_br000060) 1995; 24
References_xml – volume: 24
  start-page: 1122
  year: 1995
  end-page: 1139
  ident: br000055
  article-title: On the approximation of shortest common supersequences and longest common subsequences
  publication-title: SIAM Journal on Computing
– volume: 49
  start-page: 13
  year: 2007
  end-page: 36
  ident: br000035
  article-title: The complexity of the single individual SNP haplotyping problem
  publication-title: Algorithmica
– year: 1999
  ident: br000020
  article-title: Complexity and Approximation: Combinatorial Optimization Problems and their Approximability Properties
– start-page: 173
  year: 2010
  end-page: 183
  ident: br000030
  article-title: New aspects of haplotype inference from SNP fragments
  publication-title: A Practical Guide to Bioinformatics Analysis
– volume: 237
  start-page: 123
  year: 2000
  end-page: 134
  ident: br000005
  article-title: Some APX-completeness results for cubic graphs
  publication-title: Theoretical Computer Science
– volume: 18
  start-page: 675
  year: 2003
  end-page: 688
  ident: br000025
  article-title: The haplotyping problem: an overview of computational models and solutions
  publication-title: Journal of Computer Science and Technology
– year: 2006
  ident: br000070
  article-title: Invitation to Fixed-Parameter Algorithms
– volume: 42
  start-page: 844
  year: 1995
  end-page: 856
  ident: br000010
  article-title: Color-coding
  publication-title: Journal of the ACM
– start-page: 216
  year: 1978
  end-page: 226
  ident: br000075
  article-title: The complexity of satisfiability problems
  publication-title: Tenth Annual ACM Symposium on Theory of Computing
– volume: vol. 2161
  start-page: 182
  year: 2001
  end-page: 193
  ident: br000065
  article-title: SNPs problems, complexity and algorithms
  publication-title: ESA 2001
– year: 1999
  ident: br000040
  article-title: Parameterized Complexity
– volume: 24
  start-page: 1122
  year: 1995
  end-page: 1139
  ident: br000060
  article-title: On approximating the longest path in a graph
  publication-title: SIAM Journal on Computing
– volume: 16
  start-page: 211
  year: 2004
  end-page: 231
  ident: br000045
  article-title: Opportunities for combinatorial optimization in computational biology
  publication-title: INFORMS Journal on Computing
– volume: 45
  start-page: 501
  year: 1998
  end-page: 555
  ident: br000015
  article-title: Proof verification and the hardness of approximation problems
  publication-title: Journal of the ACM
– volume: 71
  start-page: 153
  year: 1996
  end-page: 169
  ident: br000050
  article-title: On the complexity of comparing evolutionary trees
  publication-title: Discrete Applied Mathematics
– volume: 16
  start-page: 211
  issue: 3
  year: 2004
  ident: 10.1016/j.dam.2011.10.014_br000045
  article-title: Opportunities for combinatorial optimization in computational biology
  publication-title: INFORMS Journal on Computing
  doi: 10.1287/ijoc.1040.0073
– start-page: 216
  year: 1978
  ident: 10.1016/j.dam.2011.10.014_br000075
  article-title: The complexity of satisfiability problems
– volume: 42
  start-page: 844
  issue: 4
  year: 1995
  ident: 10.1016/j.dam.2011.10.014_br000010
  article-title: Color-coding
  publication-title: Journal of the ACM
  doi: 10.1145/210332.210337
– volume: 237
  start-page: 123
  issue: 1–2
  year: 2000
  ident: 10.1016/j.dam.2011.10.014_br000005
  article-title: Some APX-completeness results for cubic graphs
  publication-title: Theoretical Computer Science
  doi: 10.1016/S0304-3975(98)00158-3
– year: 1999
  ident: 10.1016/j.dam.2011.10.014_br000040
– volume: 18
  start-page: 675
  year: 2003
  ident: 10.1016/j.dam.2011.10.014_br000025
  article-title: The haplotyping problem: an overview of computational models and solutions
  publication-title: Journal of Computer Science and Technology
  doi: 10.1007/BF02945456
– year: 1999
  ident: 10.1016/j.dam.2011.10.014_br000020
– volume: 71
  start-page: 153
  year: 1996
  ident: 10.1016/j.dam.2011.10.014_br000050
  article-title: On the complexity of comparing evolutionary trees
  publication-title: Discrete Applied Mathematics
  doi: 10.1016/S0166-218X(96)00062-5
– volume: 24
  start-page: 1122
  year: 1995
  ident: 10.1016/j.dam.2011.10.014_br000060
  article-title: On approximating the longest path in a graph
  publication-title: SIAM Journal on Computing
– start-page: 173
  year: 2010
  ident: 10.1016/j.dam.2011.10.014_br000030
  article-title: New aspects of haplotype inference from SNP fragments
– volume: vol. 2161
  start-page: 182
  year: 2001
  ident: 10.1016/j.dam.2011.10.014_br000065
  article-title: SNPs problems, complexity and algorithms
– year: 2006
  ident: 10.1016/j.dam.2011.10.014_br000070
– volume: 49
  start-page: 13
  issue: 1
  year: 2007
  ident: 10.1016/j.dam.2011.10.014_br000035
  article-title: The complexity of the single individual SNP haplotyping problem
  publication-title: Algorithmica
  doi: 10.1007/s00453-007-0029-z
– volume: 45
  start-page: 501
  issue: 3
  year: 1998
  ident: 10.1016/j.dam.2011.10.014_br000015
  article-title: Proof verification and the hardness of approximation problems
  publication-title: Journal of the ACM
  doi: 10.1145/278298.278306
– volume: 24
  start-page: 1122
  year: 1995
  ident: 10.1016/j.dam.2011.10.014_br000055
  article-title: On the approximation of shortest common supersequences and longest common subsequences
  publication-title: SIAM Journal on Computing
  doi: 10.1137/S009753979223842X
SSID ssj0001218
ssj0000186
ssj0006644
Score 1.958431
Snippet The haplotyping problem has emerged in recent years as one of the most relevant problems in Computational Biology. In particular, in the Single Individual...
SourceID proquest
crossref
elsevier
SourceType Aggregation Database
Index Database
Publisher
StartPage 1299
SubjectTerms Algorithms
Approximation
Approximation complexity
Biology
Computation
Computational biology
Fixed-parameter algorithms
Haplotyping
Mathematical analysis
Mathematical models
Reconstruction
Surface hardness
Title New results for the Longest Haplotype Reconstruction problem
URI https://dx.doi.org/10.1016/j.dam.2011.10.014
https://www.proquest.com/docview/1022903051
Volume 160
WOSCitedRecordID wos000303286800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1872-6771
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0001218
  issn: 0166-218X
  databaseCode: AIEXJ
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3fS9xAEF7q6YM-FFuVqlVW8Kmy4CW3myz4crSKLVWEKtxb2Gwi3KHJ0TvFP99vsptcPKmo0JcQ9kjYm5nMfPNjZxjbN5qMsDkUJs0C0dNdKXRsYmGNUtRfndKK1bCJ6Pw8Hgz0hR8JP6nGCURFET886PF_ZTXWwGw6OvsGdjcvxQLuwXRcwXZcX8V4qliED313M500NYS_y4LySAenZnxTVlFXcjtnzWMP_FyZNlT9MYRGAaRugOpZ0-F1Noy-LLKhP59vIWtlO4hA1RiqHUR4frrFBRuVEoAAA2crnIKMo0CoyI1NaTSoGwngRUW39CHQhG7Z1m7oalif6W0XQhhhg7eurSpV3LnTpXPtsP_QrmhT1OuOkqwLbDGIpIZSXuz_PB78ajUPo854y3W4bZZdAsrq-Z7v7v_V2e6q7m9uC__CK3OWu4Ijl6vso_cjeN_x_xP7kBef2UqLRWvsCJLAvSRwSALHb9xLAm8kgT-VBO4lYZ1dnRxffj8VfliGsPCap6KrrII2lmEWh8amKk3T2EpAjzTUWLI5HNssksrIMNBWS2uszSItD3Nj0kCl4QbrFGWRf2E8k7GBl2-NtHkvktcGPj5QKz1uAIDVJvtWkyQZu54oSV0sOEpAv4ToR0ug3ybr1URLPKhzYC0B7196bK8mcAKFR1ksU-Tl3SShEIUmM9Xdet-rt9ny7BP4yjogcL7Dluz9dDj5u-ul6BG0GHcH
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=New+results+for+the+Longest+Haplotype+Reconstruction+problem&rft.jtitle=Discrete+Applied+Mathematics&rft.au=Dondi%2C+Riccardo&rft.date=2012-06-01&rft.pub=Elsevier+B.V&rft.issn=0166-218X&rft.eissn=1872-6771&rft.volume=160&rft.issue=9&rft.spage=1299&rft.epage=1310&rft_id=info:doi/10.1016%2Fj.dam.2011.10.014&rft.externalDocID=S0166218X11003891
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0166-218X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0166-218X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0166-218X&client=summon