Bounds for parametric sequence comparison

We consider the problem of computing a global alignment between two or more sequences subject to varying mismatch and indel penalties. We prove a tight $3(n/2\pi)^{2/3} + O(n^{1/3} \log n)$ bound on the worst-case number of distinct optimum alignments for two sequences of length $n$ as the parameters are varied....

Bibliographic details
Published in: Discrete Applied Mathematics, Volume 118, Issue 3, pp. 181-198
Main authors: Fernández-Baca, David; Seppäläinen, Timo; Slutzki, Giora
Format: Journal Article
Language: English
Publication details: Lausanne Elsevier B.V 15.05.2002
Amsterdam Elsevier
New York, NY
ISSN:0166-218X, 1872-6771
Abstract We consider the problem of computing a global alignment between two or more sequences subject to varying mismatch and indel penalties. We prove a tight $3(n/2\pi)^{2/3} + O(n^{1/3} \log n)$ bound on the worst-case number of distinct optimum alignments for two sequences of length $n$ as the parameters are varied. This refines an $O(n^{2/3})$ upper bound by Gusfield et al., answering a question posed by Pevzner and Waterman. Our lower bound requires an unbounded alphabet. For strings over a binary alphabet, we prove an $\Omega(n^{1/2})$ lower bound. For the parametric global alignment of $k \geq 2$ sequences under sum-of-pairs scoring we prove a $3\left(\binom{k}{2} n / 2\pi\right)^{2/3} + O(k^{2/3} n^{1/3} \log n)$ upper bound on the number of distinct optimality regions and an $\Omega(n^{2/3})$ lower bound, partially answering a problem of Pevzner. Based on experimental evidence, we conjecture that for two random sequences, the number of optimality regions is approximately $\sqrt{n}$ with high probability.
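The quantity bounded in the abstract can be illustrated with a small sketch. It is not taken from the paper. It assumes the scoring model matches - alpha * mismatches - beta * indels, which is one common way to parametrize mismatch and indel penalties, and it uses hypothetical helper names (`optimal_profile`, `sampled_profiles`, `leading_term`). For each (alpha, beta) on a finite grid it finds one optimal global alignment by dynamic programming and records its (matches, mismatches, indels) profile. The number of distinct profiles found is only a rough estimate of the number of optimality regions: a grid can miss small regions, and a grid point that lies exactly on a region boundary returns just one of the tied profiles.

```python
from fractions import Fraction
from math import pi


def optimal_profile(a, b, alpha, beta):
    """Return (matches, mismatches, indels) of one optimal global alignment.

    Assumed score model: matches - alpha * mismatches - beta * indels.
    """
    n, m = len(a), len(b)
    # Each cell holds (score, matches, mismatches, indels). Tuples compare
    # lexicographically, so ties in score are broken deterministically.
    prev = [(-beta * j, 0, 0, j) for j in range(m + 1)]
    for i in range(1, n + 1):
        cur = [(-beta * i, 0, 0, i)]
        for j in range(1, m + 1):
            s, w, x, y = prev[j - 1]
            if a[i - 1] == b[j - 1]:
                diag = (s + 1, w + 1, x, y)      # match column
            else:
                diag = (s - alpha, w, x + 1, y)  # mismatch column
            s, w, x, y = prev[j]
            up = (s - beta, w, x, y + 1)         # a[i-1] aligned to a space
            s, w, x, y = cur[j - 1]
            left = (s - beta, w, x, y + 1)       # b[j-1] aligned to a space
            cur.append(max(diag, up, left))
        prev = cur
    return prev[m][1:]


def sampled_profiles(a, b, steps=20, max_penalty=4):
    """Collect optimal profiles over a (steps+1) x (steps+1) parameter grid."""
    # Fractions keep the arithmetic exact, so score ties are detected reliably.
    grid = [Fraction(max_penalty * k, steps) for k in range(steps + 1)]
    return {optimal_profile(a, b, alpha, beta) for alpha in grid for beta in grid}


def leading_term(n):
    """Leading term 3 * (n / (2*pi))**(2/3) of the two-sequence bound."""
    return 3 * (n / (2 * pi)) ** (2 / 3)
```

For example, `len(sampled_profiles(s, t))` for two strings `s` and `t` of equal length `n` can be compared against `leading_term(n)`; the grid range and resolution are arbitrary choices made for this illustration.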
Author Fernández-Baca, David
Slutzki, Giora
Seppäläinen, Timo
Author_xml – sequence: 1
  givenname: David
  surname: Fernández-Baca
  fullname: Fernández-Baca, David
  email: fernande@cs.iastate.edu
  organization: Department of Computer Science, Iowa State University, 226 Atanasoff Hall, Ames, IA 50011, USA
– sequence: 2
  givenname: Timo
  surname: Seppäläinen
  fullname: Seppäläinen, Timo
  email: seppalai@iastate.edu
  organization: Department of Mathematics, Iowa State University, Ames, IA 50011, USA
– sequence: 3
  givenname: Giora
  surname: Slutzki
  fullname: Slutzki, Giora
  email: slutzki@cs.iastate.edu
  organization: Department of Computer Science, Iowa State University, 226 Atanasoff Hall, Ames, IA 50011, USA
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=13530995$$DView record in Pascal Francis
CODEN DAMADU
CitedBy_id crossref_primary_10_1007_s11040_018_9276_2
crossref_primary_10_1145_1597036_1597048
crossref_primary_10_1016_j_jalgor_2004_04_008
Cites_doi 10.1007/BF01185430
10.1016/S0076-6879(96)66030-3
10.1073/pnas.89.13.6090
10.1016/S0022-2836(05)80006-3
10.1137/0148063
10.1109/ISTCS.1995.377035
ContentType Journal Article
Copyright 2002 Elsevier Science B.V.
2002 INIST-CNRS
Copyright_xml – notice: 2002 Elsevier Science B.V.
– notice: 2002 INIST-CNRS
DBID 6I.
AAFTH
AAYXX
CITATION
IQODW
DOI 10.1016/S0166-218X(01)00206-2
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
Pascal-Francis
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
EISSN 1872-6771
EndPage 198
ExternalDocumentID 13530995
10_1016_S0166_218X_01_00206_2
S0166218X01002062
ISICitedReferencesCount 5
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000174769500002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0166-218X
IngestDate Wed Apr 02 07:43:11 EDT 2025
Tue Nov 18 22:27:29 EST 2025
Sat Nov 29 03:57:20 EST 2025
Sat Apr 29 22:44:07 EDT 2023
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 3
Keywords Parametric analysis
Experimental analysis of algorithms
Multiple alignment
Sequence alignment
Computational biology
Language English
License http://www.elsevier.com/open-access/userlicense/1.0
https://www.elsevier.com/tdm/userlicense/1.0
https://www.elsevier.com/open-access/userlicense/1.0
CC BY 4.0
LinkModel OpenURL
OpenAccessLink https://dx.doi.org/10.1016/S0166-218X(01)00206-2
PageCount 18
ParticipantIDs pascalfrancis_primary_13530995
crossref_citationtrail_10_1016_S0166_218X_01_00206_2
crossref_primary_10_1016_S0166_218X_01_00206_2
elsevier_sciencedirect_doi_10_1016_S0166_218X_01_00206_2
PublicationCentury 2000
PublicationDate 2002-05-15
PublicationDateYYYYMMDD 2002-05-15
PublicationDate_xml – month: 05
  year: 2002
  text: 2002-05-15
  day: 15
PublicationDecade 2000
PublicationPlace Lausanne
Amsterdam
New York, NY
PublicationPlace_xml – name: Amsterdam
– name: Lausanne
– name: New York, NY
PublicationTitle Discrete Applied Mathematics
PublicationYear 2002
Publisher Elsevier B.V
Elsevier
Publisher_xml – name: Elsevier B.V
– name: Elsevier
References Pevzner (BIB8) 2000
Gusfield (BIB4) 1997
Waterman, Eggert, Lander (BIB12) 1992; 89
Carrillo, Lipman (BIB2) 1988; 48
Gusfield, Balasubramanian, Naor (BIB5) 1994; 12
D. Gusfield, P. Stelling, Parametric and inverse-parametric sequence alignment with XPARAL, in: Russell F. Doolittle (Ed.), Computer Methods for Macromolecular Sequence Analysis, Methods in Enzymology, Vol. 266, Academic Press, New York, 1996, 481–494.
Fernández-Baca, Seppäläinen, Slutzki (BIB3) 2000; Vol. 1848
Sankoff, Kruskal (Eds.) (BIB10) 1983
P.A. Pevzner, M.S. Waterman, Open combinatorial problems in computational molecular biology, Proceedings of the Third Israeli Symposium on Theory of Computing and Systems, IEEE Computer Society Press, Silver Spring, MD, 1995, 158–173.
Huang, Pevzner, Miller (BIB7) 1994; Vol. 807
Vingron, Waterman (BIB11) 1994; 235
Apostol (BIB1) 1976
References_xml – year: 1976
  ident: BIB1
  publication-title: Introduction to Analytic Number Theory
– volume: Vol. 1848
  start-page: 69
  year: 2000
  end-page: 83
  ident: BIB3
  article-title: Parametric multiple sequence alignment and phylogeny construction
  publication-title: Combinatorial Pattern Matching
– year: 2000
  ident: BIB8
  publication-title: Computational Molecular Biology
– volume: 48
  start-page: 1073
  year: 1988
  end-page: 1082
  ident: BIB2
  article-title: The multiple sequence alignment problem in biology
  publication-title: SIAM J. Appl. Math.
– year: 1997
  ident: BIB4
  publication-title: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology
– volume: Vol. 807
  start-page: 87
  year: 1994
  end-page: 101
  ident: BIB7
  article-title: Parametric recomputing in alignment graphs
  publication-title: Combinatorial Pattern Matching
– volume: 12
  start-page: 312
  year: 1994
  end-page: 326
  ident: BIB5
  article-title: Parametric optimization of sequence alignment
  publication-title: Algorithmica
– volume: 235
  start-page: 1
  year: 1994
  end-page: 12
  ident: BIB11
  article-title: Sequence alignment and penalty choice: review of concepts, case studies, and implications
  publication-title: J. Mol. Biol.
– reference: D. Gusfield, P. Stelling, Parametric and inverse-parametric sequence alignment with XPARAL, in: Russell F. Doolittle (Ed.), Computer Methods for Macromolecular Sequence Analysis, Methods in Enzymology, Vol. 266, Academic Press, New York, 1996, 481–494.
– year: 1983
  ident: BIB10
  publication-title: Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison
– volume: 89
  start-page: 6090
  year: 1992
  end-page: 6093
  ident: BIB12
  article-title: Parametric sequence comparisons
  publication-title: Proc. Natl. Acad. Sci. USA
– reference: P.A. Pevzner, M.S. Waterman, Open combinatorial problems in computational molecular biology, Proceedings of the Third Israeli Symposium on Theory of Computing and Systems, IEEE Computer Society Press, Silver Spring, MD, 1995, 158–173.
– volume: Vol. 1848
  start-page: 69
  year: 2000
  ident: 10.1016/S0166-218X(01)00206-2_BIB3
  article-title: Parametric multiple sequence alignment and phylogeny construction
– volume: Vol. 807
  start-page: 87
  year: 1994
  ident: 10.1016/S0166-218X(01)00206-2_BIB7
  article-title: Parametric recomputing in alignment graphs
– volume: 12
  start-page: 312
  year: 1994
  ident: 10.1016/S0166-218X(01)00206-2_BIB5
  article-title: Parametric optimization of sequence alignment
  publication-title: Algorithmica
  doi: 10.1007/BF01185430
– ident: 10.1016/S0166-218X(01)00206-2_BIB6
  doi: 10.1016/S0076-6879(96)66030-3
– year: 2000
  ident: 10.1016/S0166-218X(01)00206-2_BIB8
– year: 1976
  ident: 10.1016/S0166-218X(01)00206-2_BIB1
– volume: 89
  start-page: 6090
  year: 1992
  ident: 10.1016/S0166-218X(01)00206-2_BIB12
  article-title: Parametric sequence comparisons
  publication-title: Proc. Natl. Acad. Sci. USA
  doi: 10.1073/pnas.89.13.6090
– year: 1983
  ident: 10.1016/S0166-218X(01)00206-2_BIB10
– volume: 235
  start-page: 1
  year: 1994
  ident: 10.1016/S0166-218X(01)00206-2_BIB11
  article-title: Sequence alignment and penalty choice: review of concepts, case studies, and implications
  publication-title: J. Mol. Biol.
  doi: 10.1016/S0022-2836(05)80006-3
– year: 1997
  ident: 10.1016/S0166-218X(01)00206-2_BIB4
– volume: 48
  start-page: 1073
  year: 1988
  ident: 10.1016/S0166-218X(01)00206-2_BIB2
  article-title: The multiple sequence alignment problem in biology
  publication-title: SIAM J. Appl. Math.
  doi: 10.1137/0148063
– ident: 10.1016/S0166-218X(01)00206-2_BIB9
  doi: 10.1109/ISTCS.1995.377035
SSID ssj0001218
ssj0000186
ssj0006644
Score 1.6487896
SourceID pascalfrancis
crossref
elsevier
SourceType Index Database
Enrichment Source
Publisher
StartPage 181
SubjectTerms Computational biology
Exact sciences and technology
Experimental analysis of algorithms
Mathematics
Multiple alignment
Parametric analysis
Sciences and techniques of general use
Sequence alignment
Title Bounds for parametric sequence comparison
URI https://dx.doi.org/10.1016/S0166-218X(01)00206-2
Volume 118
WOSCitedRecordID wos000174769500002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1872-6771
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0001218
  issn: 0166-218X
  databaseCode: AIEXJ
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
linkProvider Elsevier