Faster space-efficient STR-IC-LCS computation

One of the most fundamental method for comparing two given strings A and B is the longest common subsequence (LCS), where the task is to find (the length) of an LCS of A and B. In this paper, we deal with the STR-IC-LCS1 problem which is one of the constrained LCS problems proposed by Chen and Chao...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Theoretical computer science Jg. 1003; S. 114607
Hauptverfasser: Yonemoto, Yuki, Nakashima, Yuto, Inenaga, Shunsuke, Bannai, Hideo
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Elsevier B.V 01.07.2024
Schlagworte:
ISSN:0304-3975, 1879-2294
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract One of the most fundamental method for comparing two given strings A and B is the longest common subsequence (LCS), where the task is to find (the length) of an LCS of A and B. In this paper, we deal with the STR-IC-LCS1 problem which is one of the constrained LCS problems proposed by Chen and Chao [J. Comb. Optim, 2011]. A string Z is said to be an STR-IC-LCS of three given strings A, B, and P, if Z is a longest string satisfying that (1) Z includes P as a substring and (2) Z is a common subsequence of A and B. We present three efficient algorithms for this problem: First, we begin with a space-efficient solution which computes the length of an STR-IC-LCS in O(n2) time and O((ℓ+1)(n−ℓ+1)) space, where ℓ is the length of an LCS of A and B of length n. When ℓ=O(1) or n−ℓ=O(1), then this algorithm uses only linear O(n) space. Second, we present a faster algorithm that works in O(nr/log⁡r+n(n−ℓ+1)) time, where r is the length of P, while retaining the O((ℓ+1)(n−ℓ+1)) space efficiency. Third, we give an alternative algorithm that runs in O(nr/log⁡r+n(n−ℓ′+1)) time with O((ℓ′+1)(n−ℓ′+1)) space, where ℓ′ denotes the STR-IC-LCS length for input strings A, B, and P.
AbstractList One of the most fundamental method for comparing two given strings A and B is the longest common subsequence (LCS), where the task is to find (the length) of an LCS of A and B. In this paper, we deal with the STR-IC-LCS1 problem which is one of the constrained LCS problems proposed by Chen and Chao [J. Comb. Optim, 2011]. A string Z is said to be an STR-IC-LCS of three given strings A, B, and P, if Z is a longest string satisfying that (1) Z includes P as a substring and (2) Z is a common subsequence of A and B. We present three efficient algorithms for this problem: First, we begin with a space-efficient solution which computes the length of an STR-IC-LCS in O(n2) time and O((ℓ+1)(n−ℓ+1)) space, where ℓ is the length of an LCS of A and B of length n. When ℓ=O(1) or n−ℓ=O(1), then this algorithm uses only linear O(n) space. Second, we present a faster algorithm that works in O(nr/log⁡r+n(n−ℓ+1)) time, where r is the length of P, while retaining the O((ℓ+1)(n−ℓ+1)) space efficiency. Third, we give an alternative algorithm that runs in O(nr/log⁡r+n(n−ℓ′+1)) time with O((ℓ′+1)(n−ℓ′+1)) space, where ℓ′ denotes the STR-IC-LCS length for input strings A, B, and P.
ArticleNumber 114607
Author Yonemoto, Yuki
Nakashima, Yuto
Bannai, Hideo
Inenaga, Shunsuke
Author_xml – sequence: 1
  givenname: Yuki
  surname: Yonemoto
  fullname: Yonemoto, Yuki
  email: yonemoto.yuuki.240@s.kyushu-u.ac.jp
  organization: Department of Information Science and Technology, Kyushu University, Fukuoka, Japan
– sequence: 2
  givenname: Yuto
  surname: Nakashima
  fullname: Nakashima, Yuto
  email: nakashima.yuto.003@m.kyushu-u.ac.jp
  organization: Department of Informatics, Kyushu University, Fukuoka, Japan
– sequence: 3
  givenname: Shunsuke
  orcidid: 0000-0002-1833-010X
  surname: Inenaga
  fullname: Inenaga, Shunsuke
  email: inenaga.shunsuke.380@m.kyushu-u.ac.jp
  organization: Department of Informatics, Kyushu University, Fukuoka, Japan
– sequence: 4
  givenname: Hideo
  surname: Bannai
  fullname: Bannai, Hideo
  email: hdbn.dsc@tmd.ac.jp
  organization: M&D Data Science Center, Tokyo Medical and Dental University, Tokyo, Japan
BookMark eNp9kM1KAzEUhYNUsK0-gLu-QMYkk58GVzJYLQwItvswc-cGUtqZkkTBt3fKuHLRs7mr73K-syCzfuiRkEfOCs64fjoUGVIhmJAF51Izc0PmfG0sFcLKGZmzkklaWqPuyCKlAxujjJ4TumlSxrhK5waQovcBAvZ5tdt_0m1F62q3guF0_spNDkN_T259c0z48HeXZL953VfvtP5421YvNQVhTaadUb61rZaASjFmleBS-FKZprNad1prxDVIkK1oLbCxsOy0l7CGshXGl0tiprcQh5QiegdhKpBjE46OM3eRdgc3SruLtJukR5L_I88xnJr4c5V5nhgcjb4DRpcuGwB2ISJk1w3hCv0LHHdv1Q
CitedBy_id crossref_primary_10_1016_j_ic_2025_105311
Cites_doi 10.1016/j.tcs.2008.08.042
10.1016/j.ipl.2012.02.007
10.1016/0022-0000(80)90002-1
10.1016/j.ipl.2003.07.001
10.1007/s10878-009-9262-5
10.1016/j.ipl.2004.02.008
10.1145/321796.321811
10.1093/comjnl/bxu012
10.1007/BF00264437
10.1145/360825.360861
ContentType Journal Article
Copyright 2024 Elsevier B.V.
Copyright_xml – notice: 2024 Elsevier B.V.
DBID AAYXX
CITATION
DOI 10.1016/j.tcs.2024.114607
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
Computer Science
EISSN 1879-2294
ExternalDocumentID 10_1016_j_tcs_2024_114607
S0304397524002226
GroupedDBID --K
--M
-~X
.DC
.~1
0R~
0SF
123
1B1
1RT
1~.
1~5
4.4
457
4G.
5VS
7-5
71M
8P~
9JN
AABNK
AACTN
AAEDW
AAFTH
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAXUO
AAYFN
ABAOU
ABBOA
ABJNI
ABMAC
ACDAQ
ACGFS
ACRLP
ACZNC
ADBBV
ADEZE
AEBSH
AEKER
AENEX
AFKWA
AFTJW
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJOXV
AKRWK
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
ARUGR
AXJTR
BKOJK
BLXMC
CS3
DU5
EBS
EFJIC
EO8
EO9
EP2
EP3
F5P
FDB
FEDTE
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HVGLF
IHE
IXB
J1W
KOM
MHUIS
MO0
N9A
O-L
O9-
OAUVE
OK1
OZT
P-8
P-9
P2P
PC.
Q38
RIG
ROL
RPZ
SCC
SDF
SDG
SES
SEW
SPC
SPCBC
SSV
SSW
T5K
TN5
WH7
YNT
ZMT
~G-
29Q
9DU
AAEDT
AAQXK
AATTM
AAXKI
AAYWO
AAYXX
ABDPE
ABEFU
ABFNM
ABWVN
ABXDB
ACLOT
ACNNM
ACRPL
ACVFH
ADCNI
ADMUD
ADNMO
ADVLN
AEIPS
AEUPX
AEXQZ
AFJKZ
AFPUW
AGHFR
AGQPQ
AIGII
AIIUN
AKBMS
AKYEP
ANKPU
APXCP
ASPBG
AVWKF
AZFZN
CITATION
EFKBS
EFLBG
EJD
FGOYB
G-2
HZ~
LG9
M26
M41
R2-
SSZ
TAE
WUQ
ZY4
~HD
ID FETCH-LOGICAL-c297t-d75fb9b64ce5500952142f357ad966d666ee8c4c4b2b9c01144d6f4c8c3b27f3
ISICitedReferencesCount 1
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001265648300001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0304-3975
IngestDate Sat Nov 29 07:47:46 EST 2025
Tue Nov 18 21:45:06 EST 2025
Sat Jun 01 15:42:54 EDT 2024
IsPeerReviewed true
IsScholarly true
Keywords String algorithms
Constrained longest common subsequence
Dynamic programming
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c297t-d75fb9b64ce5500952142f357ad966d666ee8c4c4b2b9c01144d6f4c8c3b27f3
ORCID 0000-0002-1833-010X
ParticipantIDs crossref_citationtrail_10_1016_j_tcs_2024_114607
crossref_primary_10_1016_j_tcs_2024_114607
elsevier_sciencedirect_doi_10_1016_j_tcs_2024_114607
PublicationCentury 2000
PublicationDate 2024-07-01
2024-07-00
PublicationDateYYYYMMDD 2024-07-01
PublicationDate_xml – month: 07
  year: 2024
  text: 2024-07-01
  day: 01
PublicationDecade 2020
PublicationTitle Theoretical computer science
PublicationYear 2024
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
References Yamada, Nakashima, Inenaga, Bannai, Takeda (br0140) 2020; vol. 12011
Yonemoto, Nakashima, Inenaga, Bannai (br0130) 2023; vol. 13878
Bille, Farach-Colton (br0040) 2008; 409
Liu, Wang, Chiu (br0100) 2014; 58
Nakatsu, Kambayashi, Yajima (br0120) 1982; 18
Chen, Chao (br0010) 2011; 21
Hirschberg (br0060) 1975; 18
Wagner, Fischer (br0020) 1974; 21
Abboud, Backurs, Williams (br0050) 2015
Kuboi, Fujishige, Inenaga, Bannai, Takeda (br0110) 2017
Tsai (br0070) 2003; 88
Chin, Santis, Ferrara, Ho, Kim (br0080) 2004; 90
Deorowicz (br0090) 2012; 112
Das, Fleischer, Gasieniec, Gunopulos, Kärkkäinen (br0150) 1997
Masek, Paterson (br0030) 1980; 20
Bille, Gørtz, Mozes, Steiner, Weimann (br0160) 2022; vol. 223
Tsai (10.1016/j.tcs.2024.114607_br0070) 2003; 88
Bille (10.1016/j.tcs.2024.114607_br0040) 2008; 409
Kuboi (10.1016/j.tcs.2024.114607_br0110) 2017
Yonemoto (10.1016/j.tcs.2024.114607_br0130) 2023; vol. 13878
Masek (10.1016/j.tcs.2024.114607_br0030) 1980; 20
Das (10.1016/j.tcs.2024.114607_br0150) 1997
Bille (10.1016/j.tcs.2024.114607_br0160) 2022; vol. 223
Chen (10.1016/j.tcs.2024.114607_br0010) 2011; 21
Chin (10.1016/j.tcs.2024.114607_br0080) 2004; 90
Deorowicz (10.1016/j.tcs.2024.114607_br0090) 2012; 112
Wagner (10.1016/j.tcs.2024.114607_br0020) 1974; 21
Yamada (10.1016/j.tcs.2024.114607_br0140) 2020; vol. 12011
Liu (10.1016/j.tcs.2024.114607_br0100) 2014; 58
Nakatsu (10.1016/j.tcs.2024.114607_br0120) 1982; 18
Abboud (10.1016/j.tcs.2024.114607_br0050) 2015
Hirschberg (10.1016/j.tcs.2024.114607_br0060) 1975; 18
References_xml – volume: vol. 12011
  start-page: 125
  year: 2020
  end-page: 135
  ident: br0140
  article-title: Faster STR-EC-LCS computation
  publication-title: SOFSEM 2020
– start-page: 59
  year: 2015
  end-page: 78
  ident: br0050
  article-title: Tight hardness results for LCS and other sequence similarity measures
  publication-title: FOCS 2015
– volume: 58
  start-page: 1074
  year: 2014
  end-page: 1084
  ident: br0100
  article-title: Constrained longest common subsequences with run-length-encoded strings
  publication-title: Comput. J.
– volume: 21
  start-page: 383
  year: 2011
  end-page: 392
  ident: br0010
  article-title: On the generalized constrained longest common subsequence problems
  publication-title: J. Comb. Optim.
– volume: 20
  start-page: 18
  year: 1980
  end-page: 31
  ident: br0030
  article-title: A faster algorithm computing string edit distances
  publication-title: J. Comput. Syst. Sci.
– start-page: 20:1
  year: 2017
  end-page: 20:12
  ident: br0110
  article-title: Faster STR-IC-LCS computation via RLE
  publication-title: CPM 2017
– volume: 112
  start-page: 423
  year: 2012
  end-page: 426
  ident: br0090
  article-title: Quadratic-time algorithm for a string constrained lcs problem
  publication-title: Inf. Process. Lett.
– volume: 409
  start-page: 486
  year: 2008
  end-page: 496
  ident: br0040
  article-title: Fast and compact regular expression matching
  publication-title: Theor. Comput. Sci.
– volume: 88
  start-page: 173
  year: 2003
  end-page: 176
  ident: br0070
  article-title: The constrained longest common subsequence problem
  publication-title: Inf. Process. Lett.
– volume: vol. 223
  start-page: 4:1
  year: 2022
  end-page: 4:12
  ident: br0160
  article-title: The fine-grained complexity of episode matching
  publication-title: CPM 2022
– volume: 18
  start-page: 341
  year: 1975
  end-page: 343
  ident: br0060
  article-title: A linear space algorithm for computing maximal common subsequences
  publication-title: Commun. ACM
– volume: vol. 13878
  start-page: 372
  year: 2023
  end-page: 384
  ident: br0130
  article-title: Space-efficient STR-IC-LCS computation
  publication-title: SOFSEM 2023
– volume: 18
  start-page: 171
  year: 1982
  end-page: 179
  ident: br0120
  article-title: A longest common subsequence algorithm suitable for similar text strings
  publication-title: Acta Inform.
– volume: 21
  start-page: 168
  year: 1974
  end-page: 173
  ident: br0020
  article-title: The string-to-string correction problem
  publication-title: J. ACM
– volume: 90
  start-page: 175
  year: 2004
  end-page: 179
  ident: br0080
  article-title: A simple algorithm for the constrained sequence problems
  publication-title: Inf. Process. Lett.
– start-page: 12
  year: 1997
  end-page: 27
  ident: br0150
  article-title: Episode matching
  publication-title: CPM 1997
– volume: 409
  start-page: 486
  issue: 3
  year: 2008
  ident: 10.1016/j.tcs.2024.114607_br0040
  article-title: Fast and compact regular expression matching
  publication-title: Theor. Comput. Sci.
  doi: 10.1016/j.tcs.2008.08.042
– volume: 112
  start-page: 423
  issue: 11
  year: 2012
  ident: 10.1016/j.tcs.2024.114607_br0090
  article-title: Quadratic-time algorithm for a string constrained lcs problem
  publication-title: Inf. Process. Lett.
  doi: 10.1016/j.ipl.2012.02.007
– volume: 20
  start-page: 18
  issue: 1
  year: 1980
  ident: 10.1016/j.tcs.2024.114607_br0030
  article-title: A faster algorithm computing string edit distances
  publication-title: J. Comput. Syst. Sci.
  doi: 10.1016/0022-0000(80)90002-1
– volume: 88
  start-page: 173
  issue: 4
  year: 2003
  ident: 10.1016/j.tcs.2024.114607_br0070
  article-title: The constrained longest common subsequence problem
  publication-title: Inf. Process. Lett.
  doi: 10.1016/j.ipl.2003.07.001
– volume: 21
  start-page: 383
  issue: 3
  year: 2011
  ident: 10.1016/j.tcs.2024.114607_br0010
  article-title: On the generalized constrained longest common subsequence problems
  publication-title: J. Comb. Optim.
  doi: 10.1007/s10878-009-9262-5
– start-page: 20:1
  year: 2017
  ident: 10.1016/j.tcs.2024.114607_br0110
  article-title: Faster STR-IC-LCS computation via RLE
– volume: vol. 13878
  start-page: 372
  year: 2023
  ident: 10.1016/j.tcs.2024.114607_br0130
  article-title: Space-efficient STR-IC-LCS computation
– start-page: 12
  year: 1997
  ident: 10.1016/j.tcs.2024.114607_br0150
  article-title: Episode matching
– volume: 90
  start-page: 175
  issue: 4
  year: 2004
  ident: 10.1016/j.tcs.2024.114607_br0080
  article-title: A simple algorithm for the constrained sequence problems
  publication-title: Inf. Process. Lett.
  doi: 10.1016/j.ipl.2004.02.008
– volume: vol. 223
  start-page: 4:1
  year: 2022
  ident: 10.1016/j.tcs.2024.114607_br0160
  article-title: The fine-grained complexity of episode matching
– volume: 21
  start-page: 168
  issue: 1
  year: 1974
  ident: 10.1016/j.tcs.2024.114607_br0020
  article-title: The string-to-string correction problem
  publication-title: J. ACM
  doi: 10.1145/321796.321811
– volume: 58
  start-page: 1074
  issue: 5
  year: 2014
  ident: 10.1016/j.tcs.2024.114607_br0100
  article-title: Constrained longest common subsequences with run-length-encoded strings
  publication-title: Comput. J.
  doi: 10.1093/comjnl/bxu012
– volume: 18
  start-page: 171
  year: 1982
  ident: 10.1016/j.tcs.2024.114607_br0120
  article-title: A longest common subsequence algorithm suitable for similar text strings
  publication-title: Acta Inform.
  doi: 10.1007/BF00264437
– volume: 18
  start-page: 341
  issue: 6
  year: 1975
  ident: 10.1016/j.tcs.2024.114607_br0060
  article-title: A linear space algorithm for computing maximal common subsequences
  publication-title: Commun. ACM
  doi: 10.1145/360825.360861
– start-page: 59
  year: 2015
  ident: 10.1016/j.tcs.2024.114607_br0050
  article-title: Tight hardness results for LCS and other sequence similarity measures
– volume: vol. 12011
  start-page: 125
  year: 2020
  ident: 10.1016/j.tcs.2024.114607_br0140
  article-title: Faster STR-EC-LCS computation
SSID ssj0000576
Score 2.428175
Snippet One of the most fundamental method for comparing two given strings A and B is the longest common subsequence (LCS), where the task is to find (the length) of...
SourceID crossref
elsevier
SourceType Enrichment Source
Index Database
Publisher
StartPage 114607
SubjectTerms Constrained longest common subsequence
Dynamic programming
String algorithms
Title Faster space-efficient STR-IC-LCS computation
URI https://dx.doi.org/10.1016/j.tcs.2024.114607
Volume 1003
WOSCitedRecordID wos001265648300001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1879-2294
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0000576
  issn: 0304-3975
  databaseCode: AIEXJ
  dateStart: 20211209
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3Nb9MwFLfKxgEOfAwQA4Zy4MTkqXWcOj5O1aZtKhOiPZRT5Lw4ou3wpjWZ9ufzHDtO1THEkLhEkRPHkd9Pz-_D_j1CPomE5yCAUaUFUK6AU5nzAu8Kix-Fq0ZDmT8W5-fpbCa_9nrj9izMzYUwJr29lVf_VdTYhsK2R2cfIO7wUWzAexQ6XlHseP0rwR8ry32wj5oCNNUNQ4TN90-m3-jpiI5Hk2Ybeb2Wgl8ExIQzjeCLPez7FbJTDkajcJv46vd6OQ-xZLW0VZl-KvegugyQM9ooF7yd_KjNql52aXxljCuGfTIv9OV6_IHxsFfVB8XuHIxxh7FswkW6oigH2unWVEjKmKtpHJRvv6E4uKvJXVBhcVCBJVVnvCE1dgVyNwiyJ3YwO5bdDovmzvAR2WYikajjtg9Pj2Zn3cqcCJe79j_XZrmb_X4bA_3eTlmzPaYvyDPvNESHTtgvSU-bHfK8LcgRef28Q55-CSS8q1eEOiREG0iIOiREa0h4TabHR9PRCfXVMSgwKSpaiKTMZT7koNHLREvZkueVcSJUgS5sgW6p1ilw4DnLJVi_lxfDkkMKcc5EGb8hWwYx85ZEgwG29GOhy3jI8zxFm2-g-5BAgsocv7VL-u1UZOCZ420Bk4us3SK4yHD2Mjt7mZu9XfI5dLlytCl_epm385t5VDt7LkMw3N_t3b91e0-edDD-QLaq61rvkcdwU81X1x89ZH4B1313Pg
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Faster+space-efficient+STR-IC-LCS+computation&rft.jtitle=Theoretical+computer+science&rft.au=Yonemoto%2C+Yuki&rft.au=Nakashima%2C+Yuto&rft.au=Inenaga%2C+Shunsuke&rft.au=Bannai%2C+Hideo&rft.date=2024-07-01&rft.pub=Elsevier+B.V&rft.issn=0304-3975&rft.eissn=1879-2294&rft.volume=1003&rft_id=info:doi/10.1016%2Fj.tcs.2024.114607&rft.externalDocID=S0304397524002226
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0304-3975&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0304-3975&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0304-3975&client=summon