Joint String Complexity for Markov Sources

String complexity is defined as the cardinality of a set of all distinct words (factors) of a given string. For two strings, we define $\textit{joint string complexity}$ as the set of words that are common to both strings. We also relax this definition and introduce $\textit{joint semi-complexity}$...

Full description

Saved in:
Bibliographic Details
Published in:Discrete mathematics and theoretical computer science Vol. DMTCS Proceedings vol. AQ,...; no. Proceedings; pp. 303 - 322
Main Authors: Jacquet, Philippe, Szpankowski, Wojciech
Format: Journal Article Conference Proceeding
Language:English
Published: DMTCS 01.01.2012
Discrete Mathematics and Theoretical Computer Science
Discrete Mathematics & Theoretical Computer Science
Series:DMTCS Proceedings
Subjects:
ISSN:1365-8050, 1462-7264, 1365-8050
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract String complexity is defined as the cardinality of a set of all distinct words (factors) of a given string. For two strings, we define $\textit{joint string complexity}$ as the set of words that are common to both strings. We also relax this definition and introduce $\textit{joint semi-complexity}$ restricted to the common words appearing at least twice in both strings. String complexity finds a number of applications from capturing the richness of a language to finding similarities between two genome sequences. In this paper we analyze joint complexity and joint semi-complexity when both strings are generated by a Markov source. The problem turns out to be quite challenging requiring subtle singularity analysis and saddle point method over infinity many saddle points leading to novel oscillatory phenomena with single and double periodicities.
AbstractList String complexity is defined as the cardinality of a set of all distinct words (factors) of a given string. For two strings, we define $\textit{joint string complexity}$ as the set of words that are common to both strings. We also relax this definition and introduce $\textit{joint semi-complexity}$ restricted to the common words appearing at least twice in both strings. String complexity finds a number of applications from capturing the richness of a language to finding similarities between two genome sequences. In this paper we analyze joint complexity and joint semi-complexity when both strings are generated by a Markov source. The problem turns out to be quite challenging requiring subtle singularity analysis and saddle point method over infinity many saddle points leading to novel oscillatory phenomena with single and double periodicities.
Author Jacquet, Philippe
Szpankowski, Wojciech
Author_xml – sequence: 1
  givenname: Philippe
  surname: Jacquet
  fullname: Jacquet, Philippe
  organization: Alcatel-Lucent Bell Labs France [Nozay]
– sequence: 2
  givenname: Wojciech
  surname: Szpankowski
  fullname: Szpankowski, Wojciech
  organization: Department of Computer Science [Purdue]
BackLink https://inria.hal.science/hal-01197224$$DView record in HAL
BookMark eNptkE9PAjEUxBujiYCe_AJ7VbPYf9vtHglRwWA8oOfmbbfF4rIl3Urk27uAJmo8vZfJzG-S6aPjxjcGoQuCh1zQQt5Uq6jbIcOYHKEeYSJLJc7w8Y__FPXbdtkZaMHzHrp68K6JyTwG1yySsV-ta_Ph4jaxPiSPEN78Jpn796BNe4ZOLNStOf-6A_Ryd_s8nqSzp_vpeDRLNSUZSXlRVdaWHEAIXYK0TIDNhTYGtOSWUm0hI0Iwi6U0zJQ5pqYilhmQxGjLBmh64FYelmod3ArCVnlwai_4sFAQotO1UWXXxJhm2ljNCa8krjQD2sGxFNSajnV5YL1C_Qs1Gc3UTsOEFDmlfEM6Lzl4dfBtG4xV2kWIzjcxgKsVwWo_stqPrHYjd5nrP5nvkv_cn2oIgP8
CitedBy_id crossref_primary_10_1002_bltj_21647
ContentType Journal Article
Conference Proceeding
Copyright Distributed under a Creative Commons Attribution 4.0 International License
Copyright_xml – notice: Distributed under a Creative Commons Attribution 4.0 International License
DBID AAYXX
CITATION
1XC
VOOES
DOA
DOI 10.46298/dmtcs.3001
DatabaseName CrossRef
Hyper Article en Ligne (HAL)
Hyper Article en Ligne (HAL) (Open Access)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList

CrossRef
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
Computer Science
EISSN 1365-8050
EndPage 322
ExternalDocumentID oai_doaj_org_article_bdff33c3cefc414d80dc3a26630862fe
oai:HAL:hal-01197224v1
10_46298_dmtcs_3001
GroupedDBID -~9
.4S
.DC
29G
2WC
5GY
5VS
8FE
8FG
AAFWJ
AAYXX
ABDBF
ABJCF
ABUWG
ACGFO
ACIWK
ACUHS
ADBBV
ADQAK
AENEX
AFFHD
AFKRA
AFPKN
AIAGR
ALMA_UNASSIGNED_HOLDINGS
AMVHM
ARCSS
B0M
BCNDV
BENPR
BFMQW
BGLVJ
BPHCQ
C1A
CCPQU
CITATION
EAP
EBS
ECS
EDO
EJD
EMK
EPL
EST
ESX
GROUPED_DOAJ
HCIFZ
I-F
IAO
IBB
ICD
ITC
J9A
KQ8
KWQ
L6V
M7S
MK~
ML~
OK1
OVT
P2P
PHGZM
PHGZT
PIMPY
PQGLB
PQQKQ
PROAC
PTHSS
REM
RNS
RSU
TR2
TUS
XSB
~8M
1XC
VOOES
ID FETCH-LOGICAL-c2151-49ddffb4aa66cba8f36af76ceeac84f22cfa51663f088e3eb702ed1f3ea81ecf3
IEDL.DBID DOA
ISSN 1365-8050
1462-7264
IngestDate Fri Oct 03 12:52:14 EDT 2025
Sat Nov 29 15:00:35 EST 2025
Tue Nov 18 21:07:08 EST 2025
Sat Nov 29 02:48:24 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue Proceedings
Keywords double Mellin transform
String complexity
saddle point method
semi-complexity
double depoissonization
Language English
License Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0
LinkModel DirectLink
MeetingName 23rd International Meeting on Probabilistic, Combinatorial, and Asymptotic Methods in the Analysis of Algorithms (AofA'12)
MergedId FETCHMERGED-LOGICAL-c2151-49ddffb4aa66cba8f36af76ceeac84f22cfa51663f088e3eb702ed1f3ea81ecf3
OpenAccessLink https://doaj.org/article/bdff33c3cefc414d80dc3a26630862fe
PageCount 20
ParticipantIDs doaj_primary_oai_doaj_org_article_bdff33c3cefc414d80dc3a26630862fe
hal_primary_oai_HAL_hal_01197224v1
crossref_citationtrail_10_46298_dmtcs_3001
crossref_primary_10_46298_dmtcs_3001
PublicationCentury 2000
PublicationDate 2012-01-01
PublicationDateYYYYMMDD 2012-01-01
PublicationDate_xml – month: 01
  year: 2012
  text: 2012-01-01
  day: 01
PublicationDecade 2010
PublicationSeriesTitle DMTCS Proceedings
PublicationTitle Discrete mathematics and theoretical computer science
PublicationYear 2012
Publisher DMTCS
Discrete Mathematics and Theoretical Computer Science
Discrete Mathematics & Theoretical Computer Science
Publisher_xml – name: Discrete Mathematics and Theoretical Computer Science
– name: DMTCS
– name: Discrete Mathematics & Theoretical Computer Science
SSID ssj0012947
ssib044734695
Score 1.8250171
Snippet String complexity is defined as the cardinality of a set of all distinct words (factors) of a given string. For two strings, we define $\textit{joint string...
SourceID doaj
hal
crossref
SourceType Open Website
Open Access Repository
Enrichment Source
Index Database
StartPage 303
SubjectTerms [info.info-cg] computer science [cs]/computational geometry [cs.cg]
[info.info-dm] computer science [cs]/discrete mathematics [cs.dm]
[info.info-ds] computer science [cs]/data structures and algorithms [cs.ds]
[math.math-co] mathematics [math]/combinatorics [math.co]
Combinatorics
Computational Geometry
Computer Science
Data Structures and Algorithms
Discrete Mathematics
double depoissonization
double mellin transform
Mathematics
saddle point method
semi-complexity
string complexity
Title Joint String Complexity for Markov Sources
URI https://inria.hal.science/hal-01197224
https://doaj.org/article/bdff33c3cefc414d80dc3a26630862fe
Volume DMTCS Proceedings vol. AQ,...
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1365-8050
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0012947
  issn: 1365-8050
  databaseCode: DOA
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1365-8050
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssib044734695
  issn: 1365-8050
  databaseCode: M~E
  dateStart: 19980101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVPQU
  databaseName: Continental Europe Database
  customDbUrl:
  eissn: 1365-8050
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0012947
  issn: 1365-8050
  databaseCode: BFMQW
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/conteurope
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Engineering Database (subscription)
  customDbUrl:
  eissn: 1365-8050
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0012947
  issn: 1365-8050
  databaseCode: M7S
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1365-8050
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0012947
  issn: 1365-8050
  databaseCode: BENPR
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Publicly Available Content Database (subscription)
  customDbUrl:
  eissn: 1365-8050
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0012947
  issn: 1365-8050
  databaseCode: PIMPY
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/publiccontent
  providerName: ProQuest
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8NAEB6ketCDj6pYHyVIT4XYZne72RyrtFSxpVCFegr7xIK20sSCF3-7u3kUBcGLlwSWIcl-k-zMzma_D6DBg0jZ-U3oU0aFTxTWPtMs8k1HRSKknBEkMrGJcDRi02k0_ib15f4Jy-mBc-BaQhmDscRSG0kColhbScxtWMEuGTfajb7tMConU8X6AYpImO_GIxRFrKVeU5lc4Xah_VLGn4ym30aV57KKmkWV_j7sFumg180f4wA29LwKe6XUgld8eVXYGa7pVZNDaN4tZvPUm6SuKOc5a0drmX54NgP13O6bxcqbZFX55Age-72Hm4FfiB740kVfn0TK9lgQzimVgjODKTchtbGMS0YMQtLwTmC7b-z4oLEWYRtpFRisOQu0NPgYKvPFXJ-AR4Qw1lKF1GZFpMOEPQuKhNCOl60T1KBZQhHLghHcCVO8xHZmkOEWZ7jFDrcaNNbGbzkRxu9m1w7TtYljr84arE_jwqfxXz6twaX1yI9rDLr3sWvLddIQWQWn_3GnM9i2GRDKayrnUEmX7_oCtuQqnSXLevZO2ePws1eHzfHtcPz0BQPF1k4
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Discrete+mathematics+and+theoretical+computer+science&rft.atitle=Joint+String+Complexity+for+Markov+Sources&rft.au=Jacquet%2C+Philippe&rft.au=Szpankowski%2C+Wojciech&rft.series=DMTCS+Proceedings&rft.date=2012-01-01&rft.pub=DMTCS&rft.issn=1462-7264&rft.eissn=1365-8050&rft.volume=DMTCS+Proceedings+vol.+AQ%2C+23rd+Intern.+Meeting+on+Probabilistic%2C+Combinatorial%2C+and+Asymptotic+Methods+for+the+Analysis+of+Algorithms+%28AofA%2712%29&rft.spage=303&rft.epage=322&rft_id=info:doi/10.46298%2Fdmtcs.3001&rft.externalDBID=HAS_PDF_LINK&rft.externalDocID=oai%3AHAL%3Ahal-01197224v1
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1365-8050&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1365-8050&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1365-8050&client=summon