Wang, X., Takaki, S., Yamagishi, J., King, S., & Tokuda, K. (2020). A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural $F_0$ Model for Statistical Parametric Speech Synthesis. IEEE/ACM transactions on audio, speech, and language processing, 28, 157-170. https://doi.org/10.1109/TASLP.2019.2950099
Chicago Style (17th ed.) CitationWang, Xin, Shinji Takaki, Junichi Yamagishi, Simon King, and Keiichi Tokuda. "A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural $F_0$ Model for Statistical Parametric Speech Synthesis." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 157-170. https://doi.org/10.1109/TASLP.2019.2950099.
MLA (9th ed.) CitationWang, Xin, et al. "A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural $F_0$ Model for Statistical Parametric Speech Synthesis." IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, 2020, pp. 157-170, https://doi.org/10.1109/TASLP.2019.2950099.