Wang, P., Li, J., Ma, M., & Fan, X. (2022, May 23). Distributed Audio-Visual Parsing Based On Multimodal Transformer and Deep Joint Source Channel Coding. Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998), 4623-4627. https://doi.org/10.1109/ICASSP43922.2022.9746660
Citace podle Chicago (17th ed.)Wang, Penghong, Jiahui Li, Mengyao Ma, a Xiaopeng Fan. "Distributed Audio-Visual Parsing Based On Multimodal Transformer and Deep Joint Source Channel Coding." Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) 23 May. 2022: 4623-4627. https://doi.org/10.1109/ICASSP43922.2022.9746660.
Citace podle MLA (9th ed.)Wang, Penghong, et al. "Distributed Audio-Visual Parsing Based On Multimodal Transformer and Deep Joint Source Channel Coding." Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998), 23 May. 2022, pp. 4623-4627, https://doi.org/10.1109/ICASSP43922.2022.9746660.