Search results - "Electrical Engineering and Systems Science - Audio and Speech Processing"

  1.

    Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition by Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka

    ISSN: 1556-6013, 1556-6021
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023
    Full text
    Journal Article
  2.

    Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram by Keidai Arai, Koki Yamada, Kohei Yatabe

    ISSN: 1070-9908, 1558-2361
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023
    Published in IEEE Signal Processing Letters (01.01.2023)
    Full text
    Journal Article
  3.

    Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language by Yusuke Yasuda, Tomoki Toda

    ISSN: 1932-4553, 1941-0484
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.10.2022
    Full text
    Journal Article
  4.

    Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing by Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda

    ISSN: 0920-5691, 1573-1405
    Published: Springer Science and Business Media LLC 12.01.2023
    Published in International Journal of Computer Vision (12.01.2023)
    Full text
    Journal Article
  6.

    Bass Accompaniment Generation Via Latent Diffusion by Marco Pasini, Maarten Grachten, Stefan Lattner

    ISSN: 2379-190X
    Published: IEEE 14.04.2024
    “… The ability to automatically generate music that appropriately matches an arbitrary input track is a challenging task. We present a novel controllable system …”
    Full text
    Conference Proceeding
  9.

    RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction by Takahiro Fukumori, Taito Ishida, Yoichi Yamashita

    ISSN: 2329-9290, 2329-9304
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024
    Full text
    Journal Article
  10.

    PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model by Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda

    ISSN: 2379-190X
    Published: IEEE 14.04.2024
    “… This paper presents a neural vocoder based on a denoising diffusion probabilistic model (DDPM) incorporating explicit periodic signals as auxiliary …”
    Full text
    Conference Proceeding
  11.

    MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction by Wangjin Zhou, Zhengdong Yang, Chenhui Chu, Sheng Li, Raj Dabre, Yi Zhao, Tatsuya Kawahara

    ISSN: 2379-190X
    Published: IEEE 14.04.2024
    “… Automatic Mean Opinion Score (MOS) prediction is employed to evaluate the quality of synthetic speech. This study extends the application of predicted MOS …”
    Full text
    Conference Proceeding
  13.

    Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech by Abhinav Garg, Jiyeon Kim, Sushil Khyalia, Chanwoo Kim, Dhananjaya Gowda

    ISSN: 2379-190X
    Published: IEEE 14.04.2024
    “… Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system. Most of the current G2P systems rely on carefully …”
    Full text
    Conference Proceeding
  15.

    ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks by Nakamasa Inoue, Shinta Otake, Takumi Hirose, Masanari Ohi, Rei Kawakami

    ISSN: 2329-9290, 2329-9304
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024
    Full text
    Journal Article
  17.

    Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression by Yoshihide Tomita, Shoichi Koyama, Hiroshi Saruwatari

    ISSN: 2379-190X
    Published: IEEE 14.04.2024
    “… A method for synthesizing the desired sound field while suppressing the exterior radiation power with directional weighting is proposed. The exterior radiation …”
    Full text
    Conference Proceeding
  18.

    Optimizing multi-user indoor sound communications with acoustic reconfigurable metasurfaces by Hongkuan Zhang, Qiyuan Wang, Mathias Fink, Guancong Ma

    ISSN: 2041-1723
    Published: Springer Science and Business Media LLC 10.02.2024
    Published in Nature Communications (10.02.2024)
    Full text
    Journal Article
  20.

    Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks by Zhaojie Luo, Shoufeng Lin, Rui Liu, Jun Baba, Yuichiro Yoshikawa, Hiroshi Ishiguro

    ISSN: 2329-9290, 2329-9304
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023
    Full text
    Journal Article