Suchergebnisse - Electrical Engineering and Systems Science - Audio and Speech Processing

  1. 1
  2. 2

    Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram von Keidai Arai, Koki Yamada, Kohei Yatabe

    ISSN: 1070-9908, 1558-2361
    Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023
    Veröffentlicht in IEEE Signal Processing Letters (01.01.2023)
    Volltext
    Journal Article
  3. 3

    Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing von Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda

    ISSN: 0920-5691, 1573-1405
    Veröffentlicht: Springer Science and Business Media LLC 12.01.2023
    Veröffentlicht in International Journal of Computer Vision (12.01.2023)
    Volltext
    Journal Article
  4. 4

    Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition von Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka

    ISSN: 1556-6013, 1556-6021
    Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023
    Volltext
    Journal Article
  5. 5

    Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language von Yusuke Yasuda, Tomoki Toda

    ISSN: 1932-4553, 1941-0484
    Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.10.2022
    Volltext
    Journal Article
  6. 6
  7. 7
  8. 8
  9. 9

    PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model von Hono, Yukiya, Hashimoto, Kei, Nankaku, Yoshihiko, Tokuda, Keiichi

    ISSN: 2379-190X
    Veröffentlicht: IEEE 14.04.2024
    “… In practical applications, such as singing voice synthesis, there is a demand for neural vocoders to generate high-fidelity speech waveforms with flexible pitch control …”
    Volltext
    Tagungsbericht
  10. 10
  11. 11

    MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction von Zhou, Wangjin, Yang, Zhengdong, Chu, Chenhui, Li, Sheng, Dabre, Raj, Zhao, Yi, Tatsuya, Kawahara

    ISSN: 2379-190X
    Veröffentlicht: IEEE 14.04.2024
    “… This study extends the application of predicted MOS to the task of Fake Audio Detection (FAD) as we expect that MOS can be used to assess how close synthesized speech is to the natural human voice …”
    Volltext
    Tagungsbericht
  12. 12

    ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks von Nakamasa Inoue, Shinta Otake, Takumi Hirose, Masanari Ohi, Rei Kawakami

    ISSN: 2329-9290, 2329-9304
    Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024
    Volltext
    Journal Article
  13. 13

    Bass Accompaniment Generation Via Latent Diffusion von Pasini, Marco, Grachten, Maarten, Lattner, Stefan

    ISSN: 2379-190X
    Veröffentlicht: IEEE 14.04.2024
    “… We present a novel controllable system for generating single stems to accompany musical mixes of arbitrary length …”
    Volltext
    Tagungsbericht
  14. 14

    RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction von Takahiro Fukumori, Taito Ishida, Yoichi Yamashita

    ISSN: 2329-9290, 2329-9304
    Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024
    Volltext
    Journal Article
  15. 15

    Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech von Garg, Abhinav, Kim, Jiyeon, Khyalia, Sushil, Kim, Chanwoo, Gowda, Dhananjaya

    ISSN: 2379-190X
    Veröffentlicht: IEEE 14.04.2024
    “… Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system …”
    Volltext
    Tagungsbericht
  16. 16
  17. 17
  18. 18

    Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks von Zhaojie Luo, Shoufeng Lin, Rui Liu, Jun Baba, Yuichiro Yoshikawa, Hiroshi Ishiguro

    ISSN: 2329-9290, 2329-9304
    Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023
    Volltext
    Journal Article
  19. 19

    Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression von Tomita, Yoshihide, Koyama, Shoichi, Saruwatari, Hiroshi

    ISSN: 2379-190X
    Veröffentlicht: IEEE 14.04.2024
    “… The exterior radiation from the loudspeakers in sound field synthesis systems can be problematic in practical situations …”
    Volltext
    Tagungsbericht
  20. 20