Výsledky vyhledávání - Electrical Engineering and Systems Science - Audio and Speech Processing

Upřesnit hledání
  1. 1
  2. 2

    Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram Autor Keidai Arai, Koki Yamada, Kohei Yatabe

    ISSN: 1070-9908, 1558-2361
    Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023
    Vydáno v IEEE Signal Processing Letters (01.01.2023)
    Získat plný text
    Journal Article
  3. 3
  4. 4
  5. 5

    Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language Autor Yusuke Yasuda, Tomoki Toda

    ISSN: 1932-4553, 1941-0484
    Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.10.2022
    Získat plný text
    Journal Article
  6. 6
  7. 7
  8. 8
  9. 9

    PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model Autor Hono, Yukiya, Hashimoto, Kei, Nankaku, Yoshihiko, Tokuda, Keiichi

    ISSN: 2379-190X
    Vydáno: IEEE 14.04.2024
    “… In practical applications, such as singing voice synthesis, there is a demand for neural vocoders to generate high-fidelity speech waveforms with flexible pitch control…”
    Získat plný text
    Konferenční příspěvek
  10. 10
  11. 11

    MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction Autor Zhou, Wangjin, Yang, Zhengdong, Chu, Chenhui, Li, Sheng, Dabre, Raj, Zhao, Yi, Tatsuya, Kawahara

    ISSN: 2379-190X
    Vydáno: IEEE 14.04.2024
    “… This study extends the application of predicted MOS to the task of Fake Audio Detection (FAD) as we expect that MOS can be used to assess how close synthesized speech is to the natural human voice…”
    Získat plný text
    Konferenční příspěvek
  12. 12
  13. 13

    Bass Accompaniment Generation Via Latent Diffusion Autor Pasini, Marco, Grachten, Maarten, Lattner, Stefan

    ISSN: 2379-190X
    Vydáno: IEEE 14.04.2024
    “… We present a novel controllable system for generating single stems to accompany musical mixes of arbitrary length…”
    Získat plný text
    Konferenční příspěvek
  14. 14
  15. 15

    Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech Autor Garg, Abhinav, Kim, Jiyeon, Khyalia, Sushil, Kim, Chanwoo, Gowda, Dhananjaya

    ISSN: 2379-190X
    Vydáno: IEEE 14.04.2024
    “…Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system…”
    Získat plný text
    Konferenční příspěvek
  16. 16
  17. 17
  18. 18
  19. 19

    Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression Autor Tomita, Yoshihide, Koyama, Shoichi, Saruwatari, Hiroshi

    ISSN: 2379-190X
    Vydáno: IEEE 14.04.2024
    “… The exterior radiation from the loudspeakers in sound field synthesis systems can be problematic in practical situations…”
    Získat plný text
    Konferenční příspěvek
  20. 20