Výsledky vyhledávání - Electrical Engineering and Systems Science - Audio and Speech Processing
-
1
Procurement Of Dsp Enabled Evaluation Kit For Speech And Audio Signal Processing At The Department Of Electronics Electrical Communication Engineering Department, Iit
ISSN: 2219-0112Vydáno: Camden Disco Digital Media, Inc 21.06.2025Vydáno v MENA Report (21.06.2025)Získat plný text
Newsletter -
2
Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram
ISSN: 1070-9908, 1558-2361Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Vydáno v IEEE Signal Processing Letters (01.01.2023)Získat plný text
Journal Article -
3
Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing
ISSN: 0920-5691, 1573-1405Vydáno: Springer Science and Business Media LLC 12.01.2023Vydáno v International Journal of Computer Vision (12.01.2023)Získat plný text
Journal Article -
4
Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition
ISSN: 1556-6013, 1556-6021Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Vydáno v IEEE Transactions on Information Forensics and Security (01.01.2023)Získat plný text
Journal Article -
5
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language
ISSN: 1932-4553, 1941-0484Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.10.2022Vydáno v IEEE Journal of Selected Topics in Signal Processing (01.10.2022)Získat plný text
Journal Article -
6
Learning and controlling the source-filter representation of speech with a variational autoencoder
ISSN: 0167-6393Vydáno: Elsevier BV 01.03.2023Vydáno v Speech Communication (01.03.2023)Získat plný text
Journal Article -
7
CASA-based speaker identification using cascaded GMM-CNN classifier in noisy and emotional talking conditions
ISSN: 1568-4946Vydáno: Elsevier BV 01.05.2021Vydáno v Applied Soft Computing (01.05.2021)Získat plný text
Journal Article -
8
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech
ISSN: 0167-6393Vydáno: Elsevier BV 01.10.2020Vydáno v Speech Communication (01.10.2020)Získat plný text
Journal Article -
9
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model
ISSN: 2379-190XVydáno: IEEE 14.04.2024Vydáno v Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… In practical applications, such as singing voice synthesis, there is a demand for neural vocoders to generate high-fidelity speech waveforms with flexible pitch control…”
Získat plný text
Konferenční příspěvek -
10
A Large-Scale Evaluation of Speech Foundation Models
ISSN: 2329-9290, 2329-9304Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Vydáno v IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Získat plný text
Journal Article -
11
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction
ISSN: 2379-190XVydáno: IEEE 14.04.2024Vydáno v Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… This study extends the application of predicted MOS to the task of Fake Audio Detection (FAD) as we expect that MOS can be used to assess how close synthesized speech is to the natural human voice…”
Získat plný text
Konferenční příspěvek -
12
ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks
ISSN: 2329-9290, 2329-9304Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Vydáno v IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Získat plný text
Journal Article -
13
Bass Accompaniment Generation Via Latent Diffusion
ISSN: 2379-190XVydáno: IEEE 14.04.2024Vydáno v Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… We present a novel controllable system for generating single stems to accompany musical mixes of arbitrary length…”
Získat plný text
Konferenční příspěvek -
14
RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction
ISSN: 2329-9290, 2329-9304Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Vydáno v IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Získat plný text
Journal Article -
15
Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech
ISSN: 2379-190XVydáno: IEEE 14.04.2024Vydáno v Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“…Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system…”
Získat plný text
Konferenční příspěvek -
16
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
ISSN: 2329-9290, 2329-9304Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Vydáno v IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Získat plný text
Journal Article -
17
FastMVAE2: On Improving and Accelerating the Fast Variational Autoencoder-Based Source Separation Algorithm for Determined Mixtures
ISSN: 2329-9290, 2329-9304Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Vydáno v IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)Získat plný text
Journal Article -
18
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks
ISSN: 2329-9290, 2329-9304Vydáno: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Vydáno v IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)Získat plný text
Journal Article -
19
Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression
ISSN: 2379-190XVydáno: IEEE 14.04.2024Vydáno v Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… The exterior radiation from the loudspeakers in sound field synthesis systems can be problematic in practical situations…”
Získat plný text
Konferenční příspěvek -
20
Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
ISSN: 2076-3417Vydáno: MDPI AG 22.11.2023Vydáno v Applied Sciences (22.11.2023)Získat plný text
Journal Article