Search Results - Electrical Engineering and Systems Science - Audio and Speech Processing
-
1
Procurement Of Dsp Enabled Evaluation Kit For Speech And Audio Signal Processing At The Department Of Electronics Electrical Communication Engineering Department, Iit
ISSN: 2219-0112Published: Camden Disco Digital Media, Inc 21.06.2025Published in MENA Report (21.06.2025)Get full text
Newsletter -
2
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model
ISSN: 2379-190XPublished: IEEE 14.04.2024Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… In practical applications, such as singing voice synthesis, there is a demand for neural vocoders to generate high-fidelity speech waveforms with flexible pitch control…”
Get full text
Conference Proceeding -
3
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction
ISSN: 2379-190XPublished: IEEE 14.04.2024Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… This study extends the application of predicted MOS to the task of Fake Audio Detection (FAD) as we expect that MOS can be used to assess how close synthesized speech is to the natural human voice…”
Get full text
Conference Proceeding -
4
Bass Accompaniment Generation Via Latent Diffusion
ISSN: 2379-190XPublished: IEEE 14.04.2024Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… We present a novel controllable system for generating single stems to accompany musical mixes of arbitrary length…”
Get full text
Conference Proceeding -
5
Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech
ISSN: 2379-190XPublished: IEEE 14.04.2024Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“…Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system…”
Get full text
Conference Proceeding -
6
Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression
ISSN: 2379-190XPublished: IEEE 14.04.2024Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… The exterior radiation from the loudspeakers in sound field synthesis systems can be problematic in practical situations…”
Get full text
Conference Proceeding -
7
Restoring speech intelligibility for hearing aid users with deep learning
ISSN: 2045-2322Published: Springer Science and Business Media LLC 15.02.2023Published in Scientific Reports (15.02.2023)Get full text
Journal Article -
8
Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram
ISSN: 1070-9908, 1558-2361Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Published in IEEE Signal Processing Letters (01.01.2023)Get full text
Journal Article -
9
Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing
ISSN: 0920-5691, 1573-1405Published: Springer Science and Business Media LLC 12.01.2023Published in International Journal of Computer Vision (12.01.2023)Get full text
Journal Article -
10
Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition
ISSN: 1556-6013, 1556-6021Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Published in IEEE Transactions on Information Forensics and Security (01.01.2023)Get full text
Journal Article -
11
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language
ISSN: 1932-4553, 1941-0484Published: Institute of Electrical and Electronics Engineers (IEEE) 01.10.2022Published in IEEE Journal of Selected Topics in Signal Processing (01.10.2022)Get full text
Journal Article -
12
Learning and controlling the source-filter representation of speech with a variational autoencoder
ISSN: 0167-6393Published: Elsevier BV 01.03.2023Published in Speech Communication (01.03.2023)Get full text
Journal Article -
13
CASA-based speaker identification using cascaded GMM-CNN classifier in noisy and emotional talking conditions
ISSN: 1568-4946Published: Elsevier BV 01.05.2021Published in Applied Soft Computing (01.05.2021)Get full text
Journal Article -
14
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech
ISSN: 0167-6393Published: Elsevier BV 01.10.2020Published in Speech Communication (01.10.2020)Get full text
Journal Article -
15
A Large-Scale Evaluation of Speech Foundation Models
ISSN: 2329-9290, 2329-9304Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Get full text
Journal Article -
16
ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks
ISSN: 2329-9290, 2329-9304Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Get full text
Journal Article -
17
RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction
ISSN: 2329-9290, 2329-9304Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Get full text
Journal Article -
18
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
ISSN: 2329-9290, 2329-9304Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Get full text
Journal Article -
19
FastMVAE2: On Improving and Accelerating the Fast Variational Autoencoder-Based Source Separation Algorithm for Determined Mixtures
ISSN: 2329-9290, 2329-9304Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)Get full text
Journal Article -
20
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks
ISSN: 2329-9290, 2329-9304Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)Get full text
Journal Article