Suchergebnisse - "Electrical Engineering and Systems Science - Audio and Speech Processing"
-
1
Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition
ISSN: 1556-6013, 1556-6021Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Veröffentlicht in IEEE Transactions on Information Forensics and Security (01.01.2023)Volltext
Journal Article -
2
Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram
ISSN: 1070-9908, 1558-2361Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Veröffentlicht in IEEE Signal Processing Letters (01.01.2023)Volltext
Journal Article -
3
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language
ISSN: 1932-4553, 1941-0484Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.10.2022Veröffentlicht in IEEE Journal of Selected Topics in Signal Processing (01.10.2022)Volltext
Journal Article -
4
Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing
ISSN: 0920-5691, 1573-1405Veröffentlicht: Springer Science and Business Media LLC 12.01.2023Veröffentlicht in International Journal of Computer Vision (12.01.2023)Volltext
Journal Article -
5
CASA-based speaker identification using cascaded GMM-CNN classifier in noisy and emotional talking conditions
ISSN: 1568-4946Veröffentlicht: Elsevier BV 01.05.2021Veröffentlicht in Applied Soft Computing (01.05.2021)Volltext
Journal Article -
6
Bass Accompaniment Generation Via Latent Diffusion
ISSN: 2379-190XVeröffentlicht: IEEE 14.04.2024Veröffentlicht in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… The ability to automatically generate music that appropriately matches an arbitrary input track is a challenging task. We present a novel controllable system …”
Volltext
Tagungsbericht -
7
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech
ISSN: 0167-6393Veröffentlicht: Elsevier BV 01.10.2020Veröffentlicht in Speech Communication (01.10.2020)Volltext
Journal Article -
8
Learning and controlling the source-filter representation of speech with a variational autoencoder
ISSN: 0167-6393Veröffentlicht: Elsevier BV 01.03.2023Veröffentlicht in Speech Communication (01.03.2023)Volltext
Journal Article -
9
RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction
ISSN: 2329-9290, 2329-9304Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Veröffentlicht in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Volltext
Journal Article -
10
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model
ISSN: 2379-190XVeröffentlicht: IEEE 14.04.2024Veröffentlicht in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… This paper presents a neural vocoder based on a denoising diffusion probabilistic model (DDPM) incorporating explicit periodic signals as auxiliary …”
Volltext
Tagungsbericht -
11
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction
ISSN: 2379-190XVeröffentlicht: IEEE 14.04.2024Veröffentlicht in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… IEEE Automatic Mean Opinion Score (MOS) prediction is employed to evaluate the quality of synthetic speech. This study extends the application of predicted MOS …”
Volltext
Tagungsbericht -
12
FastMVAE2: On Improving and Accelerating the Fast Variational Autoencoder-Based Source Separation Algorithm for Determined Mixtures
ISSN: 2329-9290, 2329-9304Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Veröffentlicht in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)Volltext
Journal Article -
13
Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech
ISSN: 2379-190XVeröffentlicht: IEEE 14.04.2024Veröffentlicht in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system. Most of the current G2P systems rely on carefully …”
Volltext
Tagungsbericht -
14
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
ISSN: 2329-9290, 2329-9304Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Veröffentlicht in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Volltext
Journal Article -
15
ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks
ISSN: 2329-9290, 2329-9304Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Veröffentlicht in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Volltext
Journal Article -
16
A Large-Scale Evaluation of Speech Foundation Models
ISSN: 2329-9290, 2329-9304Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024Veröffentlicht in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)Volltext
Journal Article -
17
Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression
ISSN: 2379-190XVeröffentlicht: IEEE 14.04.2024Veröffentlicht in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“… A method for synthesizing the desired sound field while suppressing the exterior radiation power with directional weighting is proposed. The exterior radiation …”
Volltext
Tagungsbericht -
18
Optimizing multi-user indoor sound communications with acoustic reconfigurable metasurfaces
ISSN: 2041-1723Veröffentlicht: Springer Science and Business Media LLC 10.02.2024Veröffentlicht in Nature Communications (10.02.2024)Volltext
Journal Article -
19
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
ISSN: 2691-4581Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.10.2022Veröffentlicht in IEEE Transactions on Artificial Intelligence (01.10.2022)Volltext
Journal Article -
20
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks
ISSN: 2329-9290, 2329-9304Veröffentlicht: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2023Veröffentlicht in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)Volltext
Journal Article