Search results - "Electrical Engineering and Systems Science - Audio and Speech Processing"
-
1
Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition
ISSN: 1556-6013, 1556-6021
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE Transactions on Information Forensics and Security (01.01.2023)
Journal Article -
2
Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram
ISSN: 1070-9908, 1558-2361
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE Signal Processing Letters (01.01.2023)
Journal Article -
3
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language
ISSN: 1932-4553, 1941-0484
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.10.2022
Published in: IEEE Journal of Selected Topics in Signal Processing (01.10.2022)
Journal Article -
4
Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing
ISSN: 0920-5691, 1573-1405
Publisher: Springer Science and Business Media LLC, 12.01.2023
Published in: International Journal of Computer Vision (12.01.2023)
Journal Article -
5
CASA-based speaker identification using cascaded GMM-CNN classifier in noisy and emotional talking conditions
ISSN: 1568-4946
Publisher: Elsevier BV, 01.05.2021
Published in: Applied Soft Computing (01.05.2021)
Journal Article -
6
Bass Accompaniment Generation Via Latent Diffusion
ISSN: 2379-190X
Publisher: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“…The ability to automatically generate music that appropriately matches an arbitrary input track is a challenging task. We present a novel controllable system…”
Conference paper -
7
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech
ISSN: 0167-6393
Publisher: Elsevier BV, 01.10.2020
Published in: Speech Communication (01.10.2020)
Journal Article -
8
Learning and controlling the source-filter representation of speech with a variational autoencoder
ISSN: 0167-6393
Publisher: Elsevier BV, 01.03.2023
Published in: Speech Communication (01.03.2023)
Journal Article -
9
RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction
ISSN: 2329-9290, 2329-9304
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article -
10
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model
ISSN: 2379-190X
Publisher: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“…This paper presents a neural vocoder based on a denoising diffusion probabilistic model (DDPM) incorporating explicit periodic signals as auxiliary…”
Conference paper -
11
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction
ISSN: 2379-190X
Publisher: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“…IEEE Automatic Mean Opinion Score (MOS) prediction is employed to evaluate the quality of synthetic speech. This study extends the application of predicted MOS…”
Conference paper -
12
FastMVAE2: On Improving and Accelerating the Fast Variational Autoencoder-Based Source Separation Algorithm for Determined Mixtures
ISSN: 2329-9290, 2329-9304
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)
Journal Article -
13
Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech
ISSN: 2379-190X
Publisher: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“…Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system. Most of the current G2P systems rely on carefully…”
Conference paper -
14
ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks
ISSN: 2329-9290, 2329-9304
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article -
15
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
ISSN: 2329-9290, 2329-9304
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article -
16
A Large-Scale Evaluation of Speech Foundation Models
ISSN: 2329-9290, 2329-9304
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article -
17
Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression
ISSN: 2379-190X
Publisher: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“…A method for synthesizing the desired sound field while suppressing the exterior radiation power with directional weighting is proposed. The exterior radiation…”
Conference paper -
18
Optimizing multi-user indoor sound communications with acoustic reconfigurable metasurfaces
ISSN: 2041-1723
Publisher: Springer Science and Business Media LLC, 10.02.2024
Published in: Nature Communications (10.02.2024)
Journal Article -
19
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
ISSN: 2691-4581
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.10.2022
Published in: IEEE Transactions on Artificial Intelligence (01.10.2022)
Journal Article -
20
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks
ISSN: 2329-9290, 2329-9304
Publisher: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)
Journal Article