Search results - Electrical Engineering and Systems Science - Audio and Speech Processing
1
Procurement Of DSP Enabled Evaluation Kit For Speech And Audio Signal Processing At The Department Of Electronics Electrical Communication Engineering Department, IIT
ISSN: 2219-0112
Publication details: Camden Disco Digital Media, Inc, 21.06.2025
Published in: MENA Report (21.06.2025)
Newsletter
2
Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram
ISSN: 1070-9908, 1558-2361
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE Signal Processing Letters (01.01.2023)
Journal Article
3
Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing
ISSN: 0920-5691, 1573-1405
Publication details: Springer Science and Business Media LLC, 12.01.2023
Published in: International Journal of Computer Vision (12.01.2023)
Journal Article
4
Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition
ISSN: 1556-6013, 1556-6021
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE Transactions on Information Forensics and Security (01.01.2023)
Journal Article
5
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language
ISSN: 1932-4553, 1941-0484
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.10.2022
Published in: IEEE Journal of Selected Topics in Signal Processing (01.10.2022)
Journal Article
6
Learning and controlling the source-filter representation of speech with a variational autoencoder
ISSN: 0167-6393
Publication details: Elsevier BV, 01.03.2023
Published in: Speech Communication (01.03.2023)
Journal Article
7
CASA-based speaker identification using cascaded GMM-CNN classifier in noisy and emotional talking conditions
ISSN: 1568-4946
Publication details: Elsevier BV, 01.05.2021
Published in: Applied Soft Computing (01.05.2021)
Journal Article
8
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech
ISSN: 0167-6393
Publication details: Elsevier BV, 01.10.2020
Published in: Speech Communication (01.10.2020)
Journal Article
9
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model
ISSN: 2379-190X
Publication details: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“… In practical applications, such as singing voice synthesis, there is a demand for neural vocoders to generate high-fidelity speech waveforms with flexible pitch control…”
Conference paper
10
A Large-Scale Evaluation of Speech Foundation Models
ISSN: 2329-9290, 2329-9304
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article
11
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction
ISSN: 2379-190X
Publication details: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“… This study extends the application of predicted MOS to the task of Fake Audio Detection (FAD) as we expect that MOS can be used to assess how close synthesized speech is to the natural human voice…”
Conference paper
12
ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks
ISSN: 2329-9290, 2329-9304
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article
13
Bass Accompaniment Generation Via Latent Diffusion
ISSN: 2379-190X
Publication details: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“… We present a novel controllable system for generating single stems to accompany musical mixes of arbitrary length…”
Conference paper
14
RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction
ISSN: 2329-9290, 2329-9304
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article
15
Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech
ISSN: 2379-190X
Publication details: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“…Grapheme-to-Phoneme (G2P) is an essential first step in any modern, high-quality Text-to-Speech (TTS) system…”
Conference paper
16
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
ISSN: 2329-9290, 2329-9304
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2024
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2024)
Journal Article
17
FastMVAE2: On Improving and Accelerating the Fast Variational Autoencoder-Based Source Separation Algorithm for Determined Mixtures
ISSN: 2329-9290, 2329-9304
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)
Journal Article
18
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks
ISSN: 2329-9290, 2329-9304
Publication details: Institute of Electrical and Electronics Engineers (IEEE), 01.01.2023
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.01.2023)
Journal Article
19
Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression
ISSN: 2379-190X
Publication details: IEEE, 14.04.2024
Published in: Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)
“… The exterior radiation from the loudspeakers in sound field synthesis systems can be problematic in practical situations…”
Conference paper
20
Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
ISSN: 2076-3417
Publication details: MDPI AG, 22.11.2023
Published in: Applied Sciences (22.11.2023)
Journal Article