Search Results - MelSpectrogram*
-
1
A Novel Melspectrogram Snippet Representation Learning Framework for Severity Detection of Chronic Obstructive Pulmonary Diseases
ISSN: 0018-9456, 1557-9662Published: New York IEEE 2023Published in IEEE transactions on instrumentation and measurement (2023)“…A chronic obstructive pulmonary disease (COPD) is a major public health concern across the world. Since it is an incurable disease, early detection and…”
Get full text
Journal Article -
2
A comparative study of the spectrogram, scalogram, melspectrogram and gammatonegram time-frequency representations for the classification of lung sounds using the ICBHI database based on CNNs
ISSN: 1862-278X, 1862-278XPublished: 26.10.2022Published in Biomedizinische Technik (26.10.2022)“… This study aims to evaluate and compare the performance of the spectrogram, scalogram, melspectrogram and gammatonegram representations, and provide comparative information to users regarding…”
Get more information
Journal Article -
3
Quad-Net: Melspectrogram Vocoder with Convolutional Layers Restricted by the Quadrature Mirror Filter for Perfect Reconstruction
ISSN: 2379-190XPublished: IEEE 06.04.2025Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (06.04.2025)“…Recently, neural vocoders have applied signal processing methods to synthesize speech to reduce computational complexity. However, most methods lack the…”
Get full text
Conference Proceeding -
4
Improvise approach for respiratory pathologies classification with multilayer convolutional neural networks
ISSN: 1380-7501, 1573-7721Published: New York Springer US 01.11.2022Published in Multimedia tools and applications (01.11.2022)“… The combination of pre-processing steps MFCC, Melspectrogram, and Chroma CENS with CNN improvise the performance of the proposed system…”
Get full text
Journal Article -
5
A hybrid noise robust model for multireplay attack detection in Automatic speaker verification systems
ISSN: 1746-8094, 1746-8108Published: Elsevier Ltd 01.04.2022Published in Biomedical signal processing and control (01.04.2022)“…Biometric Systems are automatic methods of verifying the identity of a person based on some characteristics such as fingerprint, face, speech etc. Speech…”
Get full text
Journal Article -
6
Multi-View Spectrogram Transformer for Respiratory Sound Classification
ISSN: 2379-190XPublished: IEEE 14.04.2024Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (14.04.2024)“…Deep neural networks have been applied to audio spectrograms for respiratory sound classification. Existing models often treat the spectrogram as a synthetic…”
Get full text
Conference Proceeding -
7
Classification of Human and Synthesized Audio in Marathi Speech using Melspectrogram Analysis and Convolutional Neural Network (CNN) Model
Published: IEEE 19.12.2024Published in 2024 International Conference on Communication, Computing, Smart Materials and Devices (ICCCSMD) (19.12.2024)“… We used Melspectrograms-based pre-processing to extract key features such as formants and harmonics…”
Get full text
Conference Proceeding -
8
Deep Learning-Driven Phonopneumographic Analysis For Pulmonary Disease Recognition Using Dft And Melspectrograms
ISSN: 2582-2160, 2582-2160Published: 22.03.2025Published in International Journal For Multidisciplinary Research (22.03.2025)“… With advancements in deep learning, novel approaches using Digital Fourier Transform (DFT) and MelSpectrograms have emerged for automated and accurate disease recognition…”
Get full text
Journal Article -
9
Melspectrogram Based Music Genre Classification System Using Vision Transformer
Published: IEEE 07.12.2024Published in 2024 IEEE International Conference on Intelligent Signal Processing and Effective Communication Technologies (INSPECT) (07.12.2024)“…In recent years, the popularity of music recommendation systems has surged, driven by the growth of diverse music content and the variety of digital music…”
Get full text
Conference Proceeding -
10
Audio Emotion Mapping using Enhanced Techniques with Augmented Log-Melspectrograms
Published: IEEE 23.08.2024Published in 2024 4th Asian Conference on Innovation in Technology (ASIANCON) (23.08.2024)“…Using a novel approach based on log-Mel spectrogram with augmentation, this research piece delves into the realm of audio emotion classification. Although it…”
Get full text
Conference Proceeding -
11
A Symphony of Sentiments using Log-Melspectrogram Techniques for Emotional Classification
Published: IEEE 08.08.2024Published in 2024 7th International Conference on Circuit Power and Computing Technologies (ICCPCT) (08.08.2024)“…This research paper investigates the topic of audio emotion categorization via a novel approach involving log-Mel spectrogram with augmentation. Though it is…”
Get full text
Conference Proceeding -
12
Harmonizing Emotions: A Novel Approach to Audio Emotion Classification using Log-Melspectrogram with Augmentation
Published: IEEE 17.04.2024Published in 2024 International Conference on Communication, Computing and Internet of Things (IC3IoT) (17.04.2024)“…This study article explores the field of audio emotion categorization, using a unique method that involves log-Mel spectrogram with augmentation. The research…”
Get full text
Conference Proceeding -
13
Emotional Resonance Unleashed by exploring Novel Audio Classification Techniques with Log- Melspectrogram Augmentation
Published: IEEE 05.06.2024Published in 2024 OPJU International Technology Conference (OTCON) on Smart Computing for Innovation and Advancement in Industry 4.0 (05.06.2024)“…This research work examines the domain of audio emotion categorization, employing a unique approach that incorporates log-Mel spectrogram with augmentation…”
Get full text
Conference Proceeding -
14
Automatic Speech Recognition using the Melspectrogram-based method for English Phonemes
Published: IEEE 14.12.2022Published in 2022 International Conference on Computer, Power and Communications (ICCPC) (14.12.2022)“…An automatic speech recognition (ASR) technique may be set up to forecast the pronunciation of textual identifiers (such as song names) based on assumptions…”
Get full text
Conference Proceeding -
15
Audio Classification for Feature-Based Majority Voting Optimization and Hyperparametric Tuning
Published: European Association for Signal Processing - EURASIP 08.09.2025Published in 2025 33rd European Signal Processing Conference (EUSIPCO) (08.09.2025)“…This paper presents an optimized audio recognition system that integrates feature-based and deep learning approaches, fine-tuned for high-accuracy…”
Get full text
Conference Proceeding -
16
Noise Pollution Classification Using Deep Learning
Published: IEEE 22.08.2025Published in 2025 International Conference on Sustainability, Innovation & Technology (ICSIT) (22.08.2025)“…With the rise and rapid growth in industrialization as well as urbanization, noise pollution has become a significant yet often overlooked threat to our…”
Get full text
Conference Proceeding -
17
Optimizing Audio Recognition for Assistive Robotics with Feature Optimization, Machine Learning and Data Augmentation
ISSN: 2836-9866Published: IEEE 29.05.2025Published in International Conference on Engineering of Modern Electric Systems (Online) (29.05.2025)“…Audio recognition plays a crucial role in assistive robotics, enabling intelligent systems to interpret and respond to various sound inputs. This study…”
Get full text
Conference Proceeding -
18
BirdClassifier: An Advanced Bird Classification Model Using Deep Neural Networks
ISSN: 2767-7788Published: IEEE 23.04.2025Published in International Conference on Inventive Computation Technologies (Online) (23.04.2025)“…Automated bird species classification plays a crucial role in biodiversity conservation and ecological monitoring. This study proposes a hybrid deep learning…”
Get full text
Conference Proceeding -
19
Exploring Human Non-Speech Sound Recognition: Insights from the Nonspeech7K Dataset
Published: IEEE 28.02.2025Published in 2025 International Conference on Innovation in Computing and Engineering (ICE) (28.02.2025)“…Analysis of non-speech sounds produced by humans is an area of speech recognition that has largely not been paid much heed. The lack of proper exhaustive…”
Get full text
Conference Proceeding -
20
Accurate Anomia Severity Detection in Post-Stroke Aphasia Patients using MobileNetV2
ISSN: 2473-7674Published: IEEE 24.06.2024Published in International Conference on Computing, Communication, and Networking Technologies (Online) (24.06.2024)“… In the proposed method, we train the melspectrograms taken from the audio recordings of the aphasia patients' using MobileNetV2…”
Get full text
Conference Proceeding