Audio quality assessment in packet networks: an "inter-subjective" neural network model

Transmitting digital audio signals in real time over packet switched networks (e.g. the Internet) has set forth the need for developing signal processing algorithms that objectively evaluate audio quality. So far, the best way to assess audio quality are subjective listening tests, the most commonly...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:15th International Conference on Information Networking (ICOIN-15 2001) S. 579 - 586
Hauptverfasser: Mohamed, S., Cervantes-Perez, F., Afifi, H.
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 2001
Schlagworte:
ISBN:0769509517, 9780769509518
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Transmitting digital audio signals in real time over packet switched networks (e.g. the Internet) has set forth the need for developing signal processing algorithms that objectively evaluate audio quality. So far, the best way to assess audio quality are subjective listening tests, the most commonly used being the mean opinion score (MOS) recommended by the International Telecommunication Union (ITU). The goal of this paper is to show how artificial neural networks (ANNs) can be used to mimic the way human subjects estimate the quality of audio signals when distorted by changes in several parameters that affect the transmitted audio quality. To validate the approach, we carried out an MOS experiment for speech signals distorted by different values of IP-network parameters (e.g. loss rate, loss distribution, packetization interval, etc.), and changes in the encoding algorithm used to compress the original signal. Our results allow us to show that ANNs can capture the nonlinear mapping, between certain characteristics of audio signals and a subjective five points quality scale, "built" by a group of human subjects when participating in an MOS experiment, creating, in this way, an "inter-subjective" neural network (INN) model that might effectively "evaluate", in real time, the audio quality in packet switched networks.
ISBN:0769509517
9780769509518
DOI:10.1109/ICOIN.2001.905514