Content-based audio classification using collective network of binary classifiers

Uloženo v:
Podrobná bibliografie
Název: Content-based audio classification using collective network of binary classifiers
Autoři: Mäkinen, T., Kiranyaz, S., Gabbouj, M.
Rok vydání: 2011
Sbírka: The Hong Kong University of Science and Technology: HKUST Institutional Repository
Témata: Audio content - based classification, Evolutionary neural networks, Multilayer perceptron, Particle swarm optimization
Popis: In this paper, a novel collective network of binary classifiers (CNBC) framework is presented for content-based audio classification. The topic has been studied in several publications before, but in many cases the number of different classification categories is quite limited and needed to be fixed a priori. We focus our efforts to increase both the classification accuracy and the number of classes, as well as to create a scalable network design, which allows introducing new audio classes incrementally. The approach is based on dividing a major classification problem into several networks of binary classifiers (NBCs), where each NBC adapts its internal topology according to the classification problem at hand, by using evolutionary Artificial Neural Networks (ANNs). In the current work, feed-forward ANNs, or the so-called Multilayer Perceptrons (MLPs), are evolved within an architecture space, where a stochastic optimization is applied to seek for the optimal classifier configuration and parameters. The performance evaluations of the proposed framework over an 8-class benchmark audio database demonstrate its scalability and notable potential, as classification error rates of less than 9% are achieved. © 2011 IEEE.
Druh dokumentu: conference object
Jazyk: English
Relation: https://doi.org/10.1109/EAIS.2011.5945911
DOI: 10.1109/EAIS.2011.5945911
Dostupnost: http://repository.hkust.edu.hk/ir/Record/1783.1-53770
https://doi.org/10.1109/EAIS.2011.5945911
http://www.scopus.com/record/display.url?eid=2-s2.0-80051479510&origin=inward
Přístupové číslo: edsbas.D7EE55A6
Databáze: BASE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: http://repository.hkust.edu.hk/ir/Record/1783.1-53770#
    Name: EDS - BASE (s4221598)
    Category: fullText
    Text: View record from BASE
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=M%C3%A4kinen%20T
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsbas
DbLabel: BASE
An: edsbas.D7EE55A6
RelevancyScore: 788
AccessLevel: 3
PubType: Conference
PubTypeId: conference
PreciseRelevancyScore: 787.654235839844
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Content-based audio classification using collective network of binary classifiers
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Mäkinen%2C+T%2E%22">Mäkinen, T.</searchLink><br /><searchLink fieldCode="AR" term="%22Kiranyaz%2C+S%2E%22">Kiranyaz, S.</searchLink><br /><searchLink fieldCode="AR" term="%22Gabbouj%2C+M%2E%22">Gabbouj, M.</searchLink>
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2011
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: The Hong Kong University of Science and Technology: HKUST Institutional Repository
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Audio+content+-+based+classification%22">Audio content - based classification</searchLink><br /><searchLink fieldCode="DE" term="%22Evolutionary+neural+networks%22">Evolutionary neural networks</searchLink><br /><searchLink fieldCode="DE" term="%22Multilayer+perceptron%22">Multilayer perceptron</searchLink><br /><searchLink fieldCode="DE" term="%22Particle+swarm+optimization%22">Particle swarm optimization</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: In this paper, a novel collective network of binary classifiers (CNBC) framework is presented for content-based audio classification. The topic has been studied in several publications before, but in many cases the number of different classification categories is quite limited and needed to be fixed a priori. We focus our efforts to increase both the classification accuracy and the number of classes, as well as to create a scalable network design, which allows introducing new audio classes incrementally. The approach is based on dividing a major classification problem into several networks of binary classifiers (NBCs), where each NBC adapts its internal topology according to the classification problem at hand, by using evolutionary Artificial Neural Networks (ANNs). In the current work, feed-forward ANNs, or the so-called Multilayer Perceptrons (MLPs), are evolved within an architecture space, where a stochastic optimization is applied to seek for the optimal classifier configuration and parameters. The performance evaluations of the proposed framework over an 8-class benchmark audio database demonstrate its scalability and notable potential, as classification error rates of less than 9% are achieved. © 2011 IEEE.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: conference object
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: NoteTitleSource
  Label: Relation
  Group: SrcInfo
  Data: https://doi.org/10.1109/EAIS.2011.5945911
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.1109/EAIS.2011.5945911
– Name: URL
  Label: Availability
  Group: URL
  Data: http://repository.hkust.edu.hk/ir/Record/1783.1-53770<br />https://doi.org/10.1109/EAIS.2011.5945911<br />http://www.scopus.com/record/display.url?eid=2-s2.0-80051479510&origin=inward
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsbas.D7EE55A6
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.D7EE55A6
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1109/EAIS.2011.5945911
    Languages:
      – Text: English
    Subjects:
      – SubjectFull: Audio content - based classification
        Type: general
      – SubjectFull: Evolutionary neural networks
        Type: general
      – SubjectFull: Multilayer perceptron
        Type: general
      – SubjectFull: Particle swarm optimization
        Type: general
    Titles:
      – TitleFull: Content-based audio classification using collective network of binary classifiers
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Mäkinen, T.
      – PersonEntity:
          Name:
            NameFull: Kiranyaz, S.
      – PersonEntity:
          Name:
            NameFull: Gabbouj, M.
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2011
          Identifiers:
            – Type: issn-locals
              Value: edsbas
ResultId 1