Content-based audio classification using collective network of binary classifiers
Uloženo v:
| Název: | Content-based audio classification using collective network of binary classifiers |
|---|---|
| Autoři: | Mäkinen, T., Kiranyaz, S., Gabbouj, M. |
| Rok vydání: | 2011 |
| Sbírka: | The Hong Kong University of Science and Technology: HKUST Institutional Repository |
| Témata: | Audio content - based classification, Evolutionary neural networks, Multilayer perceptron, Particle swarm optimization |
| Popis: | In this paper, a novel collective network of binary classifiers (CNBC) framework is presented for content-based audio classification. The topic has been studied in several publications before, but in many cases the number of different classification categories is quite limited and needed to be fixed a priori. We focus our efforts to increase both the classification accuracy and the number of classes, as well as to create a scalable network design, which allows introducing new audio classes incrementally. The approach is based on dividing a major classification problem into several networks of binary classifiers (NBCs), where each NBC adapts its internal topology according to the classification problem at hand, by using evolutionary Artificial Neural Networks (ANNs). In the current work, feed-forward ANNs, or the so-called Multilayer Perceptrons (MLPs), are evolved within an architecture space, where a stochastic optimization is applied to seek for the optimal classifier configuration and parameters. The performance evaluations of the proposed framework over an 8-class benchmark audio database demonstrate its scalability and notable potential, as classification error rates of less than 9% are achieved. © 2011 IEEE. |
| Druh dokumentu: | conference object |
| Jazyk: | English |
| Relation: | https://doi.org/10.1109/EAIS.2011.5945911 |
| DOI: | 10.1109/EAIS.2011.5945911 |
| Dostupnost: | http://repository.hkust.edu.hk/ir/Record/1783.1-53770 https://doi.org/10.1109/EAIS.2011.5945911 http://www.scopus.com/record/display.url?eid=2-s2.0-80051479510&origin=inward |
| Přístupové číslo: | edsbas.D7EE55A6 |
| Databáze: | BASE |
| FullText | Text: Availability: 0 CustomLinks: – Url: http://repository.hkust.edu.hk/ir/Record/1783.1-53770# Name: EDS - BASE (s4221598) Category: fullText Text: View record from BASE – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=M%C3%A4kinen%20T Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edsbas DbLabel: BASE An: edsbas.D7EE55A6 RelevancyScore: 788 AccessLevel: 3 PubType: Conference PubTypeId: conference PreciseRelevancyScore: 787.654235839844 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Content-based audio classification using collective network of binary classifiers – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Mäkinen%2C+T%2E%22">Mäkinen, T.</searchLink><br /><searchLink fieldCode="AR" term="%22Kiranyaz%2C+S%2E%22">Kiranyaz, S.</searchLink><br /><searchLink fieldCode="AR" term="%22Gabbouj%2C+M%2E%22">Gabbouj, M.</searchLink> – Name: DatePubCY Label: Publication Year Group: Date Data: 2011 – Name: Subset Label: Collection Group: HoldingsInfo Data: The Hong Kong University of Science and Technology: HKUST Institutional Repository – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22Audio+content+-+based+classification%22">Audio content - based classification</searchLink><br /><searchLink fieldCode="DE" term="%22Evolutionary+neural+networks%22">Evolutionary neural networks</searchLink><br /><searchLink fieldCode="DE" term="%22Multilayer+perceptron%22">Multilayer perceptron</searchLink><br /><searchLink fieldCode="DE" term="%22Particle+swarm+optimization%22">Particle swarm optimization</searchLink> – Name: Abstract Label: Description Group: Ab Data: In this paper, a novel collective network of binary classifiers (CNBC) framework is presented for content-based audio classification. The topic has been studied in several publications before, but in many cases the number of different classification categories is quite limited and needed to be fixed a priori. We focus our efforts to increase both the classification accuracy and the number of classes, as well as to create a scalable network design, which allows introducing new audio classes incrementally. The approach is based on dividing a major classification problem into several networks of binary classifiers (NBCs), where each NBC adapts its internal topology according to the classification problem at hand, by using evolutionary Artificial Neural Networks (ANNs). In the current work, feed-forward ANNs, or the so-called Multilayer Perceptrons (MLPs), are evolved within an architecture space, where a stochastic optimization is applied to seek for the optimal classifier configuration and parameters. The performance evaluations of the proposed framework over an 8-class benchmark audio database demonstrate its scalability and notable potential, as classification error rates of less than 9% are achieved. © 2011 IEEE. – Name: TypeDocument Label: Document Type Group: TypDoc Data: conference object – Name: Language Label: Language Group: Lang Data: English – Name: NoteTitleSource Label: Relation Group: SrcInfo Data: https://doi.org/10.1109/EAIS.2011.5945911 – Name: DOI Label: DOI Group: ID Data: 10.1109/EAIS.2011.5945911 – Name: URL Label: Availability Group: URL Data: http://repository.hkust.edu.hk/ir/Record/1783.1-53770<br />https://doi.org/10.1109/EAIS.2011.5945911<br />http://www.scopus.com/record/display.url?eid=2-s2.0-80051479510&origin=inward – Name: AN Label: Accession Number Group: ID Data: edsbas.D7EE55A6 |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.D7EE55A6 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1109/EAIS.2011.5945911 Languages: – Text: English Subjects: – SubjectFull: Audio content - based classification Type: general – SubjectFull: Evolutionary neural networks Type: general – SubjectFull: Multilayer perceptron Type: general – SubjectFull: Particle swarm optimization Type: general Titles: – TitleFull: Content-based audio classification using collective network of binary classifiers Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Mäkinen, T. – PersonEntity: Name: NameFull: Kiranyaz, S. – PersonEntity: Name: NameFull: Gabbouj, M. IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 01 Type: published Y: 2011 Identifiers: – Type: issn-locals Value: edsbas |
| ResultId | 1 |
Nájsť tento článok vo Web of Science