Generalized Zero-Shot Learning Using Conditional Wasserstein Autoencoder

Generalized zero-shot learning (GZSL) is a technique to train a deep learning model to identify unseen classes. Conventionally, conditional generative models have been employed to generate training data for unseen classes from the attribute. In this paper, we propose a new conditional generative mod...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) s. 3413 - 3417
Hlavní autoři:	Kim, Junhan, Shim, Byonghyo
Médium:	Konferenční příspěvek
Jazyk:	angličtina
Vydáno:	IEEE 23.05.2022
Témata:	Acoustics Benchmark testing Conferences Data models Deep learning Generalized zero-shot learning generative adversarial network generative model Signal processing Training data variational autoencoder
ISSN:	2379-190X
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Abstract	Generalized zero-shot learning (GZSL) is a technique to train a deep learning model to identify unseen classes. Conventionally, conditional generative models have been employed to generate training data for unseen classes from the attribute. In this paper, we propose a new conditional generative model that improves the GZSL performance greatly. In a nutshell, the proposed model, called conditional Wasserstein autoencoder (CWAE), minimizes the Wasserstein distance between the real and generated image feature distributions using an encoder-decoder architecture. From the extensive experiments on various benchmark datasets, we show that the proposed CWAE outperforms conventional generative models in terms of the GZSL classification performance.
AbstractList	Generalized zero-shot learning (GZSL) is a technique to train a deep learning model to identify unseen classes. Conventionally, conditional generative models have been employed to generate training data for unseen classes from the attribute. In this paper, we propose a new conditional generative model that improves the GZSL performance greatly. In a nutshell, the proposed model, called conditional Wasserstein autoencoder (CWAE), minimizes the Wasserstein distance between the real and generated image feature distributions using an encoder-decoder architecture. From the extensive experiments on various benchmark datasets, we show that the proposed CWAE outperforms conventional generative models in terms of the GZSL classification performance.
Author	Kim, Junhan Shim, Byonghyo
Author_xml	– sequence: 1 givenname: Junhan surname: Kim fullname: Kim, Junhan email: junhankim@islab.snu.ac.kr organization: Seoul National University,Department of Electrical and Computer Engineering,Seoul,Korea – sequence: 2 givenname: Byonghyo surname: Shim fullname: Shim, Byonghyo email: bshim@islab.snu.ac.kr organization: Seoul National University,Department of Electrical and Computer Engineering,Seoul,Korea
BookMark	eNotj81KAzEUhaMo2FafwM28wNTcJM3PsgzaCgMKtShuSprcaKQmkowLffpWLBy-s_s4Z0zOUk5ISAN0CkDNzX03X60eBTeMTRk9wCihlIATMgYpZ4IeIk_JiHFlWjD05YKMa_2glGol9IgsF5iw2F38Rd-8Ysnt6j0PTY-2pJjemnX9Y5eTj0PMye6aZ1srljpgTM38e8iYXPZYLsl5sLuKV8eekPXd7VO3bPuHxWFk30ZG-dA6IagSIcyCRx-01Nst41ZxbZUMTrGgrRYOuEU0EMB4lF4aJ7cACsAhn5Drf29ExM1XiZ-2_GyOr_keNYVQiQ
ContentType	Conference Proceeding
DBID	6IE 6IH CBEJK RIE RIO
DOI	10.1109/ICASSP43922.2022.9747741
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISBN	1665405406 9781665405409
EISSN	2379-190X
EndPage	3417
ExternalDocumentID	9747741
Genre	orig-research
GrantInformation_xml	– fundername: National Research Foundation funderid: 10.13039/501100001321 – fundername: Samsung funderid: 10.13039/100004358
GroupedDBID	23M 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ABLEC ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP IPLJI M43 OCL RIE RIL RIO RNS
ID	FETCH-LOGICAL-i203t-c44074ff5fdedf868bb23a738a76fc72f8a84c13aee91f19de6d69c6b11711ce3
IEDL.DBID	RIE
ISICitedReferencesCount	2
ISICitedReferencesURI	http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000864187903140&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate	Wed Aug 27 02:25:03 EDT 2025
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i203t-c44074ff5fdedf868bb23a738a76fc72f8a84c13aee91f19de6d69c6b11711ce3
PageCount	5
ParticipantIDs	ieee_primary_9747741
PublicationCentury	2000
PublicationDate	2022-May-23
PublicationDateYYYYMMDD	2022-05-23
PublicationDate_xml	– month: 05 year: 2022 text: 2022-May-23 day: 23
PublicationDecade	2020
PublicationTitle	Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998)
PublicationTitleAbbrev	ICASSP
PublicationYear	2022
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0008748
Score	2.2046373
Snippet	Generalized zero-shot learning (GZSL) is a technique to train a deep learning model to identify unseen classes. Conventionally, conditional generative models...
SourceID	ieee
SourceType	Publisher
StartPage	3413
SubjectTerms	Acoustics Benchmark testing Conferences Data models Deep learning Generalized zero-shot learning generative adversarial network generative model Signal processing Training data variational autoencoder
Title	Generalized Zero-Shot Learning Using Conditional Wasserstein Autoencoder
URI	https://ieeexplore.ieee.org/document/9747741
WOSCitedRecordID	wos000864187903140&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA61eNCLj1Z8k4NH1zab3U1yLMWiIKVQxeKlZDMT3UtX1q0Hf71JulYFL15CCCSBb0gmj_m-IeQitqlCxdPIuB0ySoBj5MzsFp5lAJlRaQrB0ndiPJazmZq0yOWaC4OIIfgMr3w1_OVDaZb-qaznz77Cs9Q3hMhWXK31ritFIr8idfqqdzscTKcT521jz7ZyRdP3VxKV4ENGO_-bfZd0v8l4dLJ2M3ukhYt9sv1DR7BDbhrx6OIDgT5hVUbTl7KmjXbqMw1xAdSNB8Xq6Y8-6kCz9Kku6WBZl17OErDqkofR9f3wJmpSJERF3Oe1Q9hdyBJrUwsIVmYyz2OuBZdaeBJPbKWWiWFcIypmmQLMIFMmyxkTjBnkB6S9KBd4SChKYXMHYl_nLGE21mCQ-eWsuXZ3aDgiHY_J_HWlgjFv4Dj-u_mEbHnY_T97zE9Ju66WeEY2zXtdvFXnwXSfu2Kbtw
linkProvider	IEEE
linkToHtml	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEA6lCurFRyu-zcGjazfJPpJjKZYWaym0YvFSsslE97Ir69aDv95ku1YFL15CCCSBGZLJTOb7BqErakIBgoWesjekF2gGnlWzPXiGaB0pEYa60vQoHo_5fC4mDXS9xsIAQJV8BjeuW_3l61wtXais496-sUOpb4RBQP0VWmt97_I44F-5Or7oDHvd6XRi7S11eCvb1LN_lVGprEh_93_776H2NxwPT9aGZh81IDtAOz-YBFtoUNNHpx-g8RMUuTd9yUtcs6c-4yozANv1dLoK_uFHWQEtXbFL3F2WuSO01FC00UP_dtYbeHWRBC-lPiutjK1LFhgTGg3a8IgnCWUyZlzGDsZDDZc8UIRJAEEMERoiHQkVJYTEhChgh6iZ5RkcIQw8NokVoi8TEhBDpVZA3IGWTFovWh-jlpPJ4nXFg7GoxXHy9_Al2hrM7keL0XB8d4q2nQrcrztlZ6hZFks4R5vqvUzfiotKjZ8DBJ7-
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+of+the+...+IEEE+International+Conference+on+Acoustics%2C+Speech+and+Signal+Processing+%281998%29&rft.atitle=Generalized+Zero-Shot+Learning+Using+Conditional+Wasserstein+Autoencoder&rft.au=Kim%2C+Junhan&rft.au=Shim%2C+Byonghyo&rft.date=2022-05-23&rft.pub=IEEE&rft.eissn=2379-190X&rft.spage=3413&rft.epage=3417&rft_id=info:doi/10.1109%2FICASSP43922.2022.9747741&rft.externalDocID=9747741