CP Tensor Decomposition with Cannot-Link Intermode Constraints

Tensor factorization is a methodology that is applied in a variety of fields, ranging from climate modeling to medical informatics. A tensor is an -way array that captures the relationship between objects. These multiway arrays can be factored to study the underlying bases present in the data. Two c...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Proceedings of the ... SIAM International Conference on Data Mining Ročník 2019; s. 711
Hlavní autoři: Henderson, Jette, Malin, Bradley A, Denny, Joshua C, Kho, Abel N, Sun, Jimeng, Ghosh, Joydeep, Ho, Joyce C
Médium: Journal Article
Jazyk:angličtina
Vydáno: United States 01.05.2019
ISSN:2167-0102
On-line přístup:Zjistit podrobnosti o přístupu
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Tensor factorization is a methodology that is applied in a variety of fields, ranging from climate modeling to medical informatics. A tensor is an -way array that captures the relationship between objects. These multiway arrays can be factored to study the underlying bases present in the data. Two challenges arising in tensor factorization are 1) the resulting factors can be noisy and highly overlapping with one another and 2) they may not map to insights within a domain. However, incorporating supervision to increase the number of insightful factors can be costly in terms of the time and domain expertise necessary for gathering labels or domain-specific constraints. To meet these challenges, we introduce CANDECOMP/PARAFAC (CP) tensor factorization with Cannot-Link Intermode Constraints (CP-CLIC), a framework that achieves succinct, diverse, interpretable factors. This is accomplished by gradually learning constraints that are verified with auxiliary information during the decomposition process. We demonstrate CP-CLIC's potential to extract sparse, diverse, and interpretable factors through experiments on simulated data and a real-world application in medical informatics.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2167-0102
DOI:10.1137/1.9781611975673.80