Elastic deep autoencoder for text embedding clustering by an improved graph regularization

Text clustering is a task for grouping extracted information of the text in different clusters, which has many applications in recommender systems, sentiment analysis, and more. Deep learning-based methods have become increasingly popular due to their high accuracy in identifying nonlinear structure...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Expert systems with applications Ročník 238; s. 121780
Hlavní autori: Daneshfar, Fatemeh, Soleymanbaigi, Sayvan, Nafisi, Ali, Yamini, Pedram
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Elsevier Ltd 15.03.2024
Predmet:
ISSN:0957-4174, 1873-6793
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract Text clustering is a task for grouping extracted information of the text in different clusters, which has many applications in recommender systems, sentiment analysis, and more. Deep learning-based methods have become increasingly popular due to their high accuracy in identifying nonlinear structures. They usually consist of two major parts: dimensionality reduction and clustering. Autoencoders are simple unsupervised neural networks used for better representation of low-dimensional data and have shown good performance in dealing with non-linear features. However, while they utilize the Frobenius norm to deal well with Gaussian noise, they are sensitive to outlier data and Laplacian noise. In this paper, a deep autoencoder with an adapted elastic loss for text embedding clustering (EDA-TEC) is proposed. The elastic loss is a combination of the Frobenius norm and L2,1-norm to consider both types of noises. Additionally, to maintain the high-dimensional data geometric structure, a modified graph regularization term based on the weighted cosine similarity measure is used. EDA-TEC also improves clustering results by considering the sparsity regularization of the manifold representation data. In this jointly end-to-end deep learning model, better representation and text clustering results are achieved with high accuracy on common datasets compared to existing methods.11https://github.com/safinal/text-embedding-clustering.
AbstractList Text clustering is a task for grouping extracted information of the text in different clusters, which has many applications in recommender systems, sentiment analysis, and more. Deep learning-based methods have become increasingly popular due to their high accuracy in identifying nonlinear structures. They usually consist of two major parts: dimensionality reduction and clustering. Autoencoders are simple unsupervised neural networks used for better representation of low-dimensional data and have shown good performance in dealing with non-linear features. However, while they utilize the Frobenius norm to deal well with Gaussian noise, they are sensitive to outlier data and Laplacian noise. In this paper, a deep autoencoder with an adapted elastic loss for text embedding clustering (EDA-TEC) is proposed. The elastic loss is a combination of the Frobenius norm and L2,1-norm to consider both types of noises. Additionally, to maintain the high-dimensional data geometric structure, a modified graph regularization term based on the weighted cosine similarity measure is used. EDA-TEC also improves clustering results by considering the sparsity regularization of the manifold representation data. In this jointly end-to-end deep learning model, better representation and text clustering results are achieved with high accuracy on common datasets compared to existing methods.11https://github.com/safinal/text-embedding-clustering.
ArticleNumber 121780
Author Nafisi, Ali
Daneshfar, Fatemeh
Soleymanbaigi, Sayvan
Yamini, Pedram
Author_xml – sequence: 1
  givenname: Fatemeh
  orcidid: 0000-0003-3150-3527
  surname: Daneshfar
  fullname: Daneshfar, Fatemeh
  email: f.daneshfar@uok.ac.ir
– sequence: 2
  givenname: Sayvan
  surname: Soleymanbaigi
  fullname: Soleymanbaigi, Sayvan
  email: s.soleymanbaigi@uok.ac.ir
– sequence: 3
  givenname: Ali
  orcidid: 0009-0008-5985-3266
  surname: Nafisi
  fullname: Nafisi, Ali
  email: alinafisi@uok.ac.ir
– sequence: 4
  givenname: Pedram
  orcidid: 0009-0001-1191-9579
  surname: Yamini
  fullname: Yamini, Pedram
BookMark eNp9kL1OwzAUhS0EEm3hBZj8Agn-SepEYkFV-ZEqscDCYjn2dXGVxpHtFsrTk1Amhk7nLt_VOd8UnXe-A4RuKMkpofPbTQ7xU-WMMJ5TRkVFztCEVoJnc1HzczQhdSmygoriEk1j3BBCBSFigt6XrYrJaWwAeqx2yUOnvYGArQ84wVfCsG3AGNetsW53MUEYz-aAVYfdtg9-Dwavg-o_cID1rlXBfavkfHeFLqxqI1z_5Qy9PSxfF0_Z6uXxeXG_yjQnJGUw9FCcmaY2DRSNLiio0hZCCaCWcd6UVa0JUTWDOSe8MEpYXlpdQ1lqZks-Q9Xxrw4-xgBWapd-G6SgXCspkaMjuZGjIzk6kkdHA8r-oX1wWxUOp6G7IwTDqL2DIKN2gzUwLoBO0nh3Cv8BVayEpw
CitedBy_id crossref_primary_10_3390_electronics14173426
crossref_primary_10_1016_j_ipm_2025_104409
crossref_primary_10_1002_ett_70177
crossref_primary_10_3390_math12071107
crossref_primary_10_1016_j_engappai_2024_109537
crossref_primary_10_1016_j_future_2024_107630
crossref_primary_10_1109_ACCESS_2025_3574066
crossref_primary_10_1371_journal_pone_0319747
crossref_primary_10_1007_s11042_024_19742_3
crossref_primary_10_1016_j_ins_2024_121482
crossref_primary_10_3390_math12142162
crossref_primary_10_1007_s10586_024_04549_6
crossref_primary_10_1016_j_neunet_2024_106979
crossref_primary_10_1007_s10462_023_10662_6
crossref_primary_10_1016_j_cie_2025_111048
crossref_primary_10_1109_ACCESS_2025_3604775
crossref_primary_10_1007_s10115_024_02203_6
crossref_primary_10_1016_j_neucom_2025_130896
crossref_primary_10_1007_s11042_024_18764_1
crossref_primary_10_3390_jmse13071231
crossref_primary_10_1007_s11235_024_01184_9
Cites_doi 10.1016/j.knosys.2022.108653
10.1016/j.neucom.2020.12.094
10.1016/j.ins.2022.10.052
10.1016/j.neucom.2022.10.002
10.1109/TMI.2022.3219126
10.1149/10701.1375ecst
10.1016/j.patcog.2021.108386
10.1111/j.1467-9868.2005.00503.x
10.1016/j.eswa.2022.119051
10.1016/j.eswa.2022.118565
10.1016/j.eswa.2022.118656
10.1186/s40537-022-00564-9
10.1007/978-981-16-9709-8_21
10.1016/j.patcog.2022.109144
10.1016/j.ins.2022.11.120
10.1016/j.eswa.2022.117193
10.1016/j.asoc.2021.107433
10.1145/235968.233324
10.1007/s10489-021-02524-x
10.1016/j.cam.2022.114877
10.1016/j.eswa.2022.119031
10.1016/j.eswa.2021.115729
10.1016/j.neunet.2023.03.026
10.1016/j.array.2021.100124
10.1016/j.neucom.2022.04.122
10.1016/0306-4573(88)90021-0
10.1016/j.knosys.2021.107236
10.21203/rs.3.rs-2317581/v1
10.1016/j.eswa.2022.117822
10.1155/2022/2749091
10.1007/s11042-021-11597-2
10.1007/s11042-022-12155-0
10.1016/j.neucom.2022.10.052
10.1016/j.eswa.2022.118937
10.1016/j.procs.2023.01.098
10.1007/s12559-021-09852-7
ContentType Journal Article
Copyright 2023 Elsevier Ltd
Copyright_xml – notice: 2023 Elsevier Ltd
DBID AAYXX
CITATION
DOI 10.1016/j.eswa.2023.121780
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1873-6793
ExternalDocumentID 10_1016_j_eswa_2023_121780
S0957417423022820
GroupedDBID --K
--M
.DC
.~1
0R~
13V
1B1
1RT
1~.
1~5
4.4
457
4G.
5GY
5VS
7-5
71M
8P~
9JN
9JO
AAAKF
AABNK
AACTN
AAEDT
AAEDW
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AARIN
AAXUO
AAYFN
ABBOA
ABFNM
ABMAC
ABMVD
ABUCO
ABYKQ
ACDAQ
ACGFS
ACHRH
ACNTT
ACRLP
ACZNC
ADBBV
ADEZE
ADTZH
AEBSH
AECPX
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGJBL
AGUBO
AGUMN
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJOXV
ALEQD
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
APLSM
AXJTR
BJAXD
BKOJK
BLXMC
BNSAS
CS3
DU5
EBS
EFJIC
EFLBG
EO8
EO9
EP2
EP3
F5P
FDB
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HAMUX
IHE
J1W
JJJVA
KOM
LG9
LY1
LY7
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
PQQKQ
Q38
ROL
RPZ
SDF
SDG
SDP
SDS
SES
SEW
SPC
SPCBC
SSB
SSD
SSL
SST
SSV
SSZ
T5K
TN5
~G-
29G
9DU
AAAKG
AAQXK
AATTM
AAXKI
AAYWO
AAYXX
ABJNI
ABKBG
ABUFD
ABWVN
ABXDB
ACLOT
ACNNM
ACRPL
ACVFH
ADCNI
ADJOM
ADMUD
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AGQPQ
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
ASPBG
AVWKF
AZFZN
CITATION
EFKBS
EJD
FEDTE
FGOYB
G-2
HLZ
HVGLF
HZ~
R2-
SBC
SET
WUQ
XPP
ZMT
~HD
ID FETCH-LOGICAL-c300t-e007a32db9dbe4bc41ea5f47a7e1f233b589c00a92e63034da7f35fc9e55c2f53
ISICitedReferencesCount 25
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001088597600001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0957-4174
IngestDate Sat Nov 29 07:09:16 EST 2025
Tue Nov 18 20:52:00 EST 2025
Fri Feb 23 02:36:07 EST 2024
IsPeerReviewed true
IsScholarly true
Keywords Text clustering
Graph regularization
Text embedding
Deep autoencoder
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c300t-e007a32db9dbe4bc41ea5f47a7e1f233b589c00a92e63034da7f35fc9e55c2f53
ORCID 0009-0008-5985-3266
0009-0001-1191-9579
0000-0003-3150-3527
ParticipantIDs crossref_citationtrail_10_1016_j_eswa_2023_121780
crossref_primary_10_1016_j_eswa_2023_121780
elsevier_sciencedirect_doi_10_1016_j_eswa_2023_121780
PublicationCentury 2000
PublicationDate 2024-03-15
PublicationDateYYYYMMDD 2024-03-15
PublicationDate_xml – month: 03
  year: 2024
  text: 2024-03-15
  day: 15
PublicationDecade 2020
PublicationTitle Expert systems with applications
PublicationYear 2024
Publisher Elsevier Ltd
Publisher_xml – name: Elsevier Ltd
References Zhou, Su, Li, Zhao, Li (b0245) 2023; 213
Cha, Mun, Roh (b0050) 2023
Walek, Fajmon (b0180) 2023; 212
Murfi, Rosaline, Hariadi (b0100) 2022; 13
Shi, Kang, Choo, Reddy (b0140) 2018
Béjar Alonso, J. (2013). K-means vs Mini Batch K-means: a comparison.
Tong, Wei, Qi, Yao, Zhang, Teng (b0160) 2023; 421
Wang, Wu, Ren, Zhang, Zhou (b0185) 2023; 213
Daneshfar, Jamshidi (b0060) 2023; 163
Wu, Liu, Zhao, Kale, Bui, Yu, Chang (b0205) 2023
Bagirov, Aliguliyev, Sultanova (b0015) 2023; 135
Tang, Xu, Jiang, Su, Sun, Luo (b0155) 2022; 1–1
Ay, Özbakır, Kulluk, Gülmez, Öztürk, Özer (b0010) 2023; 211
,
Subakti, Murfi, Hariadi (b0145) 2022; 9
.
Xiao, Wang, Guo (b0210) 2022; 1–1
Chen, Y., Li, D., & Ye, M. (2022). A Multi-label Propagation Algorithm for Community Detection Based on Average Mutual Information.
Occhipinti, Rogers, Angione (b0105) 2022; 201
Bai, Huang, Qin, Chen, Lin (b0020) 2023; 623
Xu, Gao, Zhang, Li, de Albuquerque (b0215) 2021; 108
Salahian, Tab, Seyedi, Chavoshinejad (b0120) 2023; 214
Yang, Xu (b0220) 2021; 227
Liu, Wang, Chen, Wang, Li (b0095) 2022; 245
Vinoth, Prabhavathy (b0175) 2022; 107
Wang, Zhang (b0190) 2023; 515
Li, Leng, Cheng, Basu, Jiao (b0085) 2022
Zhang, H., Zhang, D., Gao, Z., & Zhang, H. (2021). Joint segmentation and quantification of main coronary vessels using dual-branch multi-scale attention network. Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part I 24.
Diallo, Hu, Li, Khan, Liang, Zhao (b0065) 2021; 433
Cai, Wang, Guo (b0040) 2021; 186
Shao, Chen, Yuan, Wang (b0135) 2023; 517
Zou, Hastie (b0250) 2005; 67
Zhang, Ramakrishnan, Livny (b0235) 1996; 25
Wu, J., Chen, X., Cai, S., Li, Y., & Wu, H. (2022, 2022//). Deep Structured Clustering of Short Text. Big Data, Singapore.
Seyedi, Tab, Lotfi, Salahian, Chavoshinejad (b0130) 2023; 621
Sun, Shen, Gao, Ouyang, Cheng (b0150) 2017
Berahmand, K., Daneshfar, F., Dorosti, M., & Aghajani, M. J. (2022). An Improved Deep Text Clustering via Local Manifold of an Autoencoder Embedding.
Revathy, Pillai, Daneshfar (b0110) 2023; 218
Sahu, Rani (b0115) 2022; 206
Aghdam, Zanjani (b0005) 2021; 58
Zhou, Wang, Zhu (b0240) 2022; 52
Wang, Li, Wang, Tao, Liu (b0195) 2022; 81
Veiga Simão, Costa Ferreira, Pereira, Oliveira, Paulino, Rosa, Trancoso (b0170) 2021; 13
Hosseini, Varzaneh (b0075) 2022; 81
Febrissy, Salah, Ailem, Nadif (b0070) 2022; 495
Salton, Buckley (b0125) 1988; 24
Cai, Wang, Xu, Guo (b0045) 2022; 123
Li, Han (b0080) 2013
Zhang, Gao, Zhang, Hau, Zhang (b0225) 2022; 42
Xiao (10.1016/j.eswa.2023.121780_b0210) 2022; 1–1
Diallo (10.1016/j.eswa.2023.121780_b0065) 2021; 433
Li (10.1016/j.eswa.2023.121780_b0085) 2022
Wang (10.1016/j.eswa.2023.121780_b0195) 2022; 81
10.1016/j.eswa.2023.121780_b0030
Cha (10.1016/j.eswa.2023.121780_b0050) 2023
Salton (10.1016/j.eswa.2023.121780_b0125) 1988; 24
10.1016/j.eswa.2023.121780_b0055
Hosseini (10.1016/j.eswa.2023.121780_b0075) 2022; 81
10.1016/j.eswa.2023.121780_b0230
Xu (10.1016/j.eswa.2023.121780_b0215) 2021; 108
Yang (10.1016/j.eswa.2023.121780_b0220) 2021; 227
Sahu (10.1016/j.eswa.2023.121780_b0115) 2022; 206
Wu (10.1016/j.eswa.2023.121780_b0205) 2023
Veiga Simão (10.1016/j.eswa.2023.121780_b0170) 2021; 13
Walek (10.1016/j.eswa.2023.121780_b0180) 2023; 212
Zhou (10.1016/j.eswa.2023.121780_b0245) 2023; 213
Ay (10.1016/j.eswa.2023.121780_b0010) 2023; 211
Sun (10.1016/j.eswa.2023.121780_b0150) 2017
Subakti (10.1016/j.eswa.2023.121780_b0145) 2022; 9
Revathy (10.1016/j.eswa.2023.121780_b0110) 2023; 218
Shao (10.1016/j.eswa.2023.121780_b0135) 2023; 517
Tang (10.1016/j.eswa.2023.121780_b0155) 2022; 1–1
Bagirov (10.1016/j.eswa.2023.121780_b0015) 2023; 135
Vinoth (10.1016/j.eswa.2023.121780_b0175) 2022; 107
Zhang (10.1016/j.eswa.2023.121780_b0225) 2022; 42
Liu (10.1016/j.eswa.2023.121780_b0095) 2022; 245
Bai (10.1016/j.eswa.2023.121780_b0020) 2023; 623
Tong (10.1016/j.eswa.2023.121780_b0160) 2023; 421
Daneshfar (10.1016/j.eswa.2023.121780_b0060) 2023; 163
Zou (10.1016/j.eswa.2023.121780_b0250) 2005; 67
10.1016/j.eswa.2023.121780_b0200
Shi (10.1016/j.eswa.2023.121780_b0140) 2018
10.1016/j.eswa.2023.121780_b0025
Cai (10.1016/j.eswa.2023.121780_b0045) 2022; 123
Zhou (10.1016/j.eswa.2023.121780_b0240) 2022; 52
Murfi (10.1016/j.eswa.2023.121780_b0100) 2022; 13
Aghdam (10.1016/j.eswa.2023.121780_b0005) 2021; 58
Cai (10.1016/j.eswa.2023.121780_b0040) 2021; 186
Occhipinti (10.1016/j.eswa.2023.121780_b0105) 2022; 201
Wang (10.1016/j.eswa.2023.121780_b0190) 2023; 515
Zhang (10.1016/j.eswa.2023.121780_b0235) 1996; 25
Febrissy (10.1016/j.eswa.2023.121780_b0070) 2022; 495
Wang (10.1016/j.eswa.2023.121780_b0185) 2023; 213
Salahian (10.1016/j.eswa.2023.121780_b0120) 2023; 214
Seyedi (10.1016/j.eswa.2023.121780_b0130) 2023; 621
Li (10.1016/j.eswa.2023.121780_b0080) 2013
References_xml – volume: 107
  start-page: 1375
  year: 2022
  ident: b0175
  article-title: A short text clustering approaches in social media
  publication-title: ECS Transactions
– reference: Wu, J., Chen, X., Cai, S., Li, Y., & Wu, H. (2022, 2022//). Deep Structured Clustering of Short Text. Big Data, Singapore.
– volume: 421
  year: 2023
  ident: b0160
  article-title: A majorization–minimization based solution to penalized nonnegative matrix factorization with orthogonal regularization
  publication-title: Journal of Computational and Applied Mathematics
– reference: Chen, Y., Li, D., & Ye, M. (2022). A Multi-label Propagation Algorithm for Community Detection Based on Average Mutual Information.
– reference: Berahmand, K., Daneshfar, F., Dorosti, M., & Aghajani, M. J. (2022). An Improved Deep Text Clustering via Local Manifold of an Autoencoder Embedding.
– volume: 13
  year: 2022
  ident: b0100
  article-title: Deep autoencoder-based fuzzy c-means for topic detection
  publication-title: Array
– volume: 433
  start-page: 96
  year: 2021
  end-page: 107
  ident: b0065
  article-title: Deep embedding clustering based on contractive autoencoder
  publication-title: Neurocomputing
– volume: 245
  year: 2022
  ident: b0095
  article-title: Bilateral discriminative autoencoder model orienting co-representation learning
  publication-title: Knowledge-Based Systems
– volume: 515
  start-page: 157
  year: 2023
  end-page: 173
  ident: b0190
  article-title: Deep NMF topic modeling
  publication-title: Neurocomputing
– volume: 42
  start-page: 864
  year: 2022
  end-page: 879
  ident: b0225
  article-title: Progressive perception learning for main coronary segmentation in X-ray angiography
  publication-title: IEEE Transactions on Medical Imaging
– volume: 214
  year: 2023
  ident: b0120
  article-title: Deep autoencoder-like NMF with contrastive regularization and feature relationship preservation
  publication-title: Expert Systems with Applications
– volume: 135
  year: 2023
  ident: b0015
  article-title: Finding compact and well-separated clusters: Clustering using silhouette coefficients
  publication-title: Pattern Recognition
– volume: 218
  start-page: 1196
  year: 2023
  end-page: 1208
  ident: b0110
  article-title: LyEmoBERT: Classification of lyrics’ emotion and recommendation using a pre-trained model
  publication-title: Procedia Computer Science
– volume: 123
  year: 2022
  ident: b0045
  article-title: Unsupervised deep clustering via contractive feature representation and focal loss
  publication-title: Pattern Recognition
– volume: 186
  year: 2021
  ident: b0040
  article-title: Unsupervised embedded feature learning for deep clustering with stacked sparse auto-encoder
  publication-title: Expert Systems with Applications
– volume: 621
  start-page: 562
  year: 2023
  end-page: 579
  ident: b0130
  article-title: Elastic adversarial deep nonnegative matrix factorization for matrix completion
  publication-title: Information Sciences
– year: 2023
  ident: b0050
  article-title: Learning to generate text-grounded mask for open-world semantic segmentation from only image-text pairs
  publication-title: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
– volume: 9
  start-page: 15
  year: 2022
  ident: b0145
  article-title: The performance of BERT as data representation of text clustering
  publication-title: Journal of Big Data
– volume: 25
  start-page: 103
  year: 1996
  end-page: 114
  ident: b0235
  article-title: BIRCH: An efficient data clustering method for very large databases
  publication-title: ACM Sigmod Record
– year: 2013
  ident: b0080
  article-title: Distance weighted cosine similarity measure for text classification
  publication-title: International conference on intelligent data engineering and automated learning
– year: 2022
  ident: b0085
  article-title: Dual-graph global and local concept factorization for data clustering
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
– volume: 108
  year: 2021
  ident: b0215
  article-title: Video salient object detection using dual-stream spatiotemporal attention
  publication-title: Applied Soft Computing
– volume: 211
  year: 2023
  ident: b0010
  article-title: FC-Kmeans: Fixed-centered K-means algorithm
  publication-title: Expert Systems with Applications
– volume: 67
  start-page: 301
  year: 2005
  end-page: 320
  ident: b0250
  article-title: Regularization and variable selection via the elastic net
  publication-title: Journal of the Royal Statistical Society: Series B (Statistical Methodology)
– reference: ,
– volume: 495
  start-page: 105
  year: 2022
  end-page: 117
  ident: b0070
  article-title: Improving NMF clustering by leveraging contextual relationships among words
  publication-title: Neurocomputing
– year: 2023
  ident: b0205
  article-title: Uncovering the disentanglement capability in text-to-image diffusion models
  publication-title: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
– volume: 81
  start-page: 10861
  year: 2022
  end-page: 10881
  ident: b0075
  article-title: Deep text clustering using stacked AutoEncoder
  publication-title: Multimedia Tools and Applications
– volume: 212
  year: 2023
  ident: b0180
  article-title: A hybrid recommender system for an online store using a fuzzy expert system
  publication-title: Expert Systems with Applications
– volume: 201
  year: 2022
  ident: b0105
  article-title: A pipeline and comparative study of 12 machine learning models for text classification
  publication-title: Expert Systems with Applications
– volume: 1–1
  year: 2022
  ident: b0155
  article-title: Selecting the best part from multiple laplacian autoencoders for multi-view subspace clustering
  publication-title: IEEE Transactions on Knowledge and Data Engineering
– year: 2018
  ident: b0140
  article-title: Short-text topic modeling via non-negative matrix factorization enriched with local word-context correlations
  publication-title: Proceedings of the 2018 World Wide Web Conference
– volume: 81
  start-page: 34309
  year: 2022
  end-page: 34324
  ident: b0195
  article-title: Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
  publication-title: Multimedia Tools and Applications
– volume: 213
  year: 2023
  ident: b0185
  article-title: Multi-scale deep multi-view subspace clustering with self-weighting fusion and structure preserving
  publication-title: Expert Systems with Applications
– volume: 24
  start-page: 513
  year: 1988
  end-page: 523
  ident: b0125
  article-title: Term-weighting approaches in automatic text retrieval
  publication-title: Information Processing & Management
– volume: 623
  start-page: 40
  year: 2023
  end-page: 55
  ident: b0020
  article-title: HVAE: A deep generative model via hierarchical variational auto-encoder for multi-view document modeling
  publication-title: Information Sciences
– volume: 227
  year: 2021
  ident: b0220
  article-title: Orthogonal nonnegative matrix factorization using a novel deep autoencoder network
  publication-title: Knowledge-Based Systems
– volume: 213
  year: 2023
  ident: b0245
  article-title: Community detection based on unsupervised attributed network embedding
  publication-title: Expert Systems with Applications
– volume: 517
  start-page: 62
  year: 2023
  end-page: 70
  ident: b0135
  article-title: Projection concept factorization with self-representation for data clustering
  publication-title: Neurocomputing
– year: 2017
  ident: b0150
  article-title: A non-negative symmetric encoder-decoder approach for community detection
  publication-title: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
– volume: 13
  start-page: 736
  year: 2021
  end-page: 750
  ident: b0170
  article-title: Prosociality in cyberspace: Developing emotion and behavioral regulation to decrease aggressive communication
  publication-title: Cognitive Computation
– volume: 52
  start-page: 5457
  year: 2022
  end-page: 5474
  ident: b0240
  article-title: Feature selection based on mutual information with correlation coefficient
  publication-title: Applied Intelligence
– reference: .
– volume: 1–1
  year: 2022
  ident: b0210
  article-title: SGAE: Stacked graph autoencoder for deep clustering
  publication-title: IEEE Transactions on Big Data
– reference: Zhang, H., Zhang, D., Gao, Z., & Zhang, H. (2021). Joint segmentation and quantification of main coronary vessels using dual-branch multi-scale attention network. Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part I 24.
– volume: 58
  year: 2021
  ident: b0005
  article-title: A novel regularized asymmetric non-negative matrix factorization for text clustering
  publication-title: Information Processing & Management
– volume: 163
  start-page: 108
  year: 2023
  end-page: 121
  ident: b0060
  article-title: An octonion-based nonlinear echo state network for speech emotion recognition in Metaverse
  publication-title: Neural Networks
– reference: Béjar Alonso, J. (2013). K-means vs Mini Batch K-means: a comparison.
– volume: 206
  year: 2022
  ident: b0115
  article-title: A neighbour-similarity based community discovery algorithm
  publication-title: Expert Systems with Applications
– volume: 245
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0095
  article-title: Bilateral discriminative autoencoder model orienting co-representation learning
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2022.108653
– volume: 433
  start-page: 96
  year: 2021
  ident: 10.1016/j.eswa.2023.121780_b0065
  article-title: Deep embedding clustering based on contractive autoencoder
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2020.12.094
– year: 2013
  ident: 10.1016/j.eswa.2023.121780_b0080
  article-title: Distance weighted cosine similarity measure for text classification
– volume: 1–1
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0210
  article-title: SGAE: Stacked graph autoencoder for deep clustering
  publication-title: IEEE Transactions on Big Data
– year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0085
  article-title: Dual-graph global and local concept factorization for data clustering
– volume: 623
  start-page: 40
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0020
  article-title: HVAE: A deep generative model via hierarchical variational auto-encoder for multi-view document modeling
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2022.10.052
– volume: 515
  start-page: 157
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0190
  article-title: Deep NMF topic modeling
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2022.10.002
– year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0050
  article-title: Learning to generate text-grounded mask for open-world semantic segmentation from only image-text pairs
– volume: 42
  start-page: 864
  issue: 3
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0225
  article-title: Progressive perception learning for main coronary segmentation in X-ray angiography
  publication-title: IEEE Transactions on Medical Imaging
  doi: 10.1109/TMI.2022.3219126
– volume: 1–1
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0155
  article-title: Selecting the best part from multiple laplacian autoencoders for multi-view subspace clustering
  publication-title: IEEE Transactions on Knowledge and Data Engineering
– volume: 107
  start-page: 1375
  issue: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0175
  article-title: A short text clustering approaches in social media
  publication-title: ECS Transactions
  doi: 10.1149/10701.1375ecst
– volume: 123
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0045
  article-title: Unsupervised deep clustering via contractive feature representation and focal loss
  publication-title: Pattern Recognition
  doi: 10.1016/j.patcog.2021.108386
– volume: 67
  start-page: 301
  issue: 2
  year: 2005
  ident: 10.1016/j.eswa.2023.121780_b0250
  article-title: Regularization and variable selection via the elastic net
  publication-title: Journal of the Royal Statistical Society: Series B (Statistical Methodology)
  doi: 10.1111/j.1467-9868.2005.00503.x
– volume: 214
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0120
  article-title: Deep autoencoder-like NMF with contrastive regularization and feature relationship preservation
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.119051
– volume: 212
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0180
  article-title: A hybrid recommender system for an online store using a fuzzy expert system
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.118565
– ident: 10.1016/j.eswa.2023.121780_b0230
– volume: 211
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0010
  article-title: FC-Kmeans: Fixed-centered K-means algorithm
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.118656
– volume: 9
  start-page: 15
  issue: 1
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0145
  article-title: The performance of BERT as data representation of text clustering
  publication-title: Journal of Big Data
  doi: 10.1186/s40537-022-00564-9
– year: 2018
  ident: 10.1016/j.eswa.2023.121780_b0140
  article-title: Short-text topic modeling via non-negative matrix factorization enriched with local word-context correlations
– ident: 10.1016/j.eswa.2023.121780_b0200
  doi: 10.1007/978-981-16-9709-8_21
– volume: 135
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0015
  article-title: Finding compact and well-separated clusters: Clustering using silhouette coefficients
  publication-title: Pattern Recognition
  doi: 10.1016/j.patcog.2022.109144
– volume: 621
  start-page: 562
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0130
  article-title: Elastic adversarial deep nonnegative matrix factorization for matrix completion
  publication-title: Information Sciences
  doi: 10.1016/j.ins.2022.11.120
– volume: 58
  issue: 6
  year: 2021
  ident: 10.1016/j.eswa.2023.121780_b0005
  article-title: A novel regularized asymmetric non-negative matrix factorization for text clustering
  publication-title: Information Processing & Management
– volume: 201
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0105
  article-title: A pipeline and comparative study of 12 machine learning models for text classification
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.117193
– volume: 108
  year: 2021
  ident: 10.1016/j.eswa.2023.121780_b0215
  article-title: Video salient object detection using dual-stream spatiotemporal attention
  publication-title: Applied Soft Computing
  doi: 10.1016/j.asoc.2021.107433
– volume: 25
  start-page: 103
  issue: 2
  year: 1996
  ident: 10.1016/j.eswa.2023.121780_b0235
  article-title: BIRCH: An efficient data clustering method for very large databases
  publication-title: ACM Sigmod Record
  doi: 10.1145/235968.233324
– volume: 52
  start-page: 5457
  issue: 5
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0240
  article-title: Feature selection based on mutual information with correlation coefficient
  publication-title: Applied Intelligence
  doi: 10.1007/s10489-021-02524-x
– volume: 421
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0160
  article-title: A majorization–minimization based solution to penalized nonnegative matrix factorization with orthogonal regularization
  publication-title: Journal of Computational and Applied Mathematics
  doi: 10.1016/j.cam.2022.114877
– volume: 213
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0185
  article-title: Multi-scale deep multi-view subspace clustering with self-weighting fusion and structure preserving
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.119031
– year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0205
  article-title: Uncovering the disentanglement capability in text-to-image diffusion models
– volume: 186
  year: 2021
  ident: 10.1016/j.eswa.2023.121780_b0040
  article-title: Unsupervised embedded feature learning for deep clustering with stacked sparse auto-encoder
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2021.115729
– volume: 163
  start-page: 108
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0060
  article-title: An octonion-based nonlinear echo state network for speech emotion recognition in Metaverse
  publication-title: Neural Networks
  doi: 10.1016/j.neunet.2023.03.026
– volume: 13
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0100
  article-title: Deep autoencoder-based fuzzy c-means for topic detection
  publication-title: Array
  doi: 10.1016/j.array.2021.100124
– volume: 495
  start-page: 105
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0070
  article-title: Improving NMF clustering by leveraging contextual relationships among words
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2022.04.122
– volume: 24
  start-page: 513
  issue: 5
  year: 1988
  ident: 10.1016/j.eswa.2023.121780_b0125
  article-title: Term-weighting approaches in automatic text retrieval
  publication-title: Information Processing & Management
  doi: 10.1016/0306-4573(88)90021-0
– year: 2017
  ident: 10.1016/j.eswa.2023.121780_b0150
  article-title: A non-negative symmetric encoder-decoder approach for community detection
– volume: 227
  year: 2021
  ident: 10.1016/j.eswa.2023.121780_b0220
  article-title: Orthogonal nonnegative matrix factorization using a novel deep autoencoder network
  publication-title: Knowledge-Based Systems
  doi: 10.1016/j.knosys.2021.107236
– ident: 10.1016/j.eswa.2023.121780_b0030
  doi: 10.21203/rs.3.rs-2317581/v1
– volume: 206
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0115
  article-title: A neighbour-similarity based community discovery algorithm
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.117822
– ident: 10.1016/j.eswa.2023.121780_b0055
  doi: 10.1155/2022/2749091
– volume: 81
  start-page: 34309
  issue: 24
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0195
  article-title: Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
  publication-title: Multimedia Tools and Applications
  doi: 10.1007/s11042-021-11597-2
– volume: 81
  start-page: 10861
  issue: 8
  year: 2022
  ident: 10.1016/j.eswa.2023.121780_b0075
  article-title: Deep text clustering using stacked AutoEncoder
  publication-title: Multimedia Tools and Applications
  doi: 10.1007/s11042-022-12155-0
– volume: 517
  start-page: 62
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0135
  article-title: Projection concept factorization with self-representation for data clustering
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2022.10.052
– volume: 213
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0245
  article-title: Community detection based on unsupervised attributed network embedding
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2022.118937
– ident: 10.1016/j.eswa.2023.121780_b0025
– volume: 218
  start-page: 1196
  year: 2023
  ident: 10.1016/j.eswa.2023.121780_b0110
  article-title: LyEmoBERT: Classification of lyrics’ emotion and recommendation using a pre-trained model
  publication-title: Procedia Computer Science
  doi: 10.1016/j.procs.2023.01.098
– volume: 13
  start-page: 736
  issue: 3
  year: 2021
  ident: 10.1016/j.eswa.2023.121780_b0170
  article-title: Prosociality in cyberspace: Developing emotion and behavioral regulation to decrease aggressive communication
  publication-title: Cognitive Computation
  doi: 10.1007/s12559-021-09852-7
SSID ssj0017007
Score 2.5383718
Snippet Text clustering is a task for grouping extracted information of the text in different clusters, which has many applications in recommender systems, sentiment...
SourceID crossref
elsevier
SourceType Enrichment Source
Index Database
Publisher
StartPage 121780
SubjectTerms Deep autoencoder
Graph regularization
Text clustering
Text embedding
Title Elastic deep autoencoder for text embedding clustering by an improved graph regularization
URI https://dx.doi.org/10.1016/j.eswa.2023.121780
Volume 238
WOSCitedRecordID wos001088597600001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: AIEXJ
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9NAEF6FlgMX3ojy0h64Ra782qz3WKFUgFCEREERF2tfBleJEzkP2r_Ar2bGu7aTAhU9cLEsy14_5tPs7Hi-bwh5zZICJibJAx2xLMCWxoEUXASFsbDU1pEwjaTQlw98MsmmU_FxMPjZcmG2M15V2cWFWP5XU8MxMDZSZ29g7m5QOAD7YHTYgtlh-0-GH0M8jCqsxtrlUG7WC5SqRMWIpqAQfPHQzpU1DZlFzzYolIC7CoWYkDRZL7YQhDZC1sO66VRfe67mXhYfJZLXXgi6pcjt_AzvM-DgS78Xroz7FALbue0S0J8W4JHmslKy_Fa69PTltkfrRBblqnQsnLLzThLFUFxlsanlfDdrEadYtuV4m136kQdp5Dr0tJ44dkIv3pdGsFpyXZ5-c_Mu43B-bFc_UDsqTo77k_c1ta_MdV0FYlvcdp7jGDmOkbsxbpHDmDMBPvPw5N14-r77J8VDR75vn9xTsFy14NUn-XOYsxO6nN0nd_2ag544rDwgA1s9JPfafh7Uu_dH5KuHDkXo0B3oUIAORejQDjq0hw5Vl1RWtIUObaBD96HzmHw-HZ-9eRv43huBTsJwHVh4WZnERgmjbKp0GlnJipRLbqMiThLFMqHDUIrYjiAKSo3kRcIKLSxjOi5Y8oQcVIvKPiV0hGEwU8aqQsO0nAnORgYTDZFQ2gh5RKL2U-XaC9Njf5RZ_ncjHZFhd83SybJcezZrLZD7wNIFjDkA6prrnt3oLs_JnR7pL8jBut7Yl-S23q7LVf3Ko-kXXlOdvQ
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Elastic+deep+autoencoder+for+text+embedding+clustering+by+an+improved+graph+regularization&rft.jtitle=Expert+systems+with+applications&rft.au=Daneshfar%2C+Fatemeh&rft.au=Soleymanbaigi%2C+Sayvan&rft.au=Nafisi%2C+Ali&rft.au=Yamini%2C+Pedram&rft.date=2024-03-15&rft.issn=0957-4174&rft.volume=238&rft.spage=121780&rft_id=info:doi/10.1016%2Fj.eswa.2023.121780&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_eswa_2023_121780
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0957-4174&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0957-4174&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0957-4174&client=summon