Deterministic convergence analysis for regularized long short-term memory and its application to regression and multi-classification problems


Detailed bibliography
Published in: Engineering applications of artificial intelligence, Volume 133, p. 108444
Main authors: Kang, Qian; Yu, Dengxiu; Cheong, Kang Hao; Wang, Zhen
Format: Journal Article
Language: English
Published: Elsevier Ltd, 01.07.2024
Subjects:
ISSN: 0952-1976, 1873-6769
Abstract Long short-term memory (LSTM) is a recurrent neural network (RNN) architecture designed to address the vanishing and exploding gradient problems of traditional RNNs. In recent years, LSTM has become a state-of-the-art model for solving various machine-learning problems. This paper proposes a novel regularized LSTM trained with the batch gradient method. Specifically, an L2 regularization term is appended to the objective function, where it acts as a systematic external force that controls excessive weight growth in the network and helps prevent overfitting. In addition, a rigorous convergence analysis of the proposed method is carried out: monotonicity, weak convergence, and strong convergence results are obtained. Finally, comparative simulations are conducted on benchmark data sets for regression and classification problems, and the simulation results verify the effectiveness of the method.
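Illustrative sketch (not part of the catalog record and not the authors' implementation): the abstract describes appending an L2 penalty to the objective and minimizing it with the batch gradient method. The snippet below shows that idea on a plain linear least-squares model instead of an LSTM, purely as a stand-in. The function names, the toy data, the penalty weight `lam`, and the learning rate `lr` are all assumptions chosen for illustration. It records the regularized objective after every full-batch update so that the monotone decrease mentioned in the abstract can be observed on this toy problem.

```python
# Batch gradient descent on an L2-regularized least-squares objective.
# Assumption: a linear model stands in for the LSTM; all names are hypothetical.

def predict(w, x):
    """Linear prediction w . x for one sample."""
    return sum(wi * xi for wi, xi in zip(w, x))


def regularized_loss(w, xs, ys, lam):
    """Half mean squared error plus the L2 penalty lam * ||w||^2."""
    n = len(xs)
    data_term = sum((predict(w, x) - y) ** 2 for x, y in zip(xs, ys)) / (2 * n)
    penalty = lam * sum(wi * wi for wi in w)
    return data_term + penalty


def batch_gradient(w, xs, ys, lam):
    """Gradient of regularized_loss computed over the whole batch."""
    n = len(xs)
    grad = [2.0 * lam * wi for wi in w]  # gradient of the L2 penalty
    for x, y in zip(xs, ys):
        err = predict(w, x) - y
        for j, xj in enumerate(x):
            grad[j] += err * xj / n      # gradient of the data term
    return grad


def train(xs, ys, lam=0.01, lr=0.1, epochs=200):
    """Run full-batch updates and return the weights and the loss history."""
    w = [0.0] * len(xs[0])
    history = [regularized_loss(w, xs, ys, lam)]
    for _ in range(epochs):
        g = batch_gradient(w, xs, ys, lam)
        w = [wi - lr * gi for wi, gi in zip(w, g)]
        history.append(regularized_loss(w, xs, ys, lam))
    return w, history


# Toy regression data generated by y = 1 + 2 * t (first feature is a bias term).
xs = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
ys = [1.0, 3.0, 5.0, 7.0]
weights, losses = train(xs, ys)
```

On this toy problem the chosen step size is small enough that the recorded objective never increases, and a larger `lam` yields weights with a smaller norm, which is the weight-growth control the abstract attributes to the penalty. Neither observation substitutes for the paper's convergence proofs for the LSTM case.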
ArticleNumber 108444
Author Yu, Dengxiu
Cheong, Kang Hao
Kang, Qian
Wang, Zhen
Author_xml – sequence: 1
  givenname: Qian
  surname: Kang
  fullname: Kang, Qian
  email: kangqian0373@126.com
  organization: School of the Cybersecurity, Northwestern Polytechnical University, Xi’an, 710072, China
– sequence: 2
  givenname: Dengxiu
  orcidid: 0000-0003-1803-3946
  surname: Yu
  fullname: Yu, Dengxiu
  email: yudengxiu@126.com
  organization: School of Artificial Intelligence, Optics and Electronics (iOPEN), Northwestern Polytechnical University, Xi’an, 710072, China
– sequence: 3
  givenname: Kang Hao
  surname: Cheong
  fullname: Cheong, Kang Hao
  email: kanghao_cheong@sutd.edu.sg
  organization: Science, Mathematics and Technology Cluster, Singapore University of Technology and Design, 8 Somapah Road, S487372, Singapore
– sequence: 4
  givenname: Zhen
  surname: Wang
  fullname: Wang, Zhen
  email: zhenwang0@gmail.com
  organization: School of Artificial Intelligence, Optics and Electronics (iOPEN), Northwestern Polytechnical University, Xi’an, 710072, China
CitedBy_id crossref_primary_10_1016_j_ijhydene_2025_151304
crossref_primary_10_1016_j_oceaneng_2025_120676
crossref_primary_10_1007_s11071_025_11792_y
Cites_doi 10.1016/j.neucom.2013.08.005
10.1109/72.279181
10.1109/ACCESS.2022.3228600
10.1007/s11042-020-09198-6
10.1109/TNNLS.2012.2197412
10.1137/S0097539792240406
10.1109/TCYB.2019.2950105
10.1016/j.jhydrol.2023.129229
10.1162/089976600300015763
10.1007/s00521-014-1730-x
10.1162/089976600300015015
10.1016/j.patrec.2022.04.038
10.3390/hydrology9020036
10.1016/j.watres.2022.119100
10.1016/S0925-2312(01)00706-8
10.1088/1361-6420/33/1/015004
10.1109/TITS.2020.3008612
10.1002/int.22590
10.1109/ACCESS.2020.3039539
10.1007/s10589-017-9916-7
10.1016/j.ins.2021.12.039
10.1109/TKDE.2017.2720734
10.1016/j.patcog.2022.108785
10.1111/j.2517-6161.1996.tb02080.x
10.1007/s10489-021-02518-9
10.3390/math10030488
10.1186/s40537-021-00444-8
10.1109/TMM.2020.2978637
10.1016/j.ins.2020.12.014
10.1016/j.neunet.2012.04.013
10.1109/TIP.2013.2262292
10.1016/j.jco.2009.01.002
10.1016/j.asoc.2017.07.059
10.1162/neco.1997.9.8.1735
10.1109/72.883412
10.1109/72.963769
10.1016/j.ins.2021.11.044
10.1016/j.knosys.2022.109526
10.32604/iasc.2020.013918
10.1109/TITS.2011.2119483
10.1016/j.neucom.2012.02.029
ContentType Journal Article
Copyright 2024 Elsevier Ltd
Copyright_xml – notice: 2024 Elsevier Ltd
DBID AAYXX
CITATION
DOI 10.1016/j.engappai.2024.108444
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
Computer Science
EISSN 1873-6769
ExternalDocumentID 10_1016_j_engappai_2024_108444
S095219762400602X
GrantInformation_xml – fundername: Natural Science Foundation of Shaanxi Province
  grantid: 2024JC-YBQN-0663
  funderid: http://dx.doi.org/10.13039/501100007128
– fundername: Tencent Foundation and XPLORER PRIZE
– fundername: National Natural Science Foundation of China
  grantid: U22B2036; 11931015; 62373302; 62333009
  funderid: http://dx.doi.org/10.13039/501100001809
– fundername: Fok Ying-Tong Education Foundation, China
  grantid: 171105
– fundername: National Science Fund for Distinguished Young Scholarship of China
  grantid: 62025602
– fundername: Technology Innovation Leading Program of Shaanxi
  grantid: 2023GXLH-086
ISICitedReferencesCount 7
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001236651400001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0952-1976
IngestDate Tue Nov 18 21:01:22 EST 2025
Sat Nov 29 03:41:18 EST 2025
Tue Jun 18 08:50:47 EDT 2024
IsPeerReviewed true
IsScholarly true
Keywords Long short-term memory
Batch gradient algorithm
Regularization
Convergence
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c312t-962f06aa1a641c5419329bc5213ab1acd810af4e869178231717be8b2bf3bcb03
ORCID 0000-0003-1803-3946
ParticipantIDs crossref_primary_10_1016_j_engappai_2024_108444
crossref_citationtrail_10_1016_j_engappai_2024_108444
elsevier_sciencedirect_doi_10_1016_j_engappai_2024_108444
PublicationCentury 2000
PublicationDate July 2024
2024-07-00
PublicationDateYYYYMMDD 2024-07-01
PublicationDate_xml – month: 07
  year: 2024
  text: July 2024
PublicationDecade 2020
PublicationTitle Engineering applications of artificial intelligence
PublicationYear 2024
Publisher Elsevier Ltd
Publisher_xml – name: Elsevier Ltd
References Wollmer, Blaschke, Schindl (b37) 2011; 12
Saito, Nakano (b27) 2000; 12
Wang, Hao, Zhang (b33) 2022; 588
Husken, Stagge (b15) 2003; 50
Donnelly, Abolfathi, Pearson (b7) 2022; 225
Chen, Wang, Xu (b4) 2022; 52
Gers, Schmidhuber, Cummins (b10) 2000; 12
Ludwig, Nunes, Araujo (b22) 2014; 124
Bengio, Simard, Frasconi (b2) 1994; 5
Wang, Wu, Zurada (b36) 2012; 33
Hochreiter, Schmidhuber (b14) 1997; 9
Lee, Kim, Lee (b19) 2020; 23
Zhang, Wu, Yao (b45) 2012; 89
Fan, Kang, Zurada (b8) 2022; 585
Xie, Li, Diao, An, Xu (b39) 2019; 11
Jian, Xiang, Le (b16) 2022; 2022
Haydari, Yilmaz (b13) 2020; 23
Li, Fang, Zha, Gao, Zheng (b20) 2022; 129
Gers, Schmidhuber (b9) 2001; 12
Natarajan (b25) 1995; 24
Thakkar, Lohiya, Zhang (b30) 2021; 36
Cheng, Chen, Xiao (b5) 2022
Khosravi, Rezaie, Cooper (b18) 2023; 618
Maragheh, Gharehchopogh, Majidzadeh (b24) 2022; 10
Xie, Zhang, Wang, Wang, Pal (b40) 2020; 50
Zhang, Ye, Zhang (b46) 2017; 68
Zhang, Tang, Liu (b44) 2015; 26
Yang, Zhou, Balasubramanian (b43) 2013; 22
Guptha, Balamurugan, Megharaj (b12) 2022; 159
Liang, Wang (b21) 2000; 11
Chen, Hofmann, Zou (b3) 2017; 33
Noori, Ghiasi, Salehi (b26) 2022; 9
Wang, Wen, Ye (b34) 2017; 61
Xu, Chang, Xu (b41) 2012; 23
Kang, Fan, Zurada (b17) 2021; 553
Vijayaprabakaran, Sathiyamurthy (b32) 2020; 34
Wang, Wu, Zhang (b35) 2022
Stuner, Chatelain, Paquet (b29) 2020; 79
Guo (b11) 2020; 26
Shi, Chen, Chen, Lee (b28) 2022; 10
Yang, Yu, Ma (b42) 2022; 253
Luo, Liu, Yin, Li, Wu (b23) 2017; 29
De Mol, De Vito, Rosasco (b6) 2009; 25
Alzubaidi, Zhang, Humaidi (b1) 2021; 8
Xiao, Chang, Zhang (b38) 2020; 8
Tibshirani (b31) 1996; 58
References_xml – volume: 50
  start-page: 223
  year: 2003
  end-page: 235
  ident: b15
  article-title: Recurrent neural networks for time series classification
  publication-title: Neurocomputing
– volume: 618
  year: 2023
  ident: b18
  article-title: Soil water erosion susceptibility assessment using deep learning algorithms
  publication-title: J. Hydrol.
– volume: 29
  start-page: 2125
  year: 2017
  end-page: 2139
  ident: b23
  article-title: Deep learning of graphs with ngram convolutional neural networks
  publication-title: IEEE Trans. Knowl. Data Eng.
– volume: 10
  start-page: 488
  year: 2022
  ident: b24
  article-title: A new hybrid based on long short-term memory network with spotted hyena optimization algorithm for multi-label text classification
  publication-title: Mathematics
– volume: 588
  start-page: 106
  year: 2022
  end-page: 123
  ident: b33
  article-title: Convergence and robustness of bounded recurrent neural networks for solving dynamic Lyapunov equations
  publication-title: Inform. Sci.
– volume: 12
  start-page: 2451
  year: 2000
  end-page: 2471
  ident: b10
  article-title: Learning to forget: Continual prediction with LSTM
  publication-title: Neural Comput.
– volume: 52
  start-page: 7513
  year: 2022
  end-page: 7528
  ident: b4
  article-title: GC-LSTM: Graph convolution embedded LSTM for dynamic network link prediction
  publication-title: Appl. Intell.
– volume: 9
  start-page: 1735
  year: 1997
  end-page: 1780
  ident: b14
  article-title: Long short-term memory
  publication-title: Neural Comput.
– volume: 22
  start-page: 3234
  year: 2013
  end-page: 3246
  ident: b43
  article-title: Fast
  publication-title: IEEE Trans. Image Process.
– volume: 8
  start-page: 1
  year: 2021
  end-page: 74
  ident: b1
  article-title: Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions
  publication-title: J. Big Data
– volume: 12
  start-page: 1333
  year: 2001
  end-page: 1340
  ident: b9
  article-title: LSTM recurrent networks learn simple context-free and context-sensitive languages
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 26
  start-page: 383
  year: 2015
  end-page: 390
  ident: b44
  article-title: Batch gradient training method with smoothing
  publication-title: Neural Comput. Appl.
– volume: 129
  year: 2022
  ident: b20
  article-title: HAM: Hybrid attention module in deep convolutional neural networks for image classification
  publication-title: Pattern Recognit.
– volume: 25
  start-page: 201
  year: 2009
  end-page: 230
  ident: b6
  article-title: Elastic-net regularization in learning theory
  publication-title: J. Complexity
– volume: 159
  start-page: 16
  year: 2022
  end-page: 22
  ident: b12
  article-title: Cross lingual handwritten character recognition using long short term memory network with aid of elephant herding optimization algorithm
  publication-title: Pattern Recognit. Lett.
– volume: 8
  year: 2020
  ident: b38
  article-title: Multi-information spatial–temporal LSTM fusion continuous sign language neural machine translation
  publication-title: IEEE Access
– volume: 68
  start-page: 437
  year: 2017
  end-page: 454
  ident: b46
  article-title: A generalized elastic net regularization with smoothed
  publication-title: Comput. Optim. Appl.
– volume: 89
  start-page: 141
  year: 2012
  end-page: 146
  ident: b45
  article-title: Boundedness and convergence of batch backpropagation algorithm with penalty for feedforward neural networks
  publication-title: Neurocomputing
– volume: 11
  start-page: 1
  year: 2019
  end-page: 4
  ident: b39
  article-title: Regularization based fine-grained neural network pruning method
  publication-title: Proc. Int. Conf. Electron. Comput. Artif. Intell.
– volume: 585
  start-page: 70
  year: 2022
  end-page: 88
  ident: b8
  article-title: Convergence analysis for Sigma-Pi-Sigma neural network based on some relaxed conditions
  publication-title: Inform. Sci.
– volume: 12
  start-page: 709
  year: 2000
  end-page: 729
  ident: b27
  article-title: Second-order learning algorithm with squared penalty term
  publication-title: Neural Comput.
– volume: 61
  start-page: 354
  year: 2017
  end-page: 363
  ident: b34
  article-title: Convergence analysis of BP neural networks via sparse response regularization
  publication-title: Appl. Soft Comput.
– volume: 26
  start-page: 421
  year: 2020
  end-page: 427
  ident: b11
  article-title: Extreme learning machine with elastic net regularization
  publication-title: Intell. Autom. Soft Comput.
– volume: 124
  start-page: 33
  year: 2014
  end-page: 42
  ident: b22
  article-title: Eigenvalue decay: A new method for neural network regularization
  publication-title: Neurocomputing
– volume: 58
  start-page: 267
  year: 1996
  end-page: 288
  ident: b31
  article-title: Regression shrinkage and selection via the lasso
  publication-title: J. R. Stat. Soc. Ser. B Methodol.
– volume: 12
  start-page: 574
  year: 2011
  end-page: 582
  ident: b37
  article-title: Online driver distraction detection using long short-term memory
  publication-title: IEEE Trans. Intell. Transp. Syst.
– volume: 24
  start-page: 227
  year: 1995
  end-page: 234
  ident: b25
  article-title: Sparse approximate solutions to linear systems
  publication-title: SIAM J. Comput.
– volume: 33
  year: 2017
  ident: b3
  article-title: Elastic-net regularization versus
  publication-title: Inverse Probl.
– volume: 2022
  year: 2022
  ident: b16
  article-title: LSTM-based attentional embedding for english machine translation
  publication-title: Sci. Program.
– volume: 79
  start-page: 34407
  year: 2020
  end-page: 34427
  ident: b29
  article-title: Handwriting recognition using cohort of LSTM and lexicon verification with extremely large lexicon
  publication-title: Multimed. Tools Appl.
– start-page: 1
  year: 2022
  end-page: 22
  ident: b5
  article-title: A dual-stage attention-based Bi-LSTM network for multivariate time series prediction
  publication-title: J. Supercomput.
– year: 2022
  ident: b35
  article-title: Predrnn: A recurrent neural network for spatiotemporal predictive learning
  publication-title: IEEE Trans. Pattern Anal. Mach. Intell.
– volume: 9
  year: 2022
  ident: b26
  article-title: An efficient data driven-based model for prediction of the total sediment load in rivers
  publication-title: Hydrology
– volume: 225
  year: 2022
  ident: b7
  article-title: Gaussian process emulation of spatio-temporal outputs of a 2D inland flood model
  publication-title: Water Res.
– volume: 11
  start-page: 1251
  year: 2000
  end-page: 1262
  ident: b21
  article-title: A recurrent neural network for nonlinear optimization with a continuously differentiable objective function and bound constraints
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 5
  start-page: 157
  year: 1994
  end-page: 166
  ident: b2
  article-title: Learning long-term dependencies with gradient descent is difficult
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 10
  year: 2022
  ident: b28
  article-title: CNO-LSTM: A chaotic neural oscillatory long short-term memory model for text classification
  publication-title: IEEE Access
– volume: 553
  start-page: 66
  year: 2021
  end-page: 82
  ident: b17
  article-title: Deterministic convergence analysis via smoothing group Lasso regularization and adaptive momentum for Sigma-Pi-Sigma neural network
  publication-title: Inform. Sci.
– volume: 253
  year: 2022
  ident: b42
  article-title: Deep representation-based transfer learning for deep neural networks
  publication-title: Knowl.-Based Syst.
– volume: 33
  start-page: 127
  year: 2012
  end-page: 135
  ident: b36
  article-title: Computational properties and convergence analysis of BPNN for cyclic and almost cyclic learning with penalty
  publication-title: Neural Netw.
– volume: 23
  start-page: 11
  year: 2020
  end-page: 32
  ident: b13
  article-title: Deep reinforcement learning for intelligent transportation systems: A survey
  publication-title: IEEE Trans. Intell. Transp. Syst.
– volume: 50
  start-page: 1333
  year: 2020
  end-page: 1346
  ident: b40
  article-title: Learning optimized structure of neural networks by hidden node pruning with
  publication-title: IEEE Trans Cybern.
– volume: 23
  start-page: 1013
  year: 2012
  end-page: 1027
  ident: b41
  article-title: Regularization: A thresholding representation theory and a fast solver
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 34
  start-page: 2637
  year: 2020
  end-page: 2650
  ident: b32
  article-title: Towards activation function search for long short-term model network: A differential evolution based approach
  publication-title: J. King Saud Univ.-Comput. Inf. Sci.
– volume: 23
  start-page: 415
  year: 2020
  end-page: 428
  ident: b19
  article-title: 3-d human behavior understanding using generalized ts-lstm networks
  publication-title: IEEE Trans. Multimed.
– volume: 36
  year: 2021
  ident: b30
  article-title: Analyzing fusion of regularization techniques in the deep learning-based intrusion detection system
  publication-title: Int. J. Intell. Syst.
– volume: 124
  start-page: 33
  year: 2014
  ident: 10.1016/j.engappai.2024.108444_b22
  article-title: Eigenvalue decay: A new method for neural network regularization
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2013.08.005
– volume: 2022
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b16
  article-title: LSTM-based attentional embedding for english machine translation
  publication-title: Sci. Program.
– volume: 5
  start-page: 157
  issue: 2
  year: 1994
  ident: 10.1016/j.engappai.2024.108444_b2
  article-title: Learning long-term dependencies with gradient descent is difficult
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/72.279181
– volume: 10
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b28
  article-title: CNO-LSTM: A chaotic neural oscillatory long short-term memory model for text classification
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2022.3228600
– volume: 11
  start-page: 1
  year: 2019
  ident: 10.1016/j.engappai.2024.108444_b39
  article-title: L0 Regularization based fine-grained neural network pruning method
  publication-title: Proc. Int. Conf. Electron. Comput. Artif. Intell.
– volume: 79
  start-page: 34407
  issue: 45
  year: 2020
  ident: 10.1016/j.engappai.2024.108444_b29
  article-title: Handwriting recognition using cohort of LSTM and lexicon verification with extremely large lexicon
  publication-title: Multimed. Tools Appl.
  doi: 10.1007/s11042-020-09198-6
– volume: 23
  start-page: 1013
  issue: 7
  year: 2012
  ident: 10.1016/j.engappai.2024.108444_b41
  article-title: L1/2 Regularization: A thresholding representation theory and a fast solver
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2012.2197412
– volume: 24
  start-page: 227
  issue: 2
  year: 1995
  ident: 10.1016/j.engappai.2024.108444_b25
  article-title: Sparse approximate solutions to linear systems
  publication-title: SIAM J. Comput.
  doi: 10.1137/S0097539792240406
– volume: 50
  start-page: 1333
  year: 2020
  ident: 10.1016/j.engappai.2024.108444_b40
  article-title: Learning optimized structure of neural networks by hidden node pruning with L1 regularization
  publication-title: IEEE Trans Cybern.
  doi: 10.1109/TCYB.2019.2950105
– volume: 618
  year: 2023
  ident: 10.1016/j.engappai.2024.108444_b18
  article-title: Soil water erosion susceptibility assessment using deep learning algorithms
  publication-title: J. Hydrol.
  doi: 10.1016/j.jhydrol.2023.129229
– volume: 12
  start-page: 709
  issue: 3
  year: 2000
  ident: 10.1016/j.engappai.2024.108444_b27
  article-title: Second-order learning algorithm with squared penalty term
  publication-title: Neural Comput.
  doi: 10.1162/089976600300015763
– volume: 26
  start-page: 383
  issue: 2
  year: 2015
  ident: 10.1016/j.engappai.2024.108444_b44
  article-title: Batch gradient training method with smoothing L0 regularization for feedforward neural networks
  publication-title: Neural Comput. Appl.
  doi: 10.1007/s00521-014-1730-x
– volume: 12
  start-page: 2451
  issue: 10
  year: 2000
  ident: 10.1016/j.engappai.2024.108444_b10
  article-title: Learning to forget: Continual prediction with LSTM
  publication-title: Neural Comput.
  doi: 10.1162/089976600300015015
– volume: 159
  start-page: 16
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b12
  article-title: Cross lingual handwritten character recognition using long short term memory network with aid of elephant herding optimization algorithm
  publication-title: Pattern Recognit. Lett.
  doi: 10.1016/j.patrec.2022.04.038
– volume: 9
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b26
  article-title: An efficient data driven-based model for prediction of the total sediment load in rivers
  publication-title: Hydrology
  doi: 10.3390/hydrology9020036
– volume: 225
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b7
  article-title: Gaussian process emulation of spatio-temporal outputs of a 2D inland flood model
  publication-title: Water Res.
  doi: 10.1016/j.watres.2022.119100
– volume: 50
  start-page: 223
  year: 2003
  ident: 10.1016/j.engappai.2024.108444_b15
  article-title: Recurrent neural networks for time series classification
  publication-title: Neurocomputing
  doi: 10.1016/S0925-2312(01)00706-8
– volume: 33
  year: 2017
  ident: 10.1016/j.engappai.2024.108444_b3
  article-title: Elastic-net regularization versus l1-regularization for linear inverse problems with quasi-sparse solutions
  publication-title: Inverse Probl.
  doi: 10.1088/1361-6420/33/1/015004
– volume: 23
  start-page: 11
  issue: 1
  year: 2020
  ident: 10.1016/j.engappai.2024.108444_b13
  article-title: Deep reinforcement learning for intelligent transportation systems: A survey
  publication-title: IEEE Trans. Intell. Transp. Syst.
  doi: 10.1109/TITS.2020.3008612
– volume: 36
  year: 2021
  ident: 10.1016/j.engappai.2024.108444_b30
  article-title: Analyzing fusion of regularization techniques in the deep learning-based intrusion detection system
  publication-title: Int. J. Intell. Syst.
  doi: 10.1002/int.22590
– volume: 8
  year: 2020
  ident: 10.1016/j.engappai.2024.108444_b38
  article-title: Multi-information spatial–temporal LSTM fusion continuous sign language neural machine translation
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2020.3039539
– volume: 68
  start-page: 437
  year: 2017
  ident: 10.1016/j.engappai.2024.108444_b46
  article-title: A generalized elastic net regularization with smoothed lq penalty for sparse vector recovery
  publication-title: Comput. Optim. Appl.
  doi: 10.1007/s10589-017-9916-7
– volume: 588
  start-page: 106
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b33
  article-title: Convergence and robustness of bounded recurrent neural networks for solving dynamic Lyapunov equations
  publication-title: Inform. Sci.
  doi: 10.1016/j.ins.2021.12.039
– volume: 29
  start-page: 2125
  issue: 10
  year: 2017
  ident: 10.1016/j.engappai.2024.108444_b23
  article-title: Deep learning of graphs with ngram convolutional neural networks
  publication-title: IEEE Trans. Knowl. Data Eng.
  doi: 10.1109/TKDE.2017.2720734
– volume: 129
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b20
  article-title: HAM: Hybrid attention module in deep convolutional neural networks for image classification
  publication-title: Pattern Recognit.
  doi: 10.1016/j.patcog.2022.108785
– volume: 58
  start-page: 267
  year: 1996
  ident: 10.1016/j.engappai.2024.108444_b31
  article-title: Regression shrinkage and selection via the lasso
  publication-title: J. R. Stat. Soc. Ser. B Methodol.
  doi: 10.1111/j.2517-6161.1996.tb02080.x
– volume: 52
  start-page: 7513
  issue: 7
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b4
  article-title: GC-LSTM: Graph convolution embedded LSTM for dynamic network link prediction
  publication-title: Appl. Intell.
  doi: 10.1007/s10489-021-02518-9
– volume: 34
  start-page: 2637
  issue: 6
  year: 2020
  ident: 10.1016/j.engappai.2024.108444_b32
  article-title: Towards activation function search for long short-term model network: A differential evolution based approach
  publication-title: J. King Saud Univ.-Comput. Inf. Sci.
– volume: 10
  start-page: 488
  issue: 3
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b24
  article-title: A new hybrid based on long short-term memory network with spotted hyena optimization algorithm for multi-label text classification
  publication-title: Mathematics
  doi: 10.3390/math10030488
– start-page: 1
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b5
  article-title: A dual-stage attention-based Bi-LSTM network for multivariate time series prediction
  publication-title: J. Supercomput.
– volume: 8
  start-page: 1
  issue: 1
  year: 2021
  ident: 10.1016/j.engappai.2024.108444_b1
  article-title: Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions
  publication-title: J. Big Data
  doi: 10.1186/s40537-021-00444-8
– volume: 23
  start-page: 415
  year: 2020
  ident: 10.1016/j.engappai.2024.108444_b19
  article-title: 3-d human behavior understanding using generalized ts-lstm networks
  publication-title: IEEE Trans. Multimed.
  doi: 10.1109/TMM.2020.2978637
– volume: 553
  start-page: 66
  year: 2021
  ident: 10.1016/j.engappai.2024.108444_b17
  article-title: Deterministic convergence analysis via smoothing group Lasso regularization and adaptive momentum for Sigma-Pi-Sigma neural network
  publication-title: Inform. Sci.
  doi: 10.1016/j.ins.2020.12.014
– volume: 33
  start-page: 127
  year: 2012
  ident: 10.1016/j.engappai.2024.108444_b36
  article-title: Computational properties and convergence analysis of BPNN for cyclic and almost cyclic learning with penalty
  publication-title: Neural Netw.
  doi: 10.1016/j.neunet.2012.04.013
– volume: 22
  start-page: 3234
  issue: 8
  year: 2013
  ident: 10.1016/j.engappai.2024.108444_b43
  article-title: Fast ℓ1-minimization algorithms for robust face recognition
  publication-title: IEEE Trans. Image Process.
  doi: 10.1109/TIP.2013.2262292
– volume: 25
  start-page: 201
  year: 2009
  ident: 10.1016/j.engappai.2024.108444_b6
  article-title: Elastic-net regularization in learning theory
  publication-title: J. Complexity
  doi: 10.1016/j.jco.2009.01.002
– volume: 61
  start-page: 354
  year: 2017
  ident: 10.1016/j.engappai.2024.108444_b34
  article-title: Convergence analysis of BP neural networks via sparse response regularization
  publication-title: Appl. Soft Comput.
  doi: 10.1016/j.asoc.2017.07.059
– volume: 9
  start-page: 1735
  issue: 8
  year: 1997
  ident: 10.1016/j.engappai.2024.108444_b14
  article-title: Long short-term memory
  publication-title: Neural Comput.
  doi: 10.1162/neco.1997.9.8.1735
– year: 2022
  ident: 10.1016/j.engappai.2024.108444_b35
  article-title: Predrnn: A recurrent neural network for spatiotemporal predictive learning
  publication-title: IEEE Trans. Pattern Anal. Mach. Intell.
– volume: 11
  start-page: 1251
  issue: 6
  year: 2000
  ident: 10.1016/j.engappai.2024.108444_b21
  article-title: A recurrent neural network for nonlinear optimization with a continuously differentiable objective function and bound constraints
  publication-title: IEEE Trans. Neural Netw.
  doi: 10.1109/72.883412
– volume: 12
  start-page: 1333
  issue: 6
  year: 2001
  ident: 10.1016/j.engappai.2024.108444_b9
  article-title: LSTM recurrent networks learn simple context-free and context-sensitive languages
  publication-title: IEEE Trans. Neural Netw.
  doi: 10.1109/72.963769
– volume: 585
  start-page: 70
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b8
  article-title: Convergence analysis for Sigma-Pi-Sigma neural network based on some relaxed conditions
  publication-title: Inform. Sci.
  doi: 10.1016/j.ins.2021.11.044
– volume: 253
  year: 2022
  ident: 10.1016/j.engappai.2024.108444_b42
  article-title: Deep representation-based transfer learning for deep neural networks
  publication-title: Knowl.-Based Syst.
  doi: 10.1016/j.knosys.2022.109526
– volume: 26
  start-page: 421
  year: 2020
  ident: 10.1016/j.engappai.2024.108444_b11
  article-title: Extreme learning machine with elastic net regularization
  publication-title: Intell. Autom. Soft Comput.
  doi: 10.32604/iasc.2020.013918
– volume: 12
  start-page: 574
  issue: 2
  year: 2011
  ident: 10.1016/j.engappai.2024.108444_b37
  article-title: Online driver distraction detection using long short-term memory
  publication-title: IEEE Trans. Intell. Transp. Syst.
  doi: 10.1109/TITS.2011.2119483
– volume: 89
  start-page: 141
  year: 2012
  ident: 10.1016/j.engappai.2024.108444_b45
  article-title: Boundedness and convergence of batch backpropagation algorithm with penalty for feedforward neural networks
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2012.02.029
StartPage 108444
SubjectTerms Batch gradient algorithm
Convergence
Long short-term memory
Regularization
Title Deterministic convergence analysis for regularized long short-term memory and its application to regression and multi-classification problems
URI https://dx.doi.org/10.1016/j.engappai.2024.108444
Volume 133