RoFormer: Enhanced transformer with Rotary Position Embedding

Uložené v:
Podrobná bibliografia
Vydané v:Neurocomputing (Amsterdam) Ročník 568; s. 127063
Hlavní autori: Su, Jianlin, Ahmed, Murtadha, Lu, Yu, Pan, Shengfeng, Bo, Wen, Liu, Yunfeng
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: 01.02.2024
ISSN:0925-2312
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
ArticleNumber 127063
Author Ahmed, Murtadha
Su, Jianlin
Lu, Yu
Liu, Yunfeng
Pan, Shengfeng
Bo, Wen
Author_xml – sequence: 1
  givenname: Jianlin
  surname: Su
  fullname: Su, Jianlin
– sequence: 2
  givenname: Murtadha
  orcidid: 0000-0002-0741-0710
  surname: Ahmed
  fullname: Ahmed, Murtadha
– sequence: 3
  givenname: Yu
  surname: Lu
  fullname: Lu, Yu
– sequence: 4
  givenname: Shengfeng
  surname: Pan
  fullname: Pan, Shengfeng
– sequence: 5
  givenname: Wen
  surname: Bo
  fullname: Bo, Wen
– sequence: 6
  givenname: Yunfeng
  surname: Liu
  fullname: Liu, Yunfeng
BookMark eNp9j0FLwzAcxXOY4Db9Bh76BVrzT9qmGXiQ0TlhoAw9hzRNXMqaSFIRv72t9eTB04MHv8f7rdDCeacRugGcAYbytsuc_lC-zwgmNAPCcEkXaIk5KVJCgVyiVYwdxsCA8CW6O_qdD70Om6R2J-mUbpMhSBfNT5t82uGUHP0gw1fy7KMdrHdJ3Te6ba17u0IXRp6jvv7NNXrd1S_bfXp4enjc3h9SRQoYUqKYAYIrTjVlTUE05rwymrGyaoguGSsk5rhqlMmLQpUAJqc0z6FpoGISNF2jzbyrgo8xaCOUHeT0ZfxqzwKwmORFJ2Z5McmLWX6E8z_we7D9KPQ_9g2cYmQ5
CitedBy_id crossref_primary_10_3390_app14188457
crossref_primary_10_1016_j_jksuci_2024_102000
crossref_primary_10_1002_ima_70172
crossref_primary_10_1007_s11042_024_20382_w
crossref_primary_10_1109_TKDE_2024_3376539
crossref_primary_10_1109_LSP_2024_3418714
crossref_primary_10_1021_acs_iecr_5c01387
crossref_primary_10_1038_s41598_025_91940_x
crossref_primary_10_1016_j_cherd_2024_12_008
crossref_primary_10_3389_frai_2025_1576992
crossref_primary_10_1016_j_ipm_2024_103814
crossref_primary_10_1016_j_eswa_2025_126961
crossref_primary_10_1038_s41467_024_49798_6
crossref_primary_10_1109_ACCESS_2024_3399670
crossref_primary_10_1016_j_imavis_2025_105672
crossref_primary_10_1049_cvi2_70022
crossref_primary_10_1162_tacl_a_00693
crossref_primary_10_1109_ACCESS_2025_3605729
crossref_primary_10_1109_LCA_2025_3535470
crossref_primary_10_1126_science_ado9336
crossref_primary_10_1109_TIM_2024_3509573
crossref_primary_10_1007_s11263_025_02426_2
crossref_primary_10_1109_JIOT_2025_3558021
crossref_primary_10_1029_2025SW004424
crossref_primary_10_1016_j_csbj_2025_07_038
crossref_primary_10_1016_j_knosys_2025_113762
crossref_primary_10_1016_j_tig_2024_11_013
crossref_primary_10_1186_s12859_024_05847_x
crossref_primary_10_1016_j_atech_2025_101266
crossref_primary_10_1038_s41592_024_02523_z
crossref_primary_10_1093_gbe_evaf101
crossref_primary_10_1016_j_knosys_2025_113881
crossref_primary_10_1016_j_eswa_2025_128596
crossref_primary_10_1109_TPAMI_2024_3463709
crossref_primary_10_1117_1_JEI_34_2_023043
crossref_primary_10_1016_j_artmed_2025_103147
crossref_primary_10_1007_s13042_025_02760_4
crossref_primary_10_1007_s10506_025_09435_z
crossref_primary_10_1093_nargab_lqaf058
crossref_primary_10_1109_TASLPRO_2025_3587393
crossref_primary_10_1007_s10115_025_02516_0
crossref_primary_10_1016_j_ijbiomac_2025_139941
crossref_primary_10_1016_j_patcog_2025_111803
crossref_primary_10_1109_TMM_2024_3373249
crossref_primary_10_1109_TPWRS_2024_3400123
crossref_primary_10_3390_molecules30051116
crossref_primary_10_3390_min15040374
crossref_primary_10_1145_3714416
crossref_primary_10_1109_LRA_2025_3568612
crossref_primary_10_1109_MNET_2024_3376419
crossref_primary_10_1016_j_procs_2025_02_260
crossref_primary_10_1016_j_neunet_2025_107123
crossref_primary_10_1080_23311908_2025_2455783
crossref_primary_10_1038_s41587_024_02511_w
crossref_primary_10_3390_math13091386
crossref_primary_10_1016_j_csbj_2025_03_021
crossref_primary_10_1016_j_taml_2024_100527
crossref_primary_10_1038_s42256_024_00946_z
crossref_primary_10_1016_j_asr_2024_10_001
crossref_primary_10_1007_s10462_025_11259_x
crossref_primary_10_1038_s41467_025_56349_0
crossref_primary_10_1109_ACCESS_2025_3582745
crossref_primary_10_1016_j_jfranklin_2025_107635
crossref_primary_10_1109_ACCESS_2025_3549031
crossref_primary_10_3389_frai_2025_1663484
crossref_primary_10_1162_coli_a_00541
crossref_primary_10_1093_bib_bbaf137
crossref_primary_10_1007_s13369_025_10472_8
crossref_primary_10_1016_j_csl_2025_101843
crossref_primary_10_1016_j_rse_2025_114913
crossref_primary_10_1109_TMC_2023_3310712
crossref_primary_10_1016_j_aei_2025_103377
crossref_primary_10_1109_TASLPRO_2025_3577336
crossref_primary_10_1093_bioinformatics_btaf446
crossref_primary_10_1007_s10115_025_02386_6
crossref_primary_10_1109_ACCESS_2025_3594132
crossref_primary_10_1016_j_trc_2025_105239
crossref_primary_10_3390_app15042106
crossref_primary_10_1109_TKDE_2024_3435765
crossref_primary_10_1109_TPAMI_2024_3443922
crossref_primary_10_1007_s00371_024_03469_1
crossref_primary_10_1007_s10115_024_02310_4
crossref_primary_10_1088_2632_2153_ad743e
crossref_primary_10_1021_acs_jcim_5c01713
crossref_primary_10_1126_sciadv_adu2488
crossref_primary_10_3389_fninf_2024_1414925
crossref_primary_10_1016_j_bspc_2024_107471
crossref_primary_10_1016_j_eswa_2025_129801
crossref_primary_10_1038_s41586_025_09292_5
crossref_primary_10_1007_s11063_025_11721_5
crossref_primary_10_1016_j_knosys_2024_112404
crossref_primary_10_1109_MNANO_2024_3513112
crossref_primary_10_1109_TKDE_2025_3580640
crossref_primary_10_1109_LRA_2025_3550707
crossref_primary_10_1145_3729405
crossref_primary_10_1109_TASLPRO_2025_3574847
crossref_primary_10_1016_j_patrec_2024_09_023
crossref_primary_10_1109_TIP_2025_3544494
crossref_primary_10_2478_amns_2025_0995
crossref_primary_10_1038_s42256_025_01044_4
crossref_primary_10_1007_s11263_024_02045_3
crossref_primary_10_1038_s42256_024_00848_0
crossref_primary_10_1145_3768577
crossref_primary_10_3390_app14156832
crossref_primary_10_1109_JSTARS_2025_3564326
crossref_primary_10_1016_j_geoen_2024_213629
crossref_primary_10_1038_s41467_025_60872_5
crossref_primary_10_1016_j_medj_2024_07_026
crossref_primary_10_1007_s40747_025_02041_1
crossref_primary_10_1016_j_ins_2025_122611
crossref_primary_10_3390_electronics13173364
crossref_primary_10_1007_s40031_025_01229_w
crossref_primary_10_1109_TCSI_2025_3547732
crossref_primary_10_1007_s00530_024_01582_8
crossref_primary_10_1007_s13042_024_02254_9
crossref_primary_10_1038_s41598_025_97519_w
crossref_primary_10_1186_s12915_025_02361_1
crossref_primary_10_3390_app15179482
crossref_primary_10_1016_j_eswa_2025_128726
crossref_primary_10_1038_s41467_025_57215_9
crossref_primary_10_1016_j_neunet_2025_108012
crossref_primary_10_1145_3744238
crossref_primary_10_1016_j_compmedimag_2024_102452
crossref_primary_10_1038_s44385_025_00022_0
crossref_primary_10_2478_fcds_2025_0015
crossref_primary_10_1016_j_ait_2025_100003
crossref_primary_10_1002_advs_202412926
crossref_primary_10_1093_bioinformatics_btaf229
crossref_primary_10_3390_electronics13234591
crossref_primary_10_14489_vkit_2025_03_pp_050_056
crossref_primary_10_1109_ACCESS_2025_3556449
crossref_primary_10_3390_chemosensors12090172
crossref_primary_10_1038_s41598_025_98271_x
crossref_primary_10_1007_s10010_025_00875_2
crossref_primary_10_1016_j_robot_2024_104870
crossref_primary_10_26599_BDMA_2024_9020028
crossref_primary_10_3390_electronics14050967
crossref_primary_10_3390_rs17122016
crossref_primary_10_1007_s12293_025_00440_y
crossref_primary_10_1007_s10844_025_00944_6
crossref_primary_10_3390_bdcc8120179
crossref_primary_10_1016_j_eswa_2025_128613
crossref_primary_10_1016_j_knosys_2024_112868
crossref_primary_10_1007_s12559_025_10404_6
crossref_primary_10_1016_j_enconman_2024_119218
crossref_primary_10_1007_s12145_025_01981_z
crossref_primary_10_1007_s13042_025_02736_4
crossref_primary_10_1016_j_sysarc_2025_103548
crossref_primary_10_1007_s00521_025_10975_3
crossref_primary_10_7717_peerj_cs_1967
crossref_primary_10_1016_j_nlp_2025_100144
crossref_primary_10_1109_TNSRE_2024_3515175
crossref_primary_10_1016_j_ajo_2025_08_008
crossref_primary_10_3390_s25051318
crossref_primary_10_3390_en17102249
crossref_primary_10_1016_j_compbiolchem_2025_108429
crossref_primary_10_1016_j_inffus_2025_103526
crossref_primary_10_1007_s11432_024_4466_3
crossref_primary_10_3390_math13111760
crossref_primary_10_1016_j_nlp_2025_100143
crossref_primary_10_3390_atmos16050541
crossref_primary_10_3390_rs17162803
crossref_primary_10_1016_j_jhydrol_2025_132906
crossref_primary_10_1109_TKDE_2021_3115669
crossref_primary_10_1145_3637871
crossref_primary_10_1049_ipr2_70213
crossref_primary_10_1364_BOE_553849
crossref_primary_10_1016_j_ab_2025_115882
crossref_primary_10_1016_j_dsp_2024_104683
crossref_primary_10_1109_TKDE_2024_3469578
crossref_primary_10_1109_LSP_2024_3353039
crossref_primary_10_1002_mp_17639
crossref_primary_10_1063_5_0211187
crossref_primary_10_1002_aisy_202401001
crossref_primary_10_1109_JBHI_2023_3319361
crossref_primary_10_1109_JAS_2025_125495
crossref_primary_10_1109_TLT_2024_3521898
crossref_primary_10_1109_JAS_2025_125498
crossref_primary_10_1016_j_knosys_2024_112686
crossref_primary_10_1007_s10586_024_05015_z
crossref_primary_10_1016_j_neucom_2025_129472
crossref_primary_10_1007_s11227_025_07297_5
crossref_primary_10_1016_j_csbj_2024_06_016
crossref_primary_10_1038_s41467_024_50903_y
crossref_primary_10_1039_D5DD00122F
crossref_primary_10_1016_j_isci_2025_113495
crossref_primary_10_1016_j_isprsjprs_2025_01_025
crossref_primary_10_1162_tacl_a_00716
crossref_primary_10_3390_app14114834
crossref_primary_10_1109_TASLPRO_2025_3578755
crossref_primary_10_1016_j_csbj_2025_05_039
crossref_primary_10_1016_j_cej_2024_158578
crossref_primary_10_1016_j_measurement_2025_117752
crossref_primary_10_1016_j_artmed_2025_103220
crossref_primary_10_1080_03155986_2024_2388452
crossref_primary_10_1145_3712064
crossref_primary_10_1134_S1064562423701168
crossref_primary_10_1038_s41591_024_03445_1
crossref_primary_10_1016_j_heliyon_2024_e39038
crossref_primary_10_1016_j_bbapap_2025_141100
crossref_primary_10_1016_j_patcog_2025_111641
crossref_primary_10_5814_j_issn_1674_764x_2025_02_018
crossref_primary_10_1093_bioadv_vbaf117
crossref_primary_10_1145_3725273
crossref_primary_10_1016_j_tics_2024_01_011
crossref_primary_10_1007_s11192_025_05386_z
crossref_primary_10_1016_j_neunet_2025_107769
crossref_primary_10_1109_LRA_2025_3539080
crossref_primary_10_3390_app14125068
crossref_primary_10_1016_j_eswa_2025_129290
crossref_primary_10_3390_app142411777
crossref_primary_10_1016_j_eswa_2025_128523
crossref_primary_10_1134_S1054661824700962
crossref_primary_10_3390_electronics13245040
crossref_primary_10_3390_electronics13152892
crossref_primary_10_1016_j_inffus_2025_103332
crossref_primary_10_1016_j_eswa_2025_128658
crossref_primary_10_1016_j_compbiomed_2025_109845
crossref_primary_10_3390_drones9060386
crossref_primary_10_1186_s12880_024_01476_1
crossref_primary_10_1109_TIM_2025_3551795
crossref_primary_10_1121_10_0038981
crossref_primary_10_3390_app15137260
crossref_primary_10_1016_j_dsm_2025_09_001
crossref_primary_10_1007_s44366_025_0060_0
crossref_primary_10_1016_j_patcog_2024_110572
crossref_primary_10_1016_j_isprsjprs_2025_09_006
crossref_primary_10_1038_s41598_025_12498_2
crossref_primary_10_1016_j_jvcir_2025_104558
crossref_primary_10_1007_s10489_024_05549_0
crossref_primary_10_1016_j_compbiomed_2024_108626
crossref_primary_10_1016_j_aei_2024_102713
crossref_primary_10_1016_j_brainres_2025_149634
crossref_primary_10_1109_TSTE_2024_3482360
crossref_primary_10_1186_s13321_025_00959_9
crossref_primary_10_1109_TRO_2025_3539193
crossref_primary_10_1016_j_asoc_2025_113622
crossref_primary_10_1038_s40494_025_01621_1
crossref_primary_10_1016_j_jcp_2025_114272
crossref_primary_10_1109_JIOT_2025_3560654
crossref_primary_10_1109_LRA_2025_3592065
crossref_primary_10_1007_s11263_025_02353_2
crossref_primary_10_3390_rs17030517
crossref_primary_10_1007_s10796_025_10634_x
crossref_primary_10_1109_OJCS_2025_3587005
crossref_primary_10_3390_s24227128
crossref_primary_10_1080_19420862_2025_2528902
crossref_primary_10_1109_ACCESS_2024_3397326
crossref_primary_10_1016_j_neuroimage_2025_121096
crossref_primary_10_1016_j_patter_2025_101325
crossref_primary_10_1186_s12911_025_03037_0
crossref_primary_10_1109_TIM_2025_3576014
crossref_primary_10_1145_3696413
crossref_primary_10_1109_LSP_2024_3522856
crossref_primary_10_3390_app15168999
crossref_primary_10_1016_j_trc_2025_105183
crossref_primary_10_1109_JSTARS_2025_3539791
crossref_primary_10_1016_j_xgen_2025_100762
crossref_primary_10_1016_j_inffus_2025_103265
crossref_primary_10_1016_j_ymeth_2025_01_015
crossref_primary_10_1007_s11227_025_07348_x
crossref_primary_10_1016_j_jpha_2025_101406
crossref_primary_10_1016_j_eswa_2025_129658
crossref_primary_10_1145_3768165
crossref_primary_10_1109_LGRS_2025_3607840
crossref_primary_10_1145_3768163
crossref_primary_10_1109_ACCESS_2025_3562967
crossref_primary_10_1109_ACCESS_2025_3560549
crossref_primary_10_1371_journal_pone_0302275
crossref_primary_10_1016_j_aiopen_2025_01_002
crossref_primary_10_1016_j_neucom_2025_130517
crossref_primary_10_1109_LSP_2025_3601497
crossref_primary_10_1109_JBHI_2024_3416348
crossref_primary_10_3390_bioengineering12050538
crossref_primary_10_3390_electronics14142829
crossref_primary_10_3390_fi17040185
crossref_primary_10_1109_TCSVT_2024_3445337
crossref_primary_10_1109_ACCESS_2025_3537649
crossref_primary_10_1038_s41467_025_58250_2
crossref_primary_10_1016_j_jksuci_2024_102095
crossref_primary_10_1146_annurev_biodatasci_103123_095406
crossref_primary_10_1109_TSE_2025_3548168
crossref_primary_10_1145_3768156
crossref_primary_10_32604_cmc_2024_059018
crossref_primary_10_3390_make7030093
crossref_primary_10_1145_3759441_3759448
crossref_primary_10_1145_3727882
crossref_primary_10_3390_jlpea15010008
crossref_primary_10_1111_1755_6724_15213
crossref_primary_10_1109_ACCESS_2025_3580488
crossref_primary_10_1021_acs_jcim_5c00914
crossref_primary_10_1016_j_engappai_2025_110215
crossref_primary_10_1038_s42256_024_00920_9
crossref_primary_10_1109_TSC_2024_3440013
crossref_primary_10_1002_mlf2_12157
Cites_doi 10.1162/tacl_a_00574
10.2478/pralin-2018-0002
ContentType Journal Article
DBID AAYXX
CITATION
DOI 10.1016/j.neucom.2023.127063
DatabaseName CrossRef
DatabaseTitle CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
ExternalDocumentID 10_1016_j_neucom_2023_127063
GroupedDBID ---
--K
--M
.DC
.~1
0R~
123
1B1
1~.
1~5
29N
4.4
457
4G.
53G
5VS
7-5
71M
8P~
9DU
9JM
9JN
AABNK
AAEDT
AAEDW
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AATTM
AAXKI
AAXLA
AAXUO
AAYFN
AAYWO
AAYXX
ABBOA
ABCQJ
ABFNM
ABJNI
ABMAC
ABWVN
ABXDB
ACDAQ
ACGFS
ACLOT
ACNNM
ACRLP
ACRPL
ACVFH
ACZNC
ADBBV
ADCNI
ADEZE
ADJOM
ADMUD
ADNMO
AEBSH
AEIPS
AEKER
AENEX
AEUPX
AFJKZ
AFPUW
AFTJW
AFXIZ
AGHFR
AGQPQ
AGUBO
AGWIK
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIGII
AIIUN
AIKHN
AITUG
AKBMS
AKRWK
AKYEP
ALMA_UNASSIGNED_HOLDINGS
AMRAJ
ANKPU
AOUOD
APXCP
ASPBG
AVWKF
AXJTR
AZFZN
BKOJK
BLXMC
CITATION
CS3
DU5
EBS
EFJIC
EFKBS
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HLZ
HVGLF
HZ~
IHE
J1W
KOM
LG9
M41
MO0
MOBAO
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
Q38
R2-
ROL
RPZ
SBC
SDF
SDG
SDP
SES
SEW
SPC
SPCBC
SSN
SSV
SSZ
T5K
WUQ
XPP
ZMT
~G-
~HD
ID FETCH-LOGICAL-c251t-2c7f120893e37b52e0998fe7768b2e6775a0908bcf455c611f433441bb187a1e3
ISICitedReferencesCount 508
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001128175500001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0925-2312
IngestDate Tue Nov 18 22:35:27 EST 2025
Sat Nov 29 07:15:30 EST 2025
IsPeerReviewed true
IsScholarly true
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c251t-2c7f120893e37b52e0998fe7768b2e6775a0908bcf455c611f433441bb187a1e3
ORCID 0000-0002-0741-0710
ParticipantIDs crossref_citationtrail_10_1016_j_neucom_2023_127063
crossref_primary_10_1016_j_neucom_2023_127063
PublicationCentury 2000
PublicationDate 2024-02-00
PublicationDateYYYYMMDD 2024-02-01
PublicationDate_xml – month: 02
  year: 2024
  text: 2024-02-00
PublicationDecade 2020
PublicationTitle Neurocomputing (Amsterdam)
PublicationYear 2024
References Popel (10.1016/j.neucom.2023.127063_b30) 2018; 110
Brown (10.1016/j.neucom.2023.127063_b50) 2020
Devlin (10.1016/j.neucom.2023.127063_b4) 2019
Liu (10.1016/j.neucom.2023.127063_b20) 2020; vol. 119
Mahoney (10.1016/j.neucom.2023.127063_b45) 2011
Ke (10.1016/j.neucom.2023.127063_b17) 2021
Junczys-Dowmunt (10.1016/j.neucom.2023.127063_b29) 2016
Huang (10.1016/j.neucom.2023.127063_b13) 2019
Wolf (10.1016/j.neucom.2023.127063_b44) 2020
Al-Natsheh (10.1016/j.neucom.2023.127063_b41) 2017
Raffel (10.1016/j.neucom.2023.127063_b51) 2020; 21
Joshi (10.1016/j.neucom.2023.127063_b55) 2017
Choromanski (10.1016/j.neucom.2023.127063_b26) 2021
Wu (10.1016/j.neucom.2023.127063_b31) 2016
Socher (10.1016/j.neucom.2023.127063_b39) 2013
Islam (10.1016/j.neucom.2023.127063_b2) 2020
Williams (10.1016/j.neucom.2023.127063_b43) 2018
Sennrich (10.1016/j.neucom.2023.127063_b28) 2016
Paperno (10.1016/j.neucom.2023.127063_b53) 2016
Murtadha (10.1016/j.neucom.2023.127063_b3) 2023; 11
Yun (10.1016/j.neucom.2023.127063_b7) 2020
Raffel (10.1016/j.neucom.2023.127063_b16) 2020; 21
Shaw (10.1016/j.neucom.2023.127063_b32) 2018
Papineni (10.1016/j.neucom.2023.127063_b34) 2002
Shaw (10.1016/j.neucom.2023.127063_b12) 2018
Wang (10.1016/j.neucom.2023.127063_b22) 2020
Parikh (10.1016/j.neucom.2023.127063_b11) 2016
Foundation (10.1016/j.neucom.2023.127063_b36) 2021
Shen (10.1016/j.neucom.2023.127063_b24) 2021
Biderman (10.1016/j.neucom.2023.127063_b49) 2021
Dai (10.1016/j.neucom.2023.127063_b14) 2019
Press (10.1016/j.neucom.2023.127063_b60) 2022
Xiao (10.1016/j.neucom.2023.127063_b48) 2019
Liu (10.1016/j.neucom.2023.127063_b59) 2019
Loshchilov (10.1016/j.neucom.2023.127063_b37) 2019
Rajpurkar (10.1016/j.neucom.2023.127063_b40) 2016
Wang (10.1016/j.neucom.2023.127063_b25) 2019
Ott (10.1016/j.neucom.2023.127063_b33) 2019
Clark (10.1016/j.neucom.2023.127063_b9) 2020
Wei (10.1016/j.neucom.2023.127063_b47) 2019
Lan (10.1016/j.neucom.2023.127063_b8) 2020
Gehring (10.1016/j.neucom.2023.127063_b1) 2017; vol. 70
Chen (10.1016/j.neucom.2023.127063_b42) 2018
Zhu (10.1016/j.neucom.2023.127063_b58) 2015
Bojar (10.1016/j.neucom.2023.127063_b27) 2014
Radford (10.1016/j.neucom.2023.127063_b6) 2019
Chen (10.1016/j.neucom.2023.127063_b21) 2018
Zhu (10.1016/j.neucom.2023.127063_b35) 2015
Radford (10.1016/j.neucom.2023.127063_b10) 2018
Katharopoulos (10.1016/j.neucom.2023.127063_b23) 2020; vol. 119
Carion (10.1016/j.neucom.2023.127063_b57) 2020
Huang (10.1016/j.neucom.2023.127063_b19) 2020
Dolan (10.1016/j.neucom.2023.127063_b38) 2005
Zellers (10.1016/j.neucom.2023.127063_b54) 2019
Vaswani (10.1016/j.neucom.2023.127063_b5) 2017
Tian (10.1016/j.neucom.2023.127063_b56) 2022
He (10.1016/j.neucom.2023.127063_b18) 2021
Gao (10.1016/j.neucom.2023.127063_b52) 2021
10.1016/j.neucom.2023.127063_b46
Yang (10.1016/j.neucom.2023.127063_b15) 2019
References_xml – start-page: 2249
  year: 2016
  ident: 10.1016/j.neucom.2023.127063_b11
  article-title: A decomposable attention model for natural language inference
– start-page: 12
  year: 2014
  ident: 10.1016/j.neucom.2023.127063_b27
  article-title: Findings of the 2014 workshop on statistical machine translation
– year: 2016
  ident: 10.1016/j.neucom.2023.127063_b28
  article-title: Neural machine translation of rare words with subword units
– year: 2019
  ident: 10.1016/j.neucom.2023.127063_b59
– start-page: 3327
  year: 2020
  ident: 10.1016/j.neucom.2023.127063_b19
  article-title: Improve transformer models with better relative position embeddings
– start-page: 19
  year: 2015
  ident: 10.1016/j.neucom.2023.127063_b58
  article-title: Aligning books and movies: Towards story-like visual explanations by watching movies and reading books
– start-page: 464
  year: 2018
  ident: 10.1016/j.neucom.2023.127063_b12
  article-title: Self-attention with relative position representations
– year: 2005
  ident: 10.1016/j.neucom.2023.127063_b38
  article-title: Automatically constructing a corpus of sentential paraphrases
– year: 2011
  ident: 10.1016/j.neucom.2023.127063_b45
– start-page: 48
  year: 2019
  ident: 10.1016/j.neucom.2023.127063_b33
  article-title: Fairseq: A fast, extensible toolkit for sequence modeling
– year: 2019
  ident: 10.1016/j.neucom.2023.127063_b13
  article-title: Music transformer: Generating music with long-term structure
– start-page: 3530
  year: 2021
  ident: 10.1016/j.neucom.2023.127063_b24
  article-title: Efficient attention: Attention with linear complexities
– start-page: 4791
  year: 2019
  ident: 10.1016/j.neucom.2023.127063_b54
  article-title: HellaSwag: Can a machine really finish your sentence?
– volume: 11
  start-page: 771
  issn: 2307-387X
  year: 2023
  ident: 10.1016/j.neucom.2023.127063_b3
  article-title: Rank-Aware Negative Training for Semi-Supervised Text Classification
  publication-title: Transactions of the Association for Computational Linguistics
  doi: 10.1162/tacl_a_00574
– start-page: 5998
  year: 2017
  ident: 10.1016/j.neucom.2023.127063_b5
  article-title: Attention is all you need
– year: 2020
  ident: 10.1016/j.neucom.2023.127063_b8
  article-title: ALBERT: a lite BERT for self-supervised learning of language representations
– year: 2020
  ident: 10.1016/j.neucom.2023.127063_b57
– year: 2022
  ident: 10.1016/j.neucom.2023.127063_b56
– year: 2021
  ident: 10.1016/j.neucom.2023.127063_b26
  article-title: Rethinking attention with performers
– start-page: 1631
  year: 2013
  ident: 10.1016/j.neucom.2023.127063_b39
  article-title: Recursive deep models for semantic compositionality over a sentiment treebank
– volume: 21
  start-page: 140:1
  year: 2020
  ident: 10.1016/j.neucom.2023.127063_b16
  article-title: Exploring the limits of transfer learning with a unified text-to-text transformer
  publication-title: J. Mach. Learn. Res.
– start-page: 2978
  year: 2019
  ident: 10.1016/j.neucom.2023.127063_b14
  article-title: Transformer-XL: Attentive language models beyond a fixed-length context
– year: 2019
  ident: 10.1016/j.neucom.2023.127063_b25
  article-title: GLUE: a multi-task benchmark and analysis platform for natural language understanding
– start-page: 6572
  year: 2018
  ident: 10.1016/j.neucom.2023.127063_b21
  article-title: Neural ordinary differential equations
– year: 2020
  ident: 10.1016/j.neucom.2023.127063_b22
  article-title: Encoding word order in complex embeddings
– start-page: 19
  year: 2015
  ident: 10.1016/j.neucom.2023.127063_b35
  article-title: Aligning books and movies: Towards story-like visual explanations by watching movies and reading books
– start-page: 311
  year: 2002
  ident: 10.1016/j.neucom.2023.127063_b34
  article-title: Bleu: a method for automatic evaluation of machine translation
– year: 2020
  ident: 10.1016/j.neucom.2023.127063_b50
  article-title: Language models are few-shot learners
– volume: vol. 119
  start-page: 5156
  year: 2020
  ident: 10.1016/j.neucom.2023.127063_b23
  article-title: Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention
– year: 2021
  ident: 10.1016/j.neucom.2023.127063_b49
– year: 2018
  ident: 10.1016/j.neucom.2023.127063_b42
– year: 2016
  ident: 10.1016/j.neucom.2023.127063_b53
  article-title: The LAMBADA dataset: Word prediction requiring a broad discourse context
– year: 2019
  ident: 10.1016/j.neucom.2023.127063_b48
– start-page: 1112
  year: 2018
  ident: 10.1016/j.neucom.2023.127063_b43
  article-title: A broad-coverage challenge corpus for sentence understanding through inference
– year: 2019
  ident: 10.1016/j.neucom.2023.127063_b47
– year: 2016
  ident: 10.1016/j.neucom.2023.127063_b29
  article-title: Is neural machine translation ready for deployment? A case study on 30 translation directions
– year: 2021
  ident: 10.1016/j.neucom.2023.127063_b18
  article-title: Deberta: decoding-enhanced bert with disentangled attention
– year: 2021
  ident: 10.1016/j.neucom.2023.127063_b17
  article-title: Rethinking positional encoding in language pre-training
– year: 2019
  ident: 10.1016/j.neucom.2023.127063_b37
  article-title: Decoupled weight decay regularization
– year: 2020
  ident: 10.1016/j.neucom.2023.127063_b9
  article-title: ELECTRA: pre-training text encoders as discriminators rather than generators
– year: 2019
  ident: 10.1016/j.neucom.2023.127063_b6
– start-page: 5754
  year: 2019
  ident: 10.1016/j.neucom.2023.127063_b15
  article-title: Xlnet: Generalized autoregressive pretraining for language understanding
– year: 2021
  ident: 10.1016/j.neucom.2023.127063_b36
– start-page: 38
  year: 2020
  ident: 10.1016/j.neucom.2023.127063_b44
  article-title: Transformers: State-of-the-art natural language processing
– year: 2022
  ident: 10.1016/j.neucom.2023.127063_b60
  article-title: Train short, test long: Attention with linear biases enables input length extrapolation
– year: 2016
  ident: 10.1016/j.neucom.2023.127063_b31
– start-page: 464
  year: 2018
  ident: 10.1016/j.neucom.2023.127063_b32
  article-title: Self-attention with relative position representations
– start-page: 2383
  year: 2016
  ident: 10.1016/j.neucom.2023.127063_b40
  article-title: Squad: 100, 000+ questions for machine comprehension of text
– volume: 21
  start-page: 140:1
  year: 2020
  ident: 10.1016/j.neucom.2023.127063_b51
  article-title: Exploring the limits of transfer learning with a unified text-to-text transformer
  publication-title: J. Mach. Learn. Res.
– year: 2020
  ident: 10.1016/j.neucom.2023.127063_b7
  article-title: Are transformers universal approximators of sequence-to-sequence functions?
– year: 2021
  ident: 10.1016/j.neucom.2023.127063_b52
– volume: vol. 119
  start-page: 6327
  year: 2020
  ident: 10.1016/j.neucom.2023.127063_b20
  article-title: Learning to encode position for transformer with continuous dynamical model
– year: 2018
  ident: 10.1016/j.neucom.2023.127063_b10
– start-page: 4171
  year: 2019
  ident: 10.1016/j.neucom.2023.127063_b4
  article-title: BERT: pre-training of deep bidirectional transformers for language understanding
– start-page: 1601
  year: 2017
  ident: 10.1016/j.neucom.2023.127063_b55
  article-title: Triviaqa: A large scale distantly supervised challenge dataset for reading comprehension
– volume: vol. 70
  start-page: 1243
  year: 2017
  ident: 10.1016/j.neucom.2023.127063_b1
  article-title: Convolutional sequence to sequence learning
– volume: 110
  start-page: 43
  year: 2018
  ident: 10.1016/j.neucom.2023.127063_b30
  article-title: Training tips for the transformer model
  publication-title: Prague Bull. Math. Linguist.
  doi: 10.2478/pralin-2018-0002
– ident: 10.1016/j.neucom.2023.127063_b46
– year: 2020
  ident: 10.1016/j.neucom.2023.127063_b2
  article-title: How much position information do convolutional neural networks encode?
– start-page: 115
  year: 2017
  ident: 10.1016/j.neucom.2023.127063_b41
  article-title: Udl at SemEval-2017 task 1: Semantic textual similarity estimation of english sentence pairs using regression model over pairwise features
SSID ssj0017129
Score 2.7672222
SourceID crossref
SourceType Enrichment Source
Index Database
StartPage 127063
Title RoFormer: Enhanced transformer with Rotary Position Embedding
Volume 568
WOSCitedRecordID wos001128175500001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  issn: 0925-2312
  databaseCode: AIEXJ
  dateStart: 19950101
  customDbUrl:
  isFulltext: true
  dateEnd: 99991231
  titleUrlDefault: https://www.sciencedirect.com
  omitProxy: false
  ssIdentifier: ssj0017129
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9wwELbo0kMvtPQhoA_5wK0yWjsPx72tqkVthRCigLanyE7sLis2i5ak4ud3HD92KRIqBy5RFE2cKN-n8ZfxeAahfUFNYpioiEmKlKRSUaJSaogUUgqa1VL0yZgXR_z4uJhMxInfXXLTtxPgTVPc3orrJ4UargHYduvsI-COg8IFOAfQ4Qiww_G_gD9dHIIO1X37x3EzdSv8bdCneulCr6eL1ubLnficrc_judJ1nMdmoaZTB_Nb3_fBRxRGc1tYobYsihGEn11PBeDZ1WWk2mg6d2FUQLKV9TR6_6Pe-le3WrtyEdipbn4b7Z_vwxAsDZnLq3giywiIxTuuNXMtc7xztIvczpvd89suhDA7aHRnk3hsT_eDlfndMtn_TF8xqTDkq81KN0ppRyndKM_QJuOZKAZoc_R9PPkRF5o4Za4co3_7sLuyTwG8_zZr6mVNhpy9Qlv-_wGPHO7baEM3r9HL0JsDe1f9BkUafMGBBHiNBNiSADsS4EACHEnwFp0fjs--fiO-VQapQKC2hFXcUDYE8akTrjKmQfgXRnP4mVRM55xnciiGhapMmmVVTqlJkwSUsFK04JLq5B0aNItG7yAswDZnWhUmNymjucypZvWwAqEtapnmuygJn6CsfB15287kqnwIgF1E4l3Xro7Kg_Z7j7R_j16sWPkBDdplpz-i59Wf9vJm-cmj_hfBDWwD
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=RoFormer%3A+Enhanced+transformer+with+Rotary+Position+Embedding&rft.jtitle=Neurocomputing+%28Amsterdam%29&rft.au=Su%2C+Jianlin&rft.au=Ahmed%2C+Murtadha&rft.au=Lu%2C+Yu&rft.au=Pan%2C+Shengfeng&rft.date=2024-02-01&rft.issn=0925-2312&rft.volume=568&rft.spage=127063&rft_id=info:doi/10.1016%2Fj.neucom.2023.127063&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_neucom_2023_127063
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0925-2312&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0925-2312&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0925-2312&client=summon