RoFormer: Enhanced transformer with Rotary Position Embedding
Uložené v:
| Vydané v: | Neurocomputing (Amsterdam) Ročník 568; s. 127063 |
|---|---|
| Hlavní autori: | , , , , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
01.02.2024
|
| ISSN: | 0925-2312 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| ArticleNumber | 127063 |
|---|---|
| Author | Ahmed, Murtadha Su, Jianlin Lu, Yu Liu, Yunfeng Pan, Shengfeng Bo, Wen |
| Author_xml | – sequence: 1 givenname: Jianlin surname: Su fullname: Su, Jianlin – sequence: 2 givenname: Murtadha orcidid: 0000-0002-0741-0710 surname: Ahmed fullname: Ahmed, Murtadha – sequence: 3 givenname: Yu surname: Lu fullname: Lu, Yu – sequence: 4 givenname: Shengfeng surname: Pan fullname: Pan, Shengfeng – sequence: 5 givenname: Wen surname: Bo fullname: Bo, Wen – sequence: 6 givenname: Yunfeng surname: Liu fullname: Liu, Yunfeng |
| BookMark | eNp9j0FLwzAcxXOY4Db9Bh76BVrzT9qmGXiQ0TlhoAw9hzRNXMqaSFIRv72t9eTB04MHv8f7rdDCeacRugGcAYbytsuc_lC-zwgmNAPCcEkXaIk5KVJCgVyiVYwdxsCA8CW6O_qdD70Om6R2J-mUbpMhSBfNT5t82uGUHP0gw1fy7KMdrHdJ3Te6ba17u0IXRp6jvv7NNXrd1S_bfXp4enjc3h9SRQoYUqKYAYIrTjVlTUE05rwymrGyaoguGSsk5rhqlMmLQpUAJqc0z6FpoGISNF2jzbyrgo8xaCOUHeT0ZfxqzwKwmORFJ2Z5McmLWX6E8z_we7D9KPQ_9g2cYmQ5 |
| CitedBy_id | crossref_primary_10_3390_app14188457 crossref_primary_10_1016_j_jksuci_2024_102000 crossref_primary_10_1002_ima_70172 crossref_primary_10_1007_s11042_024_20382_w crossref_primary_10_1109_TKDE_2024_3376539 crossref_primary_10_1109_LSP_2024_3418714 crossref_primary_10_1021_acs_iecr_5c01387 crossref_primary_10_1038_s41598_025_91940_x crossref_primary_10_1016_j_cherd_2024_12_008 crossref_primary_10_3389_frai_2025_1576992 crossref_primary_10_1016_j_ipm_2024_103814 crossref_primary_10_1016_j_eswa_2025_126961 crossref_primary_10_1038_s41467_024_49798_6 crossref_primary_10_1109_ACCESS_2024_3399670 crossref_primary_10_1016_j_imavis_2025_105672 crossref_primary_10_1049_cvi2_70022 crossref_primary_10_1162_tacl_a_00693 crossref_primary_10_1109_ACCESS_2025_3605729 crossref_primary_10_1109_LCA_2025_3535470 crossref_primary_10_1126_science_ado9336 crossref_primary_10_1109_TIM_2024_3509573 crossref_primary_10_1007_s11263_025_02426_2 crossref_primary_10_1109_JIOT_2025_3558021 crossref_primary_10_1029_2025SW004424 crossref_primary_10_1016_j_csbj_2025_07_038 crossref_primary_10_1016_j_knosys_2025_113762 crossref_primary_10_1016_j_tig_2024_11_013 crossref_primary_10_1186_s12859_024_05847_x crossref_primary_10_1016_j_atech_2025_101266 crossref_primary_10_1038_s41592_024_02523_z crossref_primary_10_1093_gbe_evaf101 crossref_primary_10_1016_j_knosys_2025_113881 crossref_primary_10_1016_j_eswa_2025_128596 crossref_primary_10_1109_TPAMI_2024_3463709 crossref_primary_10_1117_1_JEI_34_2_023043 crossref_primary_10_1016_j_artmed_2025_103147 crossref_primary_10_1007_s13042_025_02760_4 crossref_primary_10_1007_s10506_025_09435_z crossref_primary_10_1093_nargab_lqaf058 crossref_primary_10_1109_TASLPRO_2025_3587393 crossref_primary_10_1007_s10115_025_02516_0 crossref_primary_10_1016_j_ijbiomac_2025_139941 crossref_primary_10_1016_j_patcog_2025_111803 crossref_primary_10_1109_TMM_2024_3373249 crossref_primary_10_1109_TPWRS_2024_3400123 crossref_primary_10_3390_molecules30051116 crossref_primary_10_3390_min15040374 crossref_primary_10_1145_3714416 crossref_primary_10_1109_LRA_2025_3568612 crossref_primary_10_1109_MNET_2024_3376419 crossref_primary_10_1016_j_procs_2025_02_260 crossref_primary_10_1016_j_neunet_2025_107123 crossref_primary_10_1080_23311908_2025_2455783 crossref_primary_10_1038_s41587_024_02511_w crossref_primary_10_3390_math13091386 crossref_primary_10_1016_j_csbj_2025_03_021 crossref_primary_10_1016_j_taml_2024_100527 crossref_primary_10_1038_s42256_024_00946_z crossref_primary_10_1016_j_asr_2024_10_001 crossref_primary_10_1007_s10462_025_11259_x crossref_primary_10_1038_s41467_025_56349_0 crossref_primary_10_1109_ACCESS_2025_3582745 crossref_primary_10_1016_j_jfranklin_2025_107635 crossref_primary_10_1109_ACCESS_2025_3549031 crossref_primary_10_3389_frai_2025_1663484 crossref_primary_10_1162_coli_a_00541 crossref_primary_10_1093_bib_bbaf137 crossref_primary_10_1007_s13369_025_10472_8 crossref_primary_10_1016_j_csl_2025_101843 crossref_primary_10_1016_j_rse_2025_114913 crossref_primary_10_1109_TMC_2023_3310712 crossref_primary_10_1016_j_aei_2025_103377 crossref_primary_10_1109_TASLPRO_2025_3577336 crossref_primary_10_1093_bioinformatics_btaf446 crossref_primary_10_1007_s10115_025_02386_6 crossref_primary_10_1109_ACCESS_2025_3594132 crossref_primary_10_1016_j_trc_2025_105239 crossref_primary_10_3390_app15042106 crossref_primary_10_1109_TKDE_2024_3435765 crossref_primary_10_1109_TPAMI_2024_3443922 crossref_primary_10_1007_s00371_024_03469_1 crossref_primary_10_1007_s10115_024_02310_4 crossref_primary_10_1088_2632_2153_ad743e crossref_primary_10_1021_acs_jcim_5c01713 crossref_primary_10_1126_sciadv_adu2488 crossref_primary_10_3389_fninf_2024_1414925 crossref_primary_10_1016_j_bspc_2024_107471 crossref_primary_10_1016_j_eswa_2025_129801 crossref_primary_10_1038_s41586_025_09292_5 crossref_primary_10_1007_s11063_025_11721_5 crossref_primary_10_1016_j_knosys_2024_112404 crossref_primary_10_1109_MNANO_2024_3513112 crossref_primary_10_1109_TKDE_2025_3580640 crossref_primary_10_1109_LRA_2025_3550707 crossref_primary_10_1145_3729405 crossref_primary_10_1109_TASLPRO_2025_3574847 crossref_primary_10_1016_j_patrec_2024_09_023 crossref_primary_10_1109_TIP_2025_3544494 crossref_primary_10_2478_amns_2025_0995 crossref_primary_10_1038_s42256_025_01044_4 crossref_primary_10_1007_s11263_024_02045_3 crossref_primary_10_1038_s42256_024_00848_0 crossref_primary_10_1145_3768577 crossref_primary_10_3390_app14156832 crossref_primary_10_1109_JSTARS_2025_3564326 crossref_primary_10_1016_j_geoen_2024_213629 crossref_primary_10_1038_s41467_025_60872_5 crossref_primary_10_1016_j_medj_2024_07_026 crossref_primary_10_1007_s40747_025_02041_1 crossref_primary_10_1016_j_ins_2025_122611 crossref_primary_10_3390_electronics13173364 crossref_primary_10_1007_s40031_025_01229_w crossref_primary_10_1109_TCSI_2025_3547732 crossref_primary_10_1007_s00530_024_01582_8 crossref_primary_10_1007_s13042_024_02254_9 crossref_primary_10_1038_s41598_025_97519_w crossref_primary_10_1186_s12915_025_02361_1 crossref_primary_10_3390_app15179482 crossref_primary_10_1016_j_eswa_2025_128726 crossref_primary_10_1038_s41467_025_57215_9 crossref_primary_10_1016_j_neunet_2025_108012 crossref_primary_10_1145_3744238 crossref_primary_10_1016_j_compmedimag_2024_102452 crossref_primary_10_1038_s44385_025_00022_0 crossref_primary_10_2478_fcds_2025_0015 crossref_primary_10_1016_j_ait_2025_100003 crossref_primary_10_1002_advs_202412926 crossref_primary_10_1093_bioinformatics_btaf229 crossref_primary_10_3390_electronics13234591 crossref_primary_10_14489_vkit_2025_03_pp_050_056 crossref_primary_10_1109_ACCESS_2025_3556449 crossref_primary_10_3390_chemosensors12090172 crossref_primary_10_1038_s41598_025_98271_x crossref_primary_10_1007_s10010_025_00875_2 crossref_primary_10_1016_j_robot_2024_104870 crossref_primary_10_26599_BDMA_2024_9020028 crossref_primary_10_3390_electronics14050967 crossref_primary_10_3390_rs17122016 crossref_primary_10_1007_s12293_025_00440_y crossref_primary_10_1007_s10844_025_00944_6 crossref_primary_10_3390_bdcc8120179 crossref_primary_10_1016_j_eswa_2025_128613 crossref_primary_10_1016_j_knosys_2024_112868 crossref_primary_10_1007_s12559_025_10404_6 crossref_primary_10_1016_j_enconman_2024_119218 crossref_primary_10_1007_s12145_025_01981_z crossref_primary_10_1007_s13042_025_02736_4 crossref_primary_10_1016_j_sysarc_2025_103548 crossref_primary_10_1007_s00521_025_10975_3 crossref_primary_10_7717_peerj_cs_1967 crossref_primary_10_1016_j_nlp_2025_100144 crossref_primary_10_1109_TNSRE_2024_3515175 crossref_primary_10_1016_j_ajo_2025_08_008 crossref_primary_10_3390_s25051318 crossref_primary_10_3390_en17102249 crossref_primary_10_1016_j_compbiolchem_2025_108429 crossref_primary_10_1016_j_inffus_2025_103526 crossref_primary_10_1007_s11432_024_4466_3 crossref_primary_10_3390_math13111760 crossref_primary_10_1016_j_nlp_2025_100143 crossref_primary_10_3390_atmos16050541 crossref_primary_10_3390_rs17162803 crossref_primary_10_1016_j_jhydrol_2025_132906 crossref_primary_10_1109_TKDE_2021_3115669 crossref_primary_10_1145_3637871 crossref_primary_10_1049_ipr2_70213 crossref_primary_10_1364_BOE_553849 crossref_primary_10_1016_j_ab_2025_115882 crossref_primary_10_1016_j_dsp_2024_104683 crossref_primary_10_1109_TKDE_2024_3469578 crossref_primary_10_1109_LSP_2024_3353039 crossref_primary_10_1002_mp_17639 crossref_primary_10_1063_5_0211187 crossref_primary_10_1002_aisy_202401001 crossref_primary_10_1109_JBHI_2023_3319361 crossref_primary_10_1109_JAS_2025_125495 crossref_primary_10_1109_TLT_2024_3521898 crossref_primary_10_1109_JAS_2025_125498 crossref_primary_10_1016_j_knosys_2024_112686 crossref_primary_10_1007_s10586_024_05015_z crossref_primary_10_1016_j_neucom_2025_129472 crossref_primary_10_1007_s11227_025_07297_5 crossref_primary_10_1016_j_csbj_2024_06_016 crossref_primary_10_1038_s41467_024_50903_y crossref_primary_10_1039_D5DD00122F crossref_primary_10_1016_j_isci_2025_113495 crossref_primary_10_1016_j_isprsjprs_2025_01_025 crossref_primary_10_1162_tacl_a_00716 crossref_primary_10_3390_app14114834 crossref_primary_10_1109_TASLPRO_2025_3578755 crossref_primary_10_1016_j_csbj_2025_05_039 crossref_primary_10_1016_j_cej_2024_158578 crossref_primary_10_1016_j_measurement_2025_117752 crossref_primary_10_1016_j_artmed_2025_103220 crossref_primary_10_1080_03155986_2024_2388452 crossref_primary_10_1145_3712064 crossref_primary_10_1134_S1064562423701168 crossref_primary_10_1038_s41591_024_03445_1 crossref_primary_10_1016_j_heliyon_2024_e39038 crossref_primary_10_1016_j_bbapap_2025_141100 crossref_primary_10_1016_j_patcog_2025_111641 crossref_primary_10_5814_j_issn_1674_764x_2025_02_018 crossref_primary_10_1093_bioadv_vbaf117 crossref_primary_10_1145_3725273 crossref_primary_10_1016_j_tics_2024_01_011 crossref_primary_10_1007_s11192_025_05386_z crossref_primary_10_1016_j_neunet_2025_107769 crossref_primary_10_1109_LRA_2025_3539080 crossref_primary_10_3390_app14125068 crossref_primary_10_1016_j_eswa_2025_129290 crossref_primary_10_3390_app142411777 crossref_primary_10_1016_j_eswa_2025_128523 crossref_primary_10_1134_S1054661824700962 crossref_primary_10_3390_electronics13245040 crossref_primary_10_3390_electronics13152892 crossref_primary_10_1016_j_inffus_2025_103332 crossref_primary_10_1016_j_eswa_2025_128658 crossref_primary_10_1016_j_compbiomed_2025_109845 crossref_primary_10_3390_drones9060386 crossref_primary_10_1186_s12880_024_01476_1 crossref_primary_10_1109_TIM_2025_3551795 crossref_primary_10_1121_10_0038981 crossref_primary_10_3390_app15137260 crossref_primary_10_1016_j_dsm_2025_09_001 crossref_primary_10_1007_s44366_025_0060_0 crossref_primary_10_1016_j_patcog_2024_110572 crossref_primary_10_1016_j_isprsjprs_2025_09_006 crossref_primary_10_1038_s41598_025_12498_2 crossref_primary_10_1016_j_jvcir_2025_104558 crossref_primary_10_1007_s10489_024_05549_0 crossref_primary_10_1016_j_compbiomed_2024_108626 crossref_primary_10_1016_j_aei_2024_102713 crossref_primary_10_1016_j_brainres_2025_149634 crossref_primary_10_1109_TSTE_2024_3482360 crossref_primary_10_1186_s13321_025_00959_9 crossref_primary_10_1109_TRO_2025_3539193 crossref_primary_10_1016_j_asoc_2025_113622 crossref_primary_10_1038_s40494_025_01621_1 crossref_primary_10_1016_j_jcp_2025_114272 crossref_primary_10_1109_JIOT_2025_3560654 crossref_primary_10_1109_LRA_2025_3592065 crossref_primary_10_1007_s11263_025_02353_2 crossref_primary_10_3390_rs17030517 crossref_primary_10_1007_s10796_025_10634_x crossref_primary_10_1109_OJCS_2025_3587005 crossref_primary_10_3390_s24227128 crossref_primary_10_1080_19420862_2025_2528902 crossref_primary_10_1109_ACCESS_2024_3397326 crossref_primary_10_1016_j_neuroimage_2025_121096 crossref_primary_10_1016_j_patter_2025_101325 crossref_primary_10_1186_s12911_025_03037_0 crossref_primary_10_1109_TIM_2025_3576014 crossref_primary_10_1145_3696413 crossref_primary_10_1109_LSP_2024_3522856 crossref_primary_10_3390_app15168999 crossref_primary_10_1016_j_trc_2025_105183 crossref_primary_10_1109_JSTARS_2025_3539791 crossref_primary_10_1016_j_xgen_2025_100762 crossref_primary_10_1016_j_inffus_2025_103265 crossref_primary_10_1016_j_ymeth_2025_01_015 crossref_primary_10_1007_s11227_025_07348_x crossref_primary_10_1016_j_jpha_2025_101406 crossref_primary_10_1016_j_eswa_2025_129658 crossref_primary_10_1145_3768165 crossref_primary_10_1109_LGRS_2025_3607840 crossref_primary_10_1145_3768163 crossref_primary_10_1109_ACCESS_2025_3562967 crossref_primary_10_1109_ACCESS_2025_3560549 crossref_primary_10_1371_journal_pone_0302275 crossref_primary_10_1016_j_aiopen_2025_01_002 crossref_primary_10_1016_j_neucom_2025_130517 crossref_primary_10_1109_LSP_2025_3601497 crossref_primary_10_1109_JBHI_2024_3416348 crossref_primary_10_3390_bioengineering12050538 crossref_primary_10_3390_electronics14142829 crossref_primary_10_3390_fi17040185 crossref_primary_10_1109_TCSVT_2024_3445337 crossref_primary_10_1109_ACCESS_2025_3537649 crossref_primary_10_1038_s41467_025_58250_2 crossref_primary_10_1016_j_jksuci_2024_102095 crossref_primary_10_1146_annurev_biodatasci_103123_095406 crossref_primary_10_1109_TSE_2025_3548168 crossref_primary_10_1145_3768156 crossref_primary_10_32604_cmc_2024_059018 crossref_primary_10_3390_make7030093 crossref_primary_10_1145_3759441_3759448 crossref_primary_10_1145_3727882 crossref_primary_10_3390_jlpea15010008 crossref_primary_10_1111_1755_6724_15213 crossref_primary_10_1109_ACCESS_2025_3580488 crossref_primary_10_1021_acs_jcim_5c00914 crossref_primary_10_1016_j_engappai_2025_110215 crossref_primary_10_1038_s42256_024_00920_9 crossref_primary_10_1109_TSC_2024_3440013 crossref_primary_10_1002_mlf2_12157 |
| Cites_doi | 10.1162/tacl_a_00574 10.2478/pralin-2018-0002 |
| ContentType | Journal Article |
| DBID | AAYXX CITATION |
| DOI | 10.1016/j.neucom.2023.127063 |
| DatabaseName | CrossRef |
| DatabaseTitle | CrossRef |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| ExternalDocumentID | 10_1016_j_neucom_2023_127063 |
| GroupedDBID | --- --K --M .DC .~1 0R~ 123 1B1 1~. 1~5 29N 4.4 457 4G. 53G 5VS 7-5 71M 8P~ 9DU 9JM 9JN AABNK AAEDT AAEDW AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AATTM AAXKI AAXLA AAXUO AAYFN AAYWO AAYXX ABBOA ABCQJ ABFNM ABJNI ABMAC ABWVN ABXDB ACDAQ ACGFS ACLOT ACNNM ACRLP ACRPL ACVFH ACZNC ADBBV ADCNI ADEZE ADJOM ADMUD ADNMO AEBSH AEIPS AEKER AENEX AEUPX AFJKZ AFPUW AFTJW AFXIZ AGHFR AGQPQ AGUBO AGWIK AGYEJ AHHHB AHZHX AIALX AIEXJ AIGII AIIUN AIKHN AITUG AKBMS AKRWK AKYEP ALMA_UNASSIGNED_HOLDINGS AMRAJ ANKPU AOUOD APXCP ASPBG AVWKF AXJTR AZFZN BKOJK BLXMC CITATION CS3 DU5 EBS EFJIC EFKBS EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-Q GBLVA GBOLZ HLZ HVGLF HZ~ IHE J1W KOM LG9 M41 MO0 MOBAO N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- ROL RPZ SBC SDF SDG SDP SES SEW SPC SPCBC SSN SSV SSZ T5K WUQ XPP ZMT ~G- ~HD |
| ID | FETCH-LOGICAL-c251t-2c7f120893e37b52e0998fe7768b2e6775a0908bcf455c611f433441bb187a1e3 |
| ISICitedReferencesCount | 508 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001128175500001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0925-2312 |
| IngestDate | Tue Nov 18 22:35:27 EST 2025 Sat Nov 29 07:15:30 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Language | English |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c251t-2c7f120893e37b52e0998fe7768b2e6775a0908bcf455c611f433441bb187a1e3 |
| ORCID | 0000-0002-0741-0710 |
| ParticipantIDs | crossref_citationtrail_10_1016_j_neucom_2023_127063 crossref_primary_10_1016_j_neucom_2023_127063 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-02-00 |
| PublicationDateYYYYMMDD | 2024-02-01 |
| PublicationDate_xml | – month: 02 year: 2024 text: 2024-02-00 |
| PublicationDecade | 2020 |
| PublicationTitle | Neurocomputing (Amsterdam) |
| PublicationYear | 2024 |
| References | Popel (10.1016/j.neucom.2023.127063_b30) 2018; 110 Brown (10.1016/j.neucom.2023.127063_b50) 2020 Devlin (10.1016/j.neucom.2023.127063_b4) 2019 Liu (10.1016/j.neucom.2023.127063_b20) 2020; vol. 119 Mahoney (10.1016/j.neucom.2023.127063_b45) 2011 Ke (10.1016/j.neucom.2023.127063_b17) 2021 Junczys-Dowmunt (10.1016/j.neucom.2023.127063_b29) 2016 Huang (10.1016/j.neucom.2023.127063_b13) 2019 Wolf (10.1016/j.neucom.2023.127063_b44) 2020 Al-Natsheh (10.1016/j.neucom.2023.127063_b41) 2017 Raffel (10.1016/j.neucom.2023.127063_b51) 2020; 21 Joshi (10.1016/j.neucom.2023.127063_b55) 2017 Choromanski (10.1016/j.neucom.2023.127063_b26) 2021 Wu (10.1016/j.neucom.2023.127063_b31) 2016 Socher (10.1016/j.neucom.2023.127063_b39) 2013 Islam (10.1016/j.neucom.2023.127063_b2) 2020 Williams (10.1016/j.neucom.2023.127063_b43) 2018 Sennrich (10.1016/j.neucom.2023.127063_b28) 2016 Paperno (10.1016/j.neucom.2023.127063_b53) 2016 Murtadha (10.1016/j.neucom.2023.127063_b3) 2023; 11 Yun (10.1016/j.neucom.2023.127063_b7) 2020 Raffel (10.1016/j.neucom.2023.127063_b16) 2020; 21 Shaw (10.1016/j.neucom.2023.127063_b32) 2018 Papineni (10.1016/j.neucom.2023.127063_b34) 2002 Shaw (10.1016/j.neucom.2023.127063_b12) 2018 Wang (10.1016/j.neucom.2023.127063_b22) 2020 Parikh (10.1016/j.neucom.2023.127063_b11) 2016 Foundation (10.1016/j.neucom.2023.127063_b36) 2021 Shen (10.1016/j.neucom.2023.127063_b24) 2021 Biderman (10.1016/j.neucom.2023.127063_b49) 2021 Dai (10.1016/j.neucom.2023.127063_b14) 2019 Press (10.1016/j.neucom.2023.127063_b60) 2022 Xiao (10.1016/j.neucom.2023.127063_b48) 2019 Liu (10.1016/j.neucom.2023.127063_b59) 2019 Loshchilov (10.1016/j.neucom.2023.127063_b37) 2019 Rajpurkar (10.1016/j.neucom.2023.127063_b40) 2016 Wang (10.1016/j.neucom.2023.127063_b25) 2019 Ott (10.1016/j.neucom.2023.127063_b33) 2019 Clark (10.1016/j.neucom.2023.127063_b9) 2020 Wei (10.1016/j.neucom.2023.127063_b47) 2019 Lan (10.1016/j.neucom.2023.127063_b8) 2020 Gehring (10.1016/j.neucom.2023.127063_b1) 2017; vol. 70 Chen (10.1016/j.neucom.2023.127063_b42) 2018 Zhu (10.1016/j.neucom.2023.127063_b58) 2015 Bojar (10.1016/j.neucom.2023.127063_b27) 2014 Radford (10.1016/j.neucom.2023.127063_b6) 2019 Chen (10.1016/j.neucom.2023.127063_b21) 2018 Zhu (10.1016/j.neucom.2023.127063_b35) 2015 Radford (10.1016/j.neucom.2023.127063_b10) 2018 Katharopoulos (10.1016/j.neucom.2023.127063_b23) 2020; vol. 119 Carion (10.1016/j.neucom.2023.127063_b57) 2020 Huang (10.1016/j.neucom.2023.127063_b19) 2020 Dolan (10.1016/j.neucom.2023.127063_b38) 2005 Zellers (10.1016/j.neucom.2023.127063_b54) 2019 Vaswani (10.1016/j.neucom.2023.127063_b5) 2017 Tian (10.1016/j.neucom.2023.127063_b56) 2022 He (10.1016/j.neucom.2023.127063_b18) 2021 Gao (10.1016/j.neucom.2023.127063_b52) 2021 10.1016/j.neucom.2023.127063_b46 Yang (10.1016/j.neucom.2023.127063_b15) 2019 |
| References_xml | – start-page: 2249 year: 2016 ident: 10.1016/j.neucom.2023.127063_b11 article-title: A decomposable attention model for natural language inference – start-page: 12 year: 2014 ident: 10.1016/j.neucom.2023.127063_b27 article-title: Findings of the 2014 workshop on statistical machine translation – year: 2016 ident: 10.1016/j.neucom.2023.127063_b28 article-title: Neural machine translation of rare words with subword units – year: 2019 ident: 10.1016/j.neucom.2023.127063_b59 – start-page: 3327 year: 2020 ident: 10.1016/j.neucom.2023.127063_b19 article-title: Improve transformer models with better relative position embeddings – start-page: 19 year: 2015 ident: 10.1016/j.neucom.2023.127063_b58 article-title: Aligning books and movies: Towards story-like visual explanations by watching movies and reading books – start-page: 464 year: 2018 ident: 10.1016/j.neucom.2023.127063_b12 article-title: Self-attention with relative position representations – year: 2005 ident: 10.1016/j.neucom.2023.127063_b38 article-title: Automatically constructing a corpus of sentential paraphrases – year: 2011 ident: 10.1016/j.neucom.2023.127063_b45 – start-page: 48 year: 2019 ident: 10.1016/j.neucom.2023.127063_b33 article-title: Fairseq: A fast, extensible toolkit for sequence modeling – year: 2019 ident: 10.1016/j.neucom.2023.127063_b13 article-title: Music transformer: Generating music with long-term structure – start-page: 3530 year: 2021 ident: 10.1016/j.neucom.2023.127063_b24 article-title: Efficient attention: Attention with linear complexities – start-page: 4791 year: 2019 ident: 10.1016/j.neucom.2023.127063_b54 article-title: HellaSwag: Can a machine really finish your sentence? – volume: 11 start-page: 771 issn: 2307-387X year: 2023 ident: 10.1016/j.neucom.2023.127063_b3 article-title: Rank-Aware Negative Training for Semi-Supervised Text Classification publication-title: Transactions of the Association for Computational Linguistics doi: 10.1162/tacl_a_00574 – start-page: 5998 year: 2017 ident: 10.1016/j.neucom.2023.127063_b5 article-title: Attention is all you need – year: 2020 ident: 10.1016/j.neucom.2023.127063_b8 article-title: ALBERT: a lite BERT for self-supervised learning of language representations – year: 2020 ident: 10.1016/j.neucom.2023.127063_b57 – year: 2022 ident: 10.1016/j.neucom.2023.127063_b56 – year: 2021 ident: 10.1016/j.neucom.2023.127063_b26 article-title: Rethinking attention with performers – start-page: 1631 year: 2013 ident: 10.1016/j.neucom.2023.127063_b39 article-title: Recursive deep models for semantic compositionality over a sentiment treebank – volume: 21 start-page: 140:1 year: 2020 ident: 10.1016/j.neucom.2023.127063_b16 article-title: Exploring the limits of transfer learning with a unified text-to-text transformer publication-title: J. Mach. Learn. Res. – start-page: 2978 year: 2019 ident: 10.1016/j.neucom.2023.127063_b14 article-title: Transformer-XL: Attentive language models beyond a fixed-length context – year: 2019 ident: 10.1016/j.neucom.2023.127063_b25 article-title: GLUE: a multi-task benchmark and analysis platform for natural language understanding – start-page: 6572 year: 2018 ident: 10.1016/j.neucom.2023.127063_b21 article-title: Neural ordinary differential equations – year: 2020 ident: 10.1016/j.neucom.2023.127063_b22 article-title: Encoding word order in complex embeddings – start-page: 19 year: 2015 ident: 10.1016/j.neucom.2023.127063_b35 article-title: Aligning books and movies: Towards story-like visual explanations by watching movies and reading books – start-page: 311 year: 2002 ident: 10.1016/j.neucom.2023.127063_b34 article-title: Bleu: a method for automatic evaluation of machine translation – year: 2020 ident: 10.1016/j.neucom.2023.127063_b50 article-title: Language models are few-shot learners – volume: vol. 119 start-page: 5156 year: 2020 ident: 10.1016/j.neucom.2023.127063_b23 article-title: Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention – year: 2021 ident: 10.1016/j.neucom.2023.127063_b49 – year: 2018 ident: 10.1016/j.neucom.2023.127063_b42 – year: 2016 ident: 10.1016/j.neucom.2023.127063_b53 article-title: The LAMBADA dataset: Word prediction requiring a broad discourse context – year: 2019 ident: 10.1016/j.neucom.2023.127063_b48 – start-page: 1112 year: 2018 ident: 10.1016/j.neucom.2023.127063_b43 article-title: A broad-coverage challenge corpus for sentence understanding through inference – year: 2019 ident: 10.1016/j.neucom.2023.127063_b47 – year: 2016 ident: 10.1016/j.neucom.2023.127063_b29 article-title: Is neural machine translation ready for deployment? A case study on 30 translation directions – year: 2021 ident: 10.1016/j.neucom.2023.127063_b18 article-title: Deberta: decoding-enhanced bert with disentangled attention – year: 2021 ident: 10.1016/j.neucom.2023.127063_b17 article-title: Rethinking positional encoding in language pre-training – year: 2019 ident: 10.1016/j.neucom.2023.127063_b37 article-title: Decoupled weight decay regularization – year: 2020 ident: 10.1016/j.neucom.2023.127063_b9 article-title: ELECTRA: pre-training text encoders as discriminators rather than generators – year: 2019 ident: 10.1016/j.neucom.2023.127063_b6 – start-page: 5754 year: 2019 ident: 10.1016/j.neucom.2023.127063_b15 article-title: Xlnet: Generalized autoregressive pretraining for language understanding – year: 2021 ident: 10.1016/j.neucom.2023.127063_b36 – start-page: 38 year: 2020 ident: 10.1016/j.neucom.2023.127063_b44 article-title: Transformers: State-of-the-art natural language processing – year: 2022 ident: 10.1016/j.neucom.2023.127063_b60 article-title: Train short, test long: Attention with linear biases enables input length extrapolation – year: 2016 ident: 10.1016/j.neucom.2023.127063_b31 – start-page: 464 year: 2018 ident: 10.1016/j.neucom.2023.127063_b32 article-title: Self-attention with relative position representations – start-page: 2383 year: 2016 ident: 10.1016/j.neucom.2023.127063_b40 article-title: Squad: 100, 000+ questions for machine comprehension of text – volume: 21 start-page: 140:1 year: 2020 ident: 10.1016/j.neucom.2023.127063_b51 article-title: Exploring the limits of transfer learning with a unified text-to-text transformer publication-title: J. Mach. Learn. Res. – year: 2020 ident: 10.1016/j.neucom.2023.127063_b7 article-title: Are transformers universal approximators of sequence-to-sequence functions? – year: 2021 ident: 10.1016/j.neucom.2023.127063_b52 – volume: vol. 119 start-page: 6327 year: 2020 ident: 10.1016/j.neucom.2023.127063_b20 article-title: Learning to encode position for transformer with continuous dynamical model – year: 2018 ident: 10.1016/j.neucom.2023.127063_b10 – start-page: 4171 year: 2019 ident: 10.1016/j.neucom.2023.127063_b4 article-title: BERT: pre-training of deep bidirectional transformers for language understanding – start-page: 1601 year: 2017 ident: 10.1016/j.neucom.2023.127063_b55 article-title: Triviaqa: A large scale distantly supervised challenge dataset for reading comprehension – volume: vol. 70 start-page: 1243 year: 2017 ident: 10.1016/j.neucom.2023.127063_b1 article-title: Convolutional sequence to sequence learning – volume: 110 start-page: 43 year: 2018 ident: 10.1016/j.neucom.2023.127063_b30 article-title: Training tips for the transformer model publication-title: Prague Bull. Math. Linguist. doi: 10.2478/pralin-2018-0002 – ident: 10.1016/j.neucom.2023.127063_b46 – year: 2020 ident: 10.1016/j.neucom.2023.127063_b2 article-title: How much position information do convolutional neural networks encode? – start-page: 115 year: 2017 ident: 10.1016/j.neucom.2023.127063_b41 article-title: Udl at SemEval-2017 task 1: Semantic textual similarity estimation of english sentence pairs using regression model over pairwise features |
| SSID | ssj0017129 |
| Score | 2.7672222 |
| SourceID | crossref |
| SourceType | Enrichment Source Index Database |
| StartPage | 127063 |
| Title | RoFormer: Enhanced transformer with Rotary Position Embedding |
| Volume | 568 |
| WOSCitedRecordID | wos001128175500001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 issn: 0925-2312 databaseCode: AIEXJ dateStart: 19950101 customDbUrl: isFulltext: true dateEnd: 99991231 titleUrlDefault: https://www.sciencedirect.com omitProxy: false ssIdentifier: ssj0017129 providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9wwELbo0kMvtPQhoA_5wK0yWjsPx72tqkVthRCigLanyE7sLis2i5ak4ud3HD92KRIqBy5RFE2cKN-n8ZfxeAahfUFNYpioiEmKlKRSUaJSaogUUgqa1VL0yZgXR_z4uJhMxInfXXLTtxPgTVPc3orrJ4UargHYduvsI-COg8IFOAfQ4Qiww_G_gD9dHIIO1X37x3EzdSv8bdCneulCr6eL1ubLnficrc_judJ1nMdmoaZTB_Nb3_fBRxRGc1tYobYsihGEn11PBeDZ1WWk2mg6d2FUQLKV9TR6_6Pe-le3WrtyEdipbn4b7Z_vwxAsDZnLq3giywiIxTuuNXMtc7xztIvczpvd89suhDA7aHRnk3hsT_eDlfndMtn_TF8xqTDkq81KN0ppRyndKM_QJuOZKAZoc_R9PPkRF5o4Za4co3_7sLuyTwG8_zZr6mVNhpy9Qlv-_wGPHO7baEM3r9HL0JsDe1f9BkUafMGBBHiNBNiSADsS4EACHEnwFp0fjs--fiO-VQapQKC2hFXcUDYE8akTrjKmQfgXRnP4mVRM55xnciiGhapMmmVVTqlJkwSUsFK04JLq5B0aNItG7yAswDZnWhUmNymjucypZvWwAqEtapnmuygJn6CsfB15287kqnwIgF1E4l3Xro7Kg_Z7j7R_j16sWPkBDdplpz-i59Wf9vJm-cmj_hfBDWwD |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=RoFormer%3A+Enhanced+transformer+with+Rotary+Position+Embedding&rft.jtitle=Neurocomputing+%28Amsterdam%29&rft.au=Su%2C+Jianlin&rft.au=Ahmed%2C+Murtadha&rft.au=Lu%2C+Yu&rft.au=Pan%2C+Shengfeng&rft.date=2024-02-01&rft.issn=0925-2312&rft.volume=568&rft.spage=127063&rft_id=info:doi/10.1016%2Fj.neucom.2023.127063&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_neucom_2023_127063 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0925-2312&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0925-2312&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0925-2312&client=summon |