Efficient SRAM-PIM Co-design by Joint Exploration of Value-Level and Bit-Level Sparsity
| Published in: | IEEE transactions on computer-aided design of integrated circuits and systems, p. 1 |
|---|---|
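| Main authors: | Duan, Cenlin; Yang, Jianlei; Wang, Yikun; Wang, Yiou; Qi, Yingjie; He, Xiaolin; Yan, Bonan; Wang, Xueyan; Jia, Xiaotao; Zhao, Weisheng |
| Format: | Journal Article |
| Language: | English |
| Published: | IEEE, 2025 |
| Subjects: | |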
| ISSN: | 0278-0070, 1937-4151 |
| Online access: | Full text |
| Abstract | Processing-in-memory (PIM) architectures mitigate the von Neumann bottleneck by integrating computation units into memory arrays. Among PIM architectures, digital SRAM-PIM has become a prominent approach, directly integrating digital logic within the SRAM array. However, the rigid crossbar architecture and full array activation pose challenges in efficiently utilizing value-level sparsity. Moreover, neural network models exhibit a high proportion of zero bits within non-zero values, which remain underutilized due to architectural constraints. To overcome these limitations, we present Dyadic Block PIM (DB-PIM), a groundbreaking algorithm-architecture co-design framework to harness both value-level and bit-level sparsity. At the algorithm level, our hybrid-grained pruning technique, combined with a novel sparsity pattern, enables effective sparsity management. Architecturally, DB-PIM incorporates a sparse network and customized digital SRAM-PIM macros, including an input pre-processing unit (IPU), dyadic block multiply units (DBMUs), and Canonical Signed Digit (CSD)-based adder trees. It circumvents structured zero values in weights and bypasses unstructured zero bits within non-zero weights and block-wise all-zero bit columns in input features. As a result, the DB-PIM framework skips a majority of unnecessary computations, thereby driving significant gains in computational efficiency. Experimental results demonstrate that our DB-PIM framework achieves up to 8.01× speedup and 85.28% energy savings, significantly boosting computational efficiency in digital SRAM-PIM systems. |
|---|---|
| Author | He, Xiaolin; Wang, Xueyan; Jia, Xiaotao; Duan, Cenlin; Yang, Jianlei; Wang, Yiou; Zhao, Weisheng; Qi, Yingjie; Wang, Yikun; Yan, Bonan |
| Author_xml | – sequence: 1 givenname: Cenlin orcidid: 0000-0002-7531-3461 surname: Duan fullname: Duan, Cenlin organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China – sequence: 2 givenname: Jianlei orcidid: 0000-0001-8424-7040 surname: Yang fullname: Yang, Jianlei email: jianlei@buaa.edu.cn organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 3 givenname: Yikun surname: Wang fullname: Wang, Yikun organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 4 givenname: Yiou surname: Wang fullname: Wang, Yiou organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 5 givenname: Yingjie orcidid: 0009-0009-2785-4480 surname: Qi fullname: Qi, Yingjie organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 6 givenname: Xiaolin surname: He fullname: He, Xiaolin organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 7 givenname: Bonan orcidid: 0000-0002-3052-9330 surname: Yan fullname: Yan, Bonan organization: Institute for Artificial Intelligence, Peking University, Beijing, China – sequence: 8 givenname: Xueyan orcidid: 0000-0003-0080-4730 surname: Wang fullname: Wang, Xueyan organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China – sequence: 9 givenname: Xiaotao orcidid: 0000-0003-2207-6092 surname: Jia fullname: Jia, Xiaotao organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China – sequence: 10 givenname: Weisheng orcidid: 0000-0001-8088-0404 surname: Zhao fullname: Zhao, Weisheng email: weisheng.zhao@buaa.edu.cn organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China |
| CODEN | ITCSDI |
| ContentType | Journal Article |
| DBID | 97E RIA RIE AAYXX CITATION |
| DOI | 10.1109/TCAD.2025.3580335 |
| DatabaseName | IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| EISSN | 1937-4151 |
| EndPage | 1 |
| ExternalDocumentID | 10_1109_TCAD_2025_3580335 11037481 |
| Genre | orig-research |
| GrantInformation_xml | – fundername: National Natural Science Foundation of China grantid: Grant No. 62072019 funderid: 10.13039/501100001809 – fundername: National Key Research and Development Program of China grantid: Grant No. 2023YFB4503704; Grant No. 2024YFB4505601 funderid: 10.13039/501100012166 – fundername: Beijing Natural Science Foundation grantid: Grant No. L243031 |
| IEDL.DBID | RIE |
| ISSN | 0278-0070 |
| IngestDate | Sat Nov 29 07:49:00 EST 2025 Wed Jun 25 06:00:20 EDT 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| LinkModel | DirectLink |
| ORCID | 0009-0009-2785-4480 0000-0002-7531-3461 0000-0001-8424-7040 0000-0002-3052-9330 0000-0001-8088-0404 0000-0003-0080-4730 0000-0003-2207-6092 |
| PageCount | 1 |
| ParticipantIDs | ieee_primary_11037481 crossref_primary_10_1109_TCAD_2025_3580335 |
| PublicationCentury | 2000 |
| PublicationDate | 2025-00-00 |
| PublicationDateYYYYMMDD | 2025-01-01 |
| PublicationDate_xml | – year: 2025 text: 2025-00-00 |
| PublicationDecade | 2020 |
| PublicationTitle | IEEE transactions on computer-aided design of integrated circuits and systems |
| PublicationTitleAbbrev | TCAD |
| PublicationYear | 2025 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0014529 |
| Score | 2.4358857 |
| SourceID | crossref ieee |
| SourceType | Index Database Publisher |
| StartPage | 1 |
| SubjectTerms | Accuracy; Algorithm/Architecture Co-design; Artificial intelligence; Artificial neural networks; Computational efficiency; Computational modeling; Computer architecture; Hardware; Hybrid-grained Sparsity; Integrated circuit modeling; Memory management; Processing-In-Memory; Redundancy; SRAM-PIM |
| Title | Efficient SRAM-PIM Co-design by Joint Exploration of Value-Level and Bit-Level Sparsity |
| URI | https://ieeexplore.ieee.org/document/11037481 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Xplore customDbUrl: eissn: 1937-4151 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0014529 issn: 0278-0070 databaseCode: RIE dateStart: 19820101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE |
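
The abstract refers to bit-level sparsity and to Canonical Signed Digit (CSD)-based adder trees. As a rough illustration of why CSD is relevant to bit-level sparsity, the sketch below recodes an integer into signed digits (-1, 0, +1) and counts the non-zero digits. This is the standard textbook CSD recoding, not the DB-PIM hardware design, which this record does not describe; the function names `to_csd` and `from_csd` are hypothetical and chosen only for this example.

```python
def to_csd(value: int) -> list[int]:
    """Recode an integer into CSD digits (-1, 0, +1), least-significant first.

    Standard CSD recoding; an illustrative sketch, not the paper's method.
    """
    digits = []
    while value != 0:
        if value & 1:
            # value mod 4 == 1 -> digit +1; value mod 4 == 3 -> digit -1.
            # Either choice leaves a remainder divisible by 4, so the next
            # digit is forced to 0 (no two adjacent non-zero digits).
            d = 2 - (value & 3)
            value -= d
        else:
            d = 0
        digits.append(d)
        value >>= 1
    return digits


def from_csd(digits: list[int]) -> int:
    """Reconstruct the integer from CSD digits (least-significant first)."""
    return sum(d * (1 << i) for i, d in enumerate(digits))


if __name__ == "__main__":
    for w in (7, 119, -1):
        csd = to_csd(w)
        nonzero = sum(1 for d in csd if d != 0)
        print(w, csd, "non-zero digits:", nonzero)
```

For a weight such as 7 (binary 111, three non-zero bits), the recoding yields 8 - 1, which has two non-zero digits. Fewer non-zero digits means fewer partial products to accumulate, which is the general motivation for pairing CSD with bit-level sparsity.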