Efficient SRAM-PIM Co-design by Joint Exploration of Value-Level and Bit-Level Sparsity
Processing-in-memory (PIM) architectures mitigate the Von Neumann bottleneck by integrating computation units into memory arrays. Among PIM architectures, digital SRAMPIM has become a prominent approach, directly integrating digital logic within the SRAM array. However, the rigid crossbar architectu...
Saved in:
| Published in: | IEEE transactions on computer-aided design of integrated circuits and systems p. 1 |
|---|---|
| Main Authors: | , , , , , , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
IEEE
2025
|
| Subjects: | |
| ISSN: | 0278-0070, 1937-4151 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | Processing-in-memory (PIM) architectures mitigate the Von Neumann bottleneck by integrating computation units into memory arrays. Among PIM architectures, digital SRAMPIM has become a prominent approach, directly integrating digital logic within the SRAM array. However, the rigid crossbar architecture and full array activation pose challenges in efficiently utilizing value-level sparsity. Moreover, neural network models exhibit a high proportion of zero bits within non-zero values, which remain underutilized due to architectural constraints. To overcome these limitations, we present Dyadic Block PIM (DB-PIM), a groundbreaking algorithm-architecture co-design framework to harness both value-level and bit-level sparsity. At the algorithm level, our hybrid-grained pruning technique, combined with a novel sparsity pattern, enables effective sparsity management. Architecturally, DB-PIM incorporates a sparse network and customized digital SRAM-PIM macros, including input pre-processing unit (IPU), dyadic block multiply units (DBMUs), and Canonical Signed Digit (CSD)-based adder trees. It circumvents structured zero values in weights and bypasses unstructured zero bits within non-zero weights and block-wise all-zero bit columns in input features. As a result, the DBPIM framework skips a majority of unnecessary computations, thereby driving significant gains in computational efficiency. Experimental results demonstrate that our DB-PIM framework achieves up to 8.01× speedup and 85.28% energy savings, significantly boosting computational efficiency in digital SRAMPIM systems. |
|---|---|
| AbstractList | Processing-in-memory (PIM) architectures mitigate the Von Neumann bottleneck by integrating computation units into memory arrays. Among PIM architectures, digital SRAMPIM has become a prominent approach, directly integrating digital logic within the SRAM array. However, the rigid crossbar architecture and full array activation pose challenges in efficiently utilizing value-level sparsity. Moreover, neural network models exhibit a high proportion of zero bits within non-zero values, which remain underutilized due to architectural constraints. To overcome these limitations, we present Dyadic Block PIM (DB-PIM), a groundbreaking algorithm-architecture co-design framework to harness both value-level and bit-level sparsity. At the algorithm level, our hybrid-grained pruning technique, combined with a novel sparsity pattern, enables effective sparsity management. Architecturally, DB-PIM incorporates a sparse network and customized digital SRAM-PIM macros, including input pre-processing unit (IPU), dyadic block multiply units (DBMUs), and Canonical Signed Digit (CSD)-based adder trees. It circumvents structured zero values in weights and bypasses unstructured zero bits within non-zero weights and block-wise all-zero bit columns in input features. As a result, the DBPIM framework skips a majority of unnecessary computations, thereby driving significant gains in computational efficiency. Experimental results demonstrate that our DB-PIM framework achieves up to 8.01× speedup and 85.28% energy savings, significantly boosting computational efficiency in digital SRAMPIM systems. |
| Author | He, Xiaolin Wang, Xueyan Jia, Xiaotao Duan, Cenlin Yang, Jianlei Wang, Yiou Zhao, Weisheng Qi, Yingjie Wang, Yikun Yan, Bonan |
| Author_xml | – sequence: 1 givenname: Cenlin orcidid: 0000-0002-7531-3461 surname: Duan fullname: Duan, Cenlin organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China – sequence: 2 givenname: Jianlei orcidid: 0000-0001-8424-7040 surname: Yang fullname: Yang, Jianlei email: jianlei@buaa.edu.cn organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 3 givenname: Yikun surname: Wang fullname: Wang, Yikun organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 4 givenname: Yiou surname: Wang fullname: Wang, Yiou organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 5 givenname: Yingjie orcidid: 0009-0009-2785-4480 surname: Qi fullname: Qi, Yingjie organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 6 givenname: Xiaolin surname: He fullname: He, Xiaolin organization: School of Computer Science and Engineering, Beihang University, Beijing, China – sequence: 7 givenname: Bonan orcidid: 0000-0002-3052-9330 surname: Yan fullname: Yan, Bonan organization: Institute for Artificial Intelligence, Peking University, Beijing, China – sequence: 8 givenname: Xueyan orcidid: 0000-0003-0080-4730 surname: Wang fullname: Wang, Xueyan organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China – sequence: 9 givenname: Xiaotao orcidid: 0000-0003-2207-6092 surname: Jia fullname: Jia, Xiaotao organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China – sequence: 10 givenname: Weisheng orcidid: 0000-0001-8088-0404 surname: Zhao fullname: Zhao, Weisheng email: weisheng.zhao@buaa.edu.cn organization: School of Integrated Circuit Science and Engineering, Fert Beijing Research Institute, Beihang University, Beijing, China |
| BookMark | eNpFkNFKw0AQRRepYFv9AMGH_YGtM9lsNnmssWolRbFVH8MmmZWVmJRsFPv3prTg0zDMnHvhTNioaRti7BJhhgjJ9Sad384CCNRMqhikVCdsjInUIkSFIzaGQMcCQMMZm3j_CYChCpIxe19Y60pHTc_XL_OVeF6ueNqKirz7aHix44-tG26L323ddqZ3bcNby99M_U0iox-quWkqfuP647bems67fnfOTq2pPV0c55S93i026YPInu6X6TwTJUrViyTGQgUqjqOkDKAAkIA6TiCqSCsMZKWkDlVUEmpLBSjSVahMZI01AxiVcsrwkFt2rfcd2XzbuS_T7XKEfG8m35vJ92byo5mBuTowjoj-_xGGrhjlH5htX4M |
| CODEN | ITCSDI |
| ContentType | Journal Article |
| DBID | 97E RIA RIE AAYXX CITATION |
| DOI | 10.1109/TCAD.2025.3580335 |
| DatabaseName | IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| EISSN | 1937-4151 |
| EndPage | 1 |
| ExternalDocumentID | 10_1109_TCAD_2025_3580335 11037481 |
| Genre | orig-research |
| GrantInformation_xml | – fundername: National Natural Science Foundation of China grantid: Grant No. 62072019 funderid: 10.13039/501100001809 – fundername: National Key Research and Development Program of China grantid: Grant No. 2023YFB4503704; Grant No. 2024YFB4505601 funderid: 10.13039/501100012166 – fundername: Beijing Natural Science Foundation grantid: Grant No. L243031 |
| GroupedDBID | --Z -~X 0R~ 29I 4.4 5GY 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACGFS ACIWK ACNCT AENEX AGQYO AHBIQ AKJIK AKQYR ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS HZ~ IFIPE IPLJI JAVBF LAI M43 O9- OCL P2P PZZ RIA RIE RNS TN5 5VS AAYXX AETIX AGSQL AI. AIBXA ALLEH CITATION EJD H~9 IBMZZ ICLAB IFJZH VH1 VJK |
| ID | FETCH-LOGICAL-c135t-981b5258869c20b0030178906de75123d537456ce17feb05e7d45a6fafa1b56c3 |
| IEDL.DBID | RIE |
| ISSN | 0278-0070 |
| IngestDate | Sat Nov 29 07:49:00 EST 2025 Wed Jun 25 06:00:20 EDT 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c135t-981b5258869c20b0030178906de75123d537456ce17feb05e7d45a6fafa1b56c3 |
| ORCID | 0009-0009-2785-4480 0000-0002-7531-3461 0000-0001-8424-7040 0000-0002-3052-9330 0000-0001-8088-0404 0000-0003-0080-4730 0000-0003-2207-6092 |
| PageCount | 1 |
| ParticipantIDs | ieee_primary_11037481 crossref_primary_10_1109_TCAD_2025_3580335 |
| PublicationCentury | 2000 |
| PublicationDate | 2025-00-00 |
| PublicationDateYYYYMMDD | 2025-01-01 |
| PublicationDate_xml | – year: 2025 text: 2025-00-00 |
| PublicationDecade | 2020 |
| PublicationTitle | IEEE transactions on computer-aided design of integrated circuits and systems |
| PublicationTitleAbbrev | TCAD |
| PublicationYear | 2025 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0014529 |
| Score | 2.4358857 |
| Snippet | Processing-in-memory (PIM) architectures mitigate the Von Neumann bottleneck by integrating computation units into memory arrays. Among PIM architectures,... |
| SourceID | crossref ieee |
| SourceType | Index Database Publisher |
| StartPage | 1 |
| SubjectTerms | Accuracy Algorithm/Architecture Co-design Artificial intelligence Artificial neural networks Computational efficiency Computational modeling Computer architecture Hardware Hybrid-grained Sparsity Integrated circuit modeling Memory management Processing-In-Memory Redundancy SRAM-PIM |
| Title | Efficient SRAM-PIM Co-design by Joint Exploration of Value-Level and Bit-Level Sparsity |
| URI | https://ieeexplore.ieee.org/document/11037481 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Electronic Library (IEL) customDbUrl: eissn: 1937-4151 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0014529 issn: 0278-0070 databaseCode: RIE dateStart: 19820101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1JTwIxFG6UeNCDK0bc0oMnk8J0hrb0iASiRggRVG6TTpeExMwQHEz893YZhYsHb7O9pP1eJ-3bvgfADZWasERliLYNQW0ptP2lCHXeJpLFilIVhWYTbDTqzGZ8XBWr-1oYrbVPPtNNd-lj-aqQK-cqa2Hs2VKssbPNGA3FWr8hAxdB9A4VRxlrF3IVwsQRb03trKwpGJOmC_olvrXbehPa6KriN5XBwT-Hcwj2q9Mj7AZ1H4EtnR-DvQ1OwRPw1vekEFYSTp67QzR-GMJegZTP1IDZF3ws5vZdyL3zaoGFga_ifaXRk8sggiJX8G5eVneThfB5G3XwMuhPe_eo6p6AJE5Iibg9kJKYdDqUyzgKxpOreqVKM7vLJ4rYwROrK8yMziKimWoTQY0wwgpSmZyCWl7k-gxAgSWJM06F4I6xTwimTIwzK0iNtCZYA9z-wJkuAklG6o2LiKcO-9Rhn1bYN0DdQbn-sELx_I_nF2DXiQe3xyWolcuVvgI78rOcfyyv_Rr4Bo57rLc |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NT8IwFH8xaKIe_MSInz14Mims29qxIxIIKBAiqNyWru0SErMRHCb-97bdFC4evO3rLd3vdWnf1-8B3DGhaODJGDM_odgXXOlfijLjbaKxKxmTTtFsIhiNmrNZOC6L1W0tjFLKJp-pujm0sXyZiZVxlTUIsWwp2tjZpr7vOkW51m_QwMQQrUvFkMbqqVwGMYkTNqb6u7Qx6NK6Cft5trnbehna6Ktil5Xu4T8HdAQH5f4RtQqFH8OWSk9gf4NV8BTeOpYWQkuiyXNriMf9IWpnWNpcDRR_ocdsru8V2XdWMShL0Ct_Xyk8MDlEiKcSPczz8myy4DZzowov3c603cNl_wQsiEdzHOotKXVps8lC4TqF-WTqXplUgV7nPUn14KnWFgkSFTtUBdKnnCU84VqQCe8MKmmWqnNAnAjqxiHjPDScfZwHMnFJrAVZIrQRVoP7HzijRUGTEVnzwgkjg31ksI9K7GtQNVCuHyxRvPjj-i3s9qbDQTToj54uYc-8qnCCXEElX67UNeyIz3z-sbyx8-EbvV6v_g |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Efficient+SRAM-PIM+Co-design+by+Joint+Exploration+of+Value-Level+and+Bit-Level+Sparsity&rft.jtitle=IEEE+transactions+on+computer-aided+design+of+integrated+circuits+and+systems&rft.au=Duan%2C+Cenlin&rft.au=Yang%2C+Jianlei&rft.au=Wang%2C+Yikun&rft.au=Wang%2C+Yiou&rft.date=2025&rft.pub=IEEE&rft.issn=0278-0070&rft.spage=1&rft.epage=1&rft_id=info:doi/10.1109%2FTCAD.2025.3580335&rft.externalDocID=11037481 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0278-0070&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0278-0070&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0278-0070&client=summon |