Residential College | false |
Status | 已發表Published |
A 28-nm 19.9-to-258.5-TOPS/W 8b Digital Computing-in-Memory Processor With Two-Cycle Macro Featuring Winograd-Domain Convolution and Macro-Level Parallel Dual-Side Sparsity | |
Wu, Hao1; Chen, Yong2; Yuan, Yiyang1; Yue, Jinshan1; Wang, Xinghua3; Li, Xiaoran3; Zhang, Feng1 | |
2024 | |
Source Publication | IEEE Journal of Solid-State Circuits |
ISSN | 0018-9200 |
Abstract | Recently, computing in memory (CiM) has been proven to be an energy-efficient and promising architecture for artificial intelligence (AI) algorithms. And yet, current CiM schemes generally suffer from limited throughput compared to their digital counterparts, and the key reason is that the CiM macro calculation must iterate through multiple cycles. Thus, the need to reduce the calculation cycle of the macro while keeping high energy efficiency and the necessity of developing acceleration methods for the universal CiM-based processor have become major issues faced by the current CiM architectures. To surmount these critical problems, we propose a processor based on a two-cycle CiM macro. Our work makes three main contributions: 1) we present a Radix16-based digital-CiM macro with look-up table (LUT) optimization to reduce dynamic power consumption; 2) we devise a hybrid Winograd microarchitecture and dataflow that supports (2, 3) and (4, 3) Winograd convolution, meaning that a good compromise can be reached between the accuracy of the algorithm and the reduction in workload; and 3) we propose a macrolevel parallel dual-side sparse CiM core that uses a horizontal direction compression method to reduce the input cycle of activation data and improve the mapping efficiency of the weight data in the macros. A prototype of the processor is fabricated in a 28-nm CMOS, which achieves a peak system energy efficiency of 19.9–258.5-TOPS/W for a voltage supply of 0.6–1.1 V, and an operating frequency of 78–287 MHz, a 2.55–7.08 $\times$ higher than other state-of-the-art CiM processors. |
Keyword | Accuracy Artificial Intelligence Artificial Intelligence (Ai) Circuits Cmos Computing-in-memory (Cim) Energy Efficiency Energy Efficiency Look-up Table (Lut) Multiply-accumulation (Mac) Neural Network (Nn) Power Demand Radix16 Table Lookup Throughput Unstructured Sparsity Winograd Convolution |
DOI | 10.1109/JSSC.2024.3409356 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Engineering |
WOS Subject | Engineering, Electrical & Electronic |
WOS ID | WOS:001252502500001 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141 |
Scopus ID | 2-s2.0-85196516021 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | INSTITUTE OF MICROELECTRONICS |
Corresponding Author | Chen, Yong |
Affiliation | 1.Institute of Microelectronics, Key Laboratory of Microelectronics Device and Integrated Technology, Chinese Academy of Sciences (CAS), Beijing, China 2.State-Key Laboratory of Analog and Mixed-Signal VLSI and IME/ECE-FST, University of Macau, Macau, China 3.School of Information and Electronics, Beijing Institute of Technology, Beijing, China |
Corresponding Author Affilication | Faculty of Science and Technology |
Recommended Citation GB/T 7714 | Wu, Hao,Chen, Yong,Yuan, Yiyang,et al. A 28-nm 19.9-to-258.5-TOPS/W 8b Digital Computing-in-Memory Processor With Two-Cycle Macro Featuring Winograd-Domain Convolution and Macro-Level Parallel Dual-Side Sparsity[J]. IEEE Journal of Solid-State Circuits, 2024. |
APA | Wu, Hao., Chen, Yong., Yuan, Yiyang., Yue, Jinshan., Wang, Xinghua., Li, Xiaoran., & Zhang, Feng (2024). A 28-nm 19.9-to-258.5-TOPS/W 8b Digital Computing-in-Memory Processor With Two-Cycle Macro Featuring Winograd-Domain Convolution and Macro-Level Parallel Dual-Side Sparsity. IEEE Journal of Solid-State Circuits. |
MLA | Wu, Hao,et al."A 28-nm 19.9-to-258.5-TOPS/W 8b Digital Computing-in-Memory Processor With Two-Cycle Macro Featuring Winograd-Domain Convolution and Macro-Level Parallel Dual-Side Sparsity".IEEE Journal of Solid-State Circuits (2024). |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment