ABSTRACT
Subthreshold circuit design has become a popular approach for building energy efficient digital circuits. One drawback is performance degradation due to the exponentially reduced driving current. This had limited subthreshold circuits to relatively low performance applications such as sensor networks. To retain the excellent energy efficiency while reducing performance loss, we propose to apply subthreshold and near-threshold techniques to chip multi-processors. We show that an architecture where several slower cores are clustered together with a shared faster L1 cache is optimal for energy efficiency, because processor cores and memory operate best at different supply and threshold voltages. In particular, SPLASH2 benchmarks show about a 53% energy improvement over the traditional CMP approach (about 70% over a single core machine).
- B. Zhai, L. Nazhandali, et al., "A 2.60pJ/Inst Subthreshold Sensor Processor for Optimal Energy Efficiency", IEEE VLSI Technology and Circuits, 2006Google Scholar
- A. Wang, A. Chandrakasan, "A 180mV FFT processor using subthreshold circuits techniques", IEEE ISSCC 2004Google Scholar
- B. Zhai, D. Blaauw, et al., "Theoretical and practical limits of dynamic voltage scaling", DAC 2004 Google ScholarDigital Library
- B. Calhoun, A. Chandrakasan, "Characterizing and modeling minimum energy operation for subthreshold circuits," ISLPED 2004 Google ScholarDigital Library
- S. C. Woo, M. Ohara, et. al. "The SPLASH-2 Programs: Characterization and Methodological Considerations", ACM ISCA, 1995. Google ScholarDigital Library
- B. Zhai, D. Blaauw, et al., "The Limit of Dynamic Voltage Scaling and Insomniac Dynamic Voltage Scaling", IEEE TVLSI, Nov 2004 Google ScholarDigital Library
- T. Sakurai and A. Newton, "Alpha-power law MOSFET model and its applications to CMOS inverter delay and other formulas," IEEE JSSC, vol. 25, no. 2, pp. 584--594, Apr. 1990.Google ScholarCross Ref
- http://www.arm.com/products/CPUsGoogle Scholar
- R. A. Hankins, T. A. Diep, et al., "Scaling and Characterizing Database Workloads: Bridging the Gap between Research and Practice", IEEE/ACM MICRO 2003. Google ScholarDigital Library
- B. Zhai, S. Hanson, et al., "Analysis and Mitigation of Variability in Subthreshold Design", IEEE ISLPED, 2005 Google ScholarDigital Library
- B. Calhoun and A. Chandrakasan, "A 256kb Sub-threshold SRAM in 65nm CMOS", IEEE ISSCC, 2006Google ScholarCross Ref
- N. Verma, A. Chandrakasan, "A 65nm 8T Sub-Vt SRAM Employing Sense-Amplifier Redundancy", IEEE ISSCC, 2007Google Scholar
- T-H. Kim, J. Liu, et al., "A High-Density Subthreshold SRAM with Data-Independent Bitline Leakage and Virtual-Ground Replica Scheme", IEEE ISSCC, 2007Google Scholar
- B. Zhai, D. Blaauw, et al., "A Sub-200mV 6T SRAM in 0.13um CMOS", IEEE ISSCC, 2007Google Scholar
- M. J. M. Pelgrom, et al., "Matching properties of MOS transistors," IEEE JSSC, vol. 24, no. 5, pp. 1433--1440, 1989.Google ScholarCross Ref
- N. L. Binkert, R. G. Dreslinski, et al., "The M5 Simulator: Modeling Networked Systems.", IEEE Micro, pp. 52--60, 2006 Google ScholarDigital Library
- N. S. Kim, K. Flautner, et al. "Single-Vdd and Single-Vt Super-Drowsy Techniques for Low-Leakage High-Performance Instruction Caches", IEEE/ACM ISLPED, 2004. Google ScholarDigital Library
Index Terms
- Energy efficient near-threshold chip multi-processing
Recommendations
Asymmetric Underlapped FinFETs for Near- and Super-Threshold Logic at Sub-10nm Technology Nodes
Special Issue on Nanoelectronic Circuit and System Design Methods for the Mobile Computing Era and Regular PapersExtending double-gate FinFET scaling to sub-10nm technology regime requires device-engineering techniques for countering the rise of direct source to drain tunneling (DSDT), edge direct tunneling (EDT) and short channel effects (SCE) that degrade FinFET ...
Full-VDD and near-threshold performance of 8T FinFET SRAM cells
We evaluate full-VDD and near-threshold operation of nine novel eight-transistor (8T) FinFET SRAM cell schemes using shorted gate (SG) and low power FinFET configurations for 32-bit by 1024-word SRAMs. 8T SRAM schemes outperform six-transistor schemes ...
A near-threshold 7T SRAM cell with high write and read margins and low write time for sub-20nm FinFET technologies
In this paper, a 7T SRAM cell with differential write and single ended read operations working in the near-threshold region is proposed. The structure is based on modifying a recently proposed 5T cell which uses high and low VTH transistors to improve ...
Comments