An ASIC Design Methodology with Predictably Low Leakage
Download
Report
Transcript An ASIC Design Methodology with Predictably Low Leakage
Predictably Low-Leakage ASIC
Design using Leakage-immune
Standard Cells
Nikhil Jayakumar
Sunil P. Khatri
University of Colorado at Boulder
Introduction
Process feature sizes / operating voltages are
diminishing relentlessly.
Threshold voltages of the MOS devices reduced
along with operating voltages to satisfy speed
requirements.
Leakage (sub-threshold) currents increase as a
consequence
Low leakage crucial for portable electronics to
ensure long battery life.
Introduction…2
Saturation Current Equation:
Ids = K(W/L)(Vgs –VT)2(1+ Vds) ………….(1)
Sub-threshold Current Equation:
Ids = (W/L)I0e(Vgs-VT-Voff/vT)(1-e(-Vds/VT))….(2)
From equation(1): need to reduce threshold
voltage VT with supply voltage to maintain Ids
From equation(2): decreasing VT increases
leakage current exponentially.
Previous Work
DTMOS: Dynamic Threshold MOS. Device gate
connected to bulk (Assaderaghi et. al.)
Results in high-speed switching and low-leakage
through body effect control.
Drawback:
Applicable only when VDD lower than the
diode turn-on voltage.
• Increased gate capacitance slows the device
down.
• Proposed for partially depleted SOI designs.
• Not easily modified to work for other
processes
Previous work…2
VTCMOS: Variable threshold CMOS (Kuroda et.
al.)
Device VT controlled by dynamically modifying the
device bulk voltage
Drawbacks:
Need complex circuitry to generate and control
the bulk voltages.
Cannot be applied to fully depleted SOI, hard
to apply to partially depleted SOI.
With future processes the body effect coefficient () will reduce
Previous work…3
SCCMOS: Super Cut-Off CMOS (Kawaguchi et. al.)
Gate of PMOS device (which gates the VDD
supply) overdriven during standby operation reduces leakage dramatically.
Drawback:
Complex circuitry required to generate the
special voltages.
Previous Work…4
MTCMOS: Multi-threshold CMOS. (Kao et.al.)
”Power switches” (high VT MOS devices) added between
the supplies and the power pins of the circuit.
Delay increased (controlled by sizing power switches
appropriately). Sizes of power switches for individual
logic cells is large.
Device sizing algorithm (based on mutually exclusive
discharging of gates) can be used for groups of cells to
reduce the size of power switches.
Drawbacks:
Device sizing algorithm works well for regular logic.
Leakage current unpredictable since internal nodes float
during standby operation
Memory elements need separate supplies.
Our Approach
Ensure that supply voltage applied across more
than one device and at least one of them is high
VT.
Ensure that output of each cell is either logic-0 or
logic-1 in standby.
No floating internal nodes.
Allows precise estimation of circuit leakage.
If input vector to gate (in standby) is known:
We know which stack (pull-up / pull-down) is
leaking.
Only one power switch device (PMOS or NMOS)
required.
Our Approach…2
So we need two variants of each gate – the “H”
and “L” versions
“H” cell: Inputs (in standby mode) such that
output is logic-1
Leakage is in pull-down stack
Minimize leakage by gating GND supply with high VT
NMOS device
“L” cell: Inputs (in standby mode) such that output
is logic-0
Leakage is in pull-up stack
Minimize leakage by gating the VDD supply with a
high VT PMOS device
Example – NAND3
NAND3 (H, L variants)
Layout Floor plan
standby
standby
Regular Cell
L variant
H variant
Routing standby signals done automatically (by
abutment).
H,L use unmodified cell core from regular cell
Minimizes re-design effort
Sample layout (NAND3-L)
standby
GND rail
VDD rail
standby
Process parameters and
sizing
Used bsim100 predictive 0.1um model cards for
our experiments
SPICE and MAGIC used for cell design and layout
For MTCMOS and H/L gates, the supply gating
transistors sized such that delay penalty less than
15% (over the unmodified cell)
Up-sizing transistors inside cell core can result in
smaller delay and area penalties.
We did not modify cell core
Design Methodology
Design flow using H/L cells very similar to
traditional standard cell based flow:
Optimize and map to standard cell library (SIS).
Given primary input assignment in standby mode:
Simulate circuit, find output value of each gate.
Replace with H / L variant of the gate.
Decision made in time linear in size of circuit.
Regular cells from UCBerkeley.
Gates used:
INVA, INVB, NAND2A, NAND2B, NAND3, NOR2,
NOR3, NOR4, AND2, AND3, AND4, OR2, OR3, OR4,
AOI21, AOI22, OAI21, OAI22.
SPICE3f5 – simulate delay and leakage.
MAGIC – to implement layout of H/L variants
Leakage Comparison
(HL / MTCMOS / Regular)
Leakage: HL,MTCMOS vs Regular
Leakage: HL vs MTCMOS
At cell level, HL and MTCMOS leakage are
comparably low
Circuit Leakage
(Estimate vs SPICE)
At circuit level, HL leakage is precisely estimable
This is a key contribution
MTCMOS leakage is very unpredictable (due to
floating nodes in standby)
Circuit Leakage
(HL /MTCMOS)
Design mapped for minimum area
Design mapped for minimum delay
Note the large circuit leakage range for MTCMOS
Circuit leakage (HL) is a single deterministic value
Circuit leakage (HL) smaller than worst case
MTCMOS circuit leakage.
Circuit Delay, Area
Comparison
Delay:
Performed “Exact Timing Analysis” to obtain largest
sensitizable delay for circuit.
Area:
Place / Route using CADENCE Silicon Ensemble.
Used 4 routing layers.
MTCMOS: header and footer device areas added to
regular layout area.
Tested on 24 circuits from MCNC91 benchmark
suite.
Delay comparison
Circuits mapped for
minimum delay (SIS)
Similar results if
circuits mapped for
minimum area (see
paper)
HL delay less than
MTCMOS delay
Only 1 transition
slower in HL (both
in MTCMOS)
Area Comparison (area mapped)
Conclusions
Advantages:
Internal nodes of a gate never float.
Leakage precisely estimable, unlike MTCMOS
Delay increase only for one transition.
We use only one supply gating device.
MTCMOS requires un-gated supply lines for memory elements.
We do not need separate supply lines
We use flip-flop design of Mutoh et. al.
MTCMOS & HL leakage dramatically lower than regular designs
HL leakage lower than worst case MTCMOS leakage.
HL cell layout easily done
Header, footer regions free for over-the cell routing.
Disadvantages:
Determination of optimal primary input vector for minimal leakage
is a complex problem.
Can be solved using an ADD framework.
Thank you!