2102-282 Digital Electronics - IC Design & Application Research Lab.

Download Report

Transcript 2102-282 Digital Electronics - IC Design & Application Research Lab.

Chapter 6
Static CMOS Circuits
Boonchuay Supmonchai
Integrated Design Application Research (IDAR) Laboratory
August , 2004; Revised - June 28, 2005
B.Supmonchai
Goals of This Chapter

In-depth discussion of CMOS logic families
 Static and Dynamic
 Pass-Transistor
 Nonratioed and Ratioed Logic

Optimizing gate metrics
 Area, Speed, Energy or Robustness

High Performance circuit-design techniques
2102-545 Digital ICs
Static CMOS Circuits
2
B.Supmonchai
Combinational vs. Sequential Logic
In
Combinational
Logic
Circuit
Out
Combinational
Logic
Circuit
In
Out
State
Combinational
Sequential
Output = f(In)
Output = f(In, Previous In)
2102-545 Digital ICs
Static CMOS Circuits
3
B.Supmonchai
Static CMOS Circuits

At every point in time (except during the switching
transients) each gate output is connected to either
VDD or VSS via a low-resistance path.

The outputs of the gates assume at all times the
value of the Boolean function, implemented by the
circuit (ignoring, once again, the transient effects
during switching periods)

This is in contrast to the dynamic circuit class, which
relies on temporary storage of signal values on the
capacitance of high impedance circuit nodes
2102-545 Digital ICs
Static CMOS Circuits
4
B.Supmonchai
Static Complementary CMOS
VDD
In1
In2
PUN
InN
In1
In2
Pull-Up Network: make a connection
from VDD to F when F(In1,In2,…InN) = 1
PMOS transistors only
F(In1,In2,…InN)
PDN
NMOS transistors only
InN
Pull-Down Network: make a connection
from F to GND when F(In1,In2,…InN) = 0
PUN and PDN are dual logic networks
2102-545 Digital ICs
Static CMOS Circuits
5
B.Supmonchai
Threshold Drops
VDD
PUN
VDD
S
D
VDD
D
0  VDD
VGS
S
CL
CL
VDD  0
PDN
D
VDD
VDD  |VTp|
VGS
CL
S
2102-545 Digital ICs
0  VDD - VTn
S
CL
D
Static CMOS Circuits
6
B.Supmonchai
Construction of PDN

Transistors can be thought as a switch controlled by its
gate signal

NMOS switch closes when switch control input is high
A•B
A+B
A
A
B
B
Series = NAND
Parallel = NOR
NMOS Transistors pass a “strong” 0 but a “weak” 1
2102-545 Digital ICs
Static CMOS Circuits
7
B.Supmonchai
Construction of PUN

PMOS switch closes when switch control input is low.
A•B=A+B
A+B=A•B
A
A
B
B
Series = NOR
Parallel = NAND
PMOS Transistors pass a “strong” 0 but a “weak” 1
2102-545 Digital ICs
Static CMOS Circuits
8
B.Supmonchai
Duality of PUN and PDN

PUN and PDN are dual networks
 De Morgan’s theorems
A+B=A•B
[!(A + B) = !A • !B or !(A | B) = !A & !B]
A•B=A+B
[!(A • B) = !A + !B or !(A & B) = !A | !B]
 A parallel connection of transistors in the PUN
corresponds to a series connection of the PDN

Complementary gate is naturally inverting
(NAND, NOR, AOI, OAI)

Number of transistors for an N-input logic gate is 2N
2102-545 Digital ICs
Static CMOS Circuits
9
B.Supmonchai
Example: CMOS NAND gate
VDD
A
A
B
B
A•B
F
A
B
F
0
0
1
0
1
1
1
0
1
1
1
0
A
B
PDN: G = A · B
Conduction to GND
PUN: F = A + B
Conduction to VDD
G(In1, In2, …, InN) = F(In1, In2, …, InN)
2102-545 Digital ICs
Static CMOS Circuits
10
B.Supmonchai
Example: CMOS NOR gate
VDD
A
B
B
A
F
A
B
F
0
0
1
0
1
0
1
0
0
1
1
0
A+B
A
2102-545 Digital ICs
B
Static CMOS Circuits
11
B.Supmonchai
Complex CMOS Gate
B
A
C
D
OUT = D + A • (B + C)
A
Derive PUN hierarchically
by identifying sub-nets
D
B
2102-545 Digital ICs
C
Static CMOS Circuits
12
B.Supmonchai
Cell Design: An Introduction

Standard Cells
 A general purpose logic
 Synthesizable
 Same height but varying width

Datapath Cells
 For regular, structured designs (arithmetic)
 Including some wiring in the cell
 Fixed height and width
2102-545 Digital ICs
Static CMOS Circuits
13
B.Supmonchai
Example of A Standard Cell
N Well
Cell height 12 metal tracks
Metal track is approx. 3 + 3
Pitch = repetitive distance
between objects
VDD
Minimum-Size
Inverter
Cell height is “12 pitch”
2
Cell boundary
2102-545 Digital ICs
In
Out
GND
Static CMOS Circuits
Rails ~10
14
B.Supmonchai
Standard Cell Layout Methodology – 1980s
Routing channel
VDD
signals
GND
Routing channel
2102-545 Digital ICs
What logic function is this?
Static CMOS Circuits
15
B.Supmonchai
Standard Cell Layout Methodology – 1990s
Mirrored Cell
No Routing
channels
VDD
VDD
M2
M3
GND
GND
Mirrored Cell
2102-545 Digital ICs
Static CMOS Circuits
16
B.Supmonchai
Stick Diagrams
Contains no dimensions
Represents relative positions of transistors
VDD
VDD
Out
Out
In
GND
GND
Inverter
2102-545 Digital ICs
A B
NAND2
Static CMOS Circuits
17
B.Supmonchai
OAI21 Logic Graph
Node of the
circuit
A
j
X
PUN
C
C
B
X = C • (A + B)
C
A
i
X
i
B
Transition
Control
B
VDD
j
A
PDN
GND
A
B
C
2102-545 Digital ICs
Static CMOS Circuits
18
B.Supmonchai
Two Stick Diagrams of C • (A + B)
A
C
B
A
B
C
VDD
VDD
X
X
GND
GND
Crossover can be eliminated by re-ordering inputs
2102-545 Digital ICs
Static CMOS Circuits
19
B.Supmonchai
Consistent Euler Path

An uninterrupted diffusion strip is possible only if there
exists an Euler path in the logic graph
X
C
X
i
B
VDD
j
GND
Euler path: a path through all
nodes in the graph such that
each edge is visited once and
only once.
A
A B C
 For
a single poly strip for every input signal, the Euler
paths in the PUN and PDN must be consistent (the same)
2102-545 Digital ICs
Static CMOS Circuits
20
B.Supmonchai
OAI22 Logic Graph
X
A
B
PUN
C
D
D
X = (A+B)•(C+D)
C
C
VDD
X
B
D
A
PDN
A
B
GND
A
B
C
D
2102-545 Digital ICs
Static CMOS Circuits
21
B.Supmonchai
OAI22 Layout
A
B
D
C
VDD
X
GND

Some functions have no consistent Euler path like
x = !(a + bc + de) (but x = !(bc + a + de) does!)
2102-545 Digital ICs
Static CMOS Circuits
22
B.Supmonchai
XNOR/XOR Implementation
XNOR
A
XOR
AB
A
B
B
A
B
A
B
AB


AB
AB
How many transistor in each?
Can you create the stick diagrams for the lower left circuit?
2102-545 Digital ICs
Static CMOS Circuits
23
B.Supmonchai
VTC is Data-Dependent
0.5/0.25 NMOS
0.75 /0.25 PMOS
3
A
M3
B
M4
2
A,B: 0 -> 1
B=1, A:0 -> 1
A=1, B:0->1
F= A • B
D
A
VGS2 = VA –VDS1
B
VGS1 = VB
M2
S
D
M1
S
weaker
PUN
1
Cint
0
0
1
2
VTC Characteristics are dependent upon the data input patterns
applied to the gate (so the noise margins are also data dependent!)
2102-545 Digital ICs
Static CMOS Circuits
24
B.Supmonchai
Observation I

The difference between the blue and the orange
lines results from the state of internal node int
between the two NMOS Devices.

The threshold voltage of M2 is higher than M1
due to the body effect (),
 VSB of M2 is not zero (when VB = 0) due to the
presence of Cint
VTn1 = VTn0
2102-545 Digital ICs
and
VTn2 = VTn0 + ((|2F| + Vint) - |2F|)
Static CMOS Circuits
25
B.Supmonchai
Review: CMOS Inverter - Dynamic
VDD
tpHL = f(Rn, CL)
Vout
Rn
CL
tpHL = 0.69 Reqn CL
tpHL = 0.69 (3/4 (CL VDD)/ IDSATn )
= 0.52 CL / (W/Ln k’n VDSATn )
Vin = V DD
propagation delay is determined by the time to
charge and discharge the load capacitor CL
2102-545 Digital ICs
Static CMOS Circuits
26
B.Supmonchai
Review: Designing for Performance

Reduce CL

Increase W/L ratio of the transistor
 the most powerful and effective performance optimization tool
 watch out for self-loading!

Increase VDD
 only minimal improvement in performance at the cost of
increased energy dissipation

Slope engineering - keeping signal rise and fall times
smaller than or equal to the gate propagation delays and
of approximately equal values
 good for performance and power consumption
2102-545 Digital ICs
Static CMOS Circuits
27
B.Supmonchai
Switch Delay Model
Req
A
Rp
A
A
Rp
Rp
B
Rn
B
CL
A
Rn
B
NAND
2102-545 Digital ICs
Rp
Rp
A
A
Cint
Rn
Cint
CL
A
INVERTER
Static CMOS Circuits
Rn
Rn
A
B
CL
NOR
28
B.Supmonchai
Input Pattern Effects on Delay
Rp
A
Rp

Delay is dependent on the pattern of inputs

Low to high transition
 both inputs go low
B
Rn
 delay is 0.69 Rp/2 CL since two p-resistors are
on in parallel
CL
 one input goes low
A
Rn
 delay is 0.69 Rp CL

Cint
 both inputs go high
B
NAND
2102-545 Digital ICs
High to low transition
 delay is 0.69 2Rn CL

Adding transistors in series (without sizing)
slows down the circuit
Static CMOS Circuits
29
B.Supmonchai
Delay Dependence on Input Patterns
Input Data
Delay
Pattern
(psec)
A=B=01
67
A=1, B=01
64
A= 01, B=1
61
A=B=10
45
A=1, B=10
80
A= 10, B=1
81
3
A=B=10
Voltage [V]
2.5
2
A=1 0, B=1
1.5
1
A=1, B=10
0.5
0
0
100
200
300
400
-0.5
time [ps]
NMOS = 0.5m/0.25 m, PMOS = 0.75m/0.25 m, CL = 100 fF
2102-545 Digital ICs
Static CMOS Circuits
30
B.Supmonchai
Transistor Sizing Basic

Inverter as a reference circuit
Device Transconductance
VDD
(See Supplement 1)
kn = kn’(W/L)n = (µnox/tox)((W/L)n
kp = kp’(W/L)p = (µpox/tox)((W/L)p
2
(W/L)p
In
Out
1
(W/L)n
For rise time equal to fall time,
CL
kp = kn (Rp = Rn)
Because µp ~ µn /2
(W/L)p = 2 (W/L)n
The size of PMOS must be twice as large as that of NMOS
2102-545 Digital ICs
Static CMOS Circuits
31
B.Supmonchai
Transistor Sizing: NAND and NOR
Symmetric Response RPUN = RPDN
Rp
2 A
Rp
B
Rp
4
2
RPUN = Rp + Rp
Rn
CL
B
Rp
4
2 A
Cint
A
RPDN = Rn + Rn
2
Rn
Cint
1
B
NAND
2102-545 Digital ICs
Rp  1/(W/L)p
Rn  1/(W/L)n
Static CMOS Circuits
Rn
Rn
A
B
CL
1
NOR
32
B.Supmonchai
Note on Transistor Sizing

By assuming RPUN = RPDN, we ignores the extra
diffusion capacitance introduced by widening
the transistors.

In DSM, even larger increases in the width are
needed due to velocity saturation.
 For 2-input NANDs, the NMOS transistors should
be made 2.5 times as wide.

NAND implementation is clearly preferred
over a NOR implementation, since a PMOS
stack series is slower than an NMOS stack due
to lower carrier mobility
2102-545 Digital ICs
Static CMOS Circuits
33
B.Supmonchai
Transistor Sizing: Complex CMOS Gate

A
B
8 12
C
8 12
 Follow short path first; note PMOS
for C and B,4 rather than 3 (average
in pull-up chain of three =
(4+4+2)/3)
46
D
 Also note structure of pull-up and
46
A
D
Red sizing assuming RPUN = RPDN
pull-down to minimize diffusion
cap. at output (e.g., single PMOS
drain connected to output)
2
1

B
2C
2
Green for symmetric response and
for performance (where Rp = 3Rn)
 Sizing rules of thumb: PMOS = 3
* NMOS
2102-545 Digital ICs
Static CMOS Circuits
34
B.Supmonchai
Fan-In Considerations
A
B
C
D
A
B
CL
C3
C
C2
Distributed RC model
(Elmore delay)
D
C1
tpHL = 0.69 Reqn(C1+2C2+3C3+4CL)
Propagation delay deteriorates rapidly as a function of
fan-in – quadratically in the worst case.
2102-545 Digital ICs
Static CMOS Circuits
35
B.Supmonchai
Notes on Fan-In Considerations

While output capacitance makes full swing transition from
VDD to 0, internal nodes only swing from VDD-VTn to GND

C1, C2, and C3, each includes junction capacitance as well
as the gate-to-source and gate-to-drain capacitances
(turned into capacitances to ground using the Miller effect)
 For W/L of 0.5/0.25 NMOS and 0.375/0.25 PMOS, values are on
the order of 0.85 fF

CL = 3.47 fF with NO output load (all of diffusion
capacitance = intrinsic capacitance of the gate itself).

tpHL = 85 ps (simulated as 86 ps).
 The simulated worst case low-to-high delay was 106 ps.
2102-545 Digital ICs
Static CMOS Circuits
36
B.Supmonchai
tp as a Function of Fan-In
1250
quadratic
tp (psec)
1000
750
tpHL
500
tp
tpLH
250
linear
0
2
4
6
8
10
12
14
16
fan-in
Gates with a fan-in greater than 4 should be avoided.
2102-545 Digital ICs
Static CMOS Circuits
37
B.Supmonchai
tp as a Function of Fan-Out
All gates have the same drive current.
1200
tpNOR2
1000
tp (psec)
tpNAND2
800
tpINV
600
400
200
0
2
4
6
8
10
12
14
16
eff. fan-out
Slope is a function of “driving strength”
2102-545 Digital ICs
Static CMOS Circuits
38
B.Supmonchai
tp as a Function of Fan-In and Fan-Out

Fan-in: quadratic due to increasing resistance
and capacitance

Fan-out: each additional fan-out gate adds two
gate capacitances to CL
tp = a1FI + a2FI2 + a3FO
Parallel Chain
2102-545 Digital ICs
Serial Chain
Static CMOS Circuits
39
B.Supmonchai
Fast Complex Gates: Design Technique 1

Transistor sizing
 as long as fan-out capacitance dominates

Progressive sizing
Distributed RC line
InN
MN
In3
M3
C3
In2
M2
C2
In1
M1
C1
2102-545 Digital ICs
CL
M1 > M2 > M3 > … > MN
(the FET closest to the output
should be the smallest)
Can reduce delay by more
than 20%; decreasing gains
as technology shrinks
Static CMOS Circuits
40
B.Supmonchai
Notes on Design Technique 1

With transistor sizing, if the load capacitance is dominated
by the intrinsic capacitance of the gate, widening the device
only creates a “self loading” effect and the propagation
delay is unaffected (and may even become worse).

For progressive sizing, M1 have to carry the discharge
current from M2 (C1), M3 (C2), … MN and CL so make it
the largest.
 MN only has to discharge the current from MN (CL)(no internal
capacitances).

While progressive sizing is easy in a schematic, in a real
layout it may not pay off due to design-rule considerations
that force the designer to push the transistors apart
 increasing internal capacitance.
2102-545 Digital ICs
Static CMOS Circuits
41
B.Supmonchai
Fast Complex Gates: Design Technique 2

Input re-ordering
 when not all inputs arrive at the same time
critical path
In3
1
In2 1
In1
01
M3
critical path
CL charged
01
In1
M3
charged
CL
M2
C2 charged
In2 1
M2
C2 discharged
M1
C1 charged
In3 1
M1
C1 discharged
delay determined by time
to discharge CL, C1 and C2
delay determined by time
to discharge CL
Place latest arriving signal (critical path)
closest to the output can result in a speed up.
2102-545 Digital ICs
Static CMOS Circuits
42
B.Supmonchai
Example: Sizing and Ordering Effects
A
3 B
3 C
A
44
B
45
3 D
3
CL = 100 fF
C3
C
46
C2
D
47
C1
Progressive sizing in pull-down
chain gives up to a 23%
improvement.
Input ordering saves 5%
critical path A – 23%
critical path D – 17%
2102-545 Digital ICs
Static CMOS Circuits
43
B.Supmonchai
Fast Complex Gates: Design Technique 3

Alternative logic structures
 Reduced fan-in results in deeper logic depth
F = ABCDEFGH
Reduction in fan-in offsets, by far, the
extra delay incurred by the NOR gate.
2102-545 Digital ICs
Static CMOS Circuits
44
B.Supmonchai
Notes on Design Technique 3

Reducing fan-in increases logic depth of the
circuit
 More stages but each stage has smaller delay

Only simulation will tell which of the two
alternative configurations is faster and has lower
power dissipation.
2102-545 Digital ICs
Static CMOS Circuits
45
B.Supmonchai
Fast Complex Gates: Design Technique 4

Isolating fan-in from fan-out using buffer
insertion
 Optimizing the propagation delay of a gate in isolation
is misguided.
CL
CL
Reduce CL on large fan-in gates, especially for large CL,
and size the inverters progressively to handle the CL
more effectively
2102-545 Digital ICs
Static CMOS Circuits
46
B.Supmonchai
Fast Complex Gates: Design Technique 5

Reducing the voltage swing
tpHL = 0.69 (3/4 (CL VDD)/ IDSATn )
= 0.69 (3/4 (CL Vswing)/ IDSATn )
 linear reduction in delay
 also reduces power consumption


But the following gate is much slower!
Or requires the use of “sense amplifiers” on the
receiving end to restore the signal level (memory
design)
2102-545 Digital ICs
Static CMOS Circuits
47
B.Supmonchai
Sizing Logic Paths for Speed

Frequently, input capacitance of a logic path is
constrained

Logic also has to drive some capacitance
 Example: ALU load in an Intel’s microprocessor is
0.5pF

How do we size the ALU data path to achieve
maximum speed?

We have already solved this for the inverter chain
– can we generalize it for any type of logic?
2102-545 Digital ICs
Static CMOS Circuits
48
B.Supmonchai
Inverter Chain: Recap
In
1
1
f
f N-1
2
N
Out
CL
N
For given N: Ci+1/Ci = Ci/Ci-1 = f = (CL/Cin)

For optimum performance, we try to keep f ~ 4,
which give us the number of stages, N.

Can the same approach (logical effort) be used
for any combinational circuit?
2102-545 Digital ICs
Static CMOS Circuits
49

B.Supmonchai
Delay of a Complex Logic Gate

For a complex gate, we expand the inverter chain equation
 C 
 f 
ext
t p  t p 0 
1   C 
 t p 0 1  



g 
 g  f 
t p  t p 0 p 




 tp0 is the intrinsic delay of an inverter
 f is the effective fan-out
(Cext/Cg) - also called the
electrical effort
 p is the ratio of the intrinsic (unloaded) delay of the
complex gate and a simple inverter (a function of the
gate topology and layout style)
 g is the logical effort
2102-545 Digital ICs
Static CMOS Circuits
50
B.Supmonchai
Notes on Delay of a Logic Gate

Logical effort first defined by Sutherland and
Sproull in 1999.

In a simpler format,
Gate delay:
D=h+p
Effort delay
Effort delay:
h=gf
Logical
Effort
2102-545 Digital ICs
Intrinsic delay
Electrical
= Cout/Cin
Effort
(effective fan-out)
Static CMOS Circuits
51
B.Supmonchai
Intrinsic Delay Term, p

The more involved the structure of the complex
gate, the higher the intrinsic delay compared to
an inverter
Gate Type
p
Inverter
1
n-input NAND
n
n-input NOR
n
n-way mux
2n
XOR, XNOR
n 2n-1
Ignoring second order effects such as internal node capacitances
2102-545 Digital ICs
Static CMOS Circuits
52
B.Supmonchai
Logical Effort Term, g

Logical effort of a gate, g, presents the ratio of its input
capacitance to the inverter capacitance when sized to
deliver the same current
 g represents the fact that, for a given load, complex gates have
to work harder than an inverter to produce a similar (speed)
response
Gate Type
2
3
4
NAND
4/3
5/3
(n+2)/3
NOR
5/3
7/3
(2n+1)/3
Mux
2
2
2
XOR
4
12
Inverter
2102-545 Digital ICs
g (for 1 to 4 input gates)
1
1
Static CMOS Circuits
53
B.Supmonchai
Notes on Logical Effort

Inverter has the smallest logical effort and intrinsic delay
of all static CMOS gates

Logical effort of a gate tells how much worse it is at
producing an output current than an inverter (how much
more input capacitance a gate presents to deliver same
output current)

Logical effort is a function of topology, independent of
sizing
 Logical effort increases with the gate complexity

Electrical effort (Effective fanout) is a function of
load/gate size
2102-545 Digital ICs
Static CMOS Circuits
54
B.Supmonchai
Example of Logical Effort

Assuming a PMOS/NMOS ratio of 2, the input
capacitance of a minimum-sized inverter is three times
the gate capacitance of a minimum-sized NMOS (Cunit)
A
A
A
2 B
2
2
1
A
B
4
A
4
A•B
A
2
B
2
A+B
A
1
B
1
Cunit = 3
Cunit = 4
2102-545 Digital ICs
Static CMOS Circuits
Cunit = 5
55
B.Supmonchai
Delay as a Function of Fan-Out

The slope of the line is
the logical effort of the
gate

The y-axis intercept is
the intrinsic delay

Can adjust the delay by
adjusting the effective
fan-out (by sizing) or
by choosing a gate with
a different logical effort
Static CMOS Circuits
56
normalized delay
7
6
5
4
3
effort delay
2
1
intrinsic delay
0
0
1
2
3
4
5
fan-out f
Gate Effort: h = fg
2102-545 Digital ICs
B.Supmonchai
Path Delay: Complex Logic Gate Network

Total path delay through a combinational logic block
tp =  tp,j = tp0 (pj + (fj gj)/ )

So, the minimum delay through the path determines that
each stage should bear the same gate effort
f1g1 = f2g2 = . . . = fNgN

Consider optimizing the delay through the logic network
1
a
b
c
CL 5
how do we determine a, b, and c sizes?
2102-545 Digital ICs
Static CMOS Circuits
57
B.Supmonchai
Path Delay Equation Derivation

The path logical effort, G =  gi

And the path effective fan-out (path electrical effort) is
F = CL/g1

The branching effort accounts for fan-out to other
gates in the network
b = (Con-path + Coff-path)/Con-path

The path branching effort is then B =  bi

The total path effort is then H = GFB

So, the minimum delay through the path is
N
D = tp0 ( pj + (N H)/ )
2102-545 Digital ICs
Static CMOS Circuits
58
B.Supmonchai
Example: Complex Logic Gates

For gate i in the chain, its size is determined by
i -1
si = (g1 s1)/gi  (fj/bj)
j=1

For this network
1
 F = CL/Cg1 = 5
a
b
c
CL 5
 G = 1 x 5/3 x 5/3 x 1 = 25/9
 B = 1 (no branching)
4
 H = GFB = 125/9, so the optimal stage effort is H = 1.93
 Fan-out factors are f1=1.93, f2=1.93 x 3/5 = 1.16, f3 = 1.16, f4 = 1.93
 So the gate sizes are a = f1g1/g2 = 1.16, b = f1f2g1/g3 = 1.34 and
c = f1f2f3g1/g4 = 2.60
2102-545 Digital ICs
Static CMOS Circuits
59
B.Supmonchai
Example – 8-input AND
2102-545 Digital ICs
Static CMOS Circuits
60
B.Supmonchai
Summary: Method of Logical Effort

Compute the path effort: F = GBH

Find the best number of stages N ~ log4F

Compute the stage effort f = F1/N

Sketch the path with this number of stages

Work either from either end, find sizes:
Cin = Cout*g/f
Reference: Sutherland, Sproull, Harris, “Logical Effort, Morgan-Kaufmann 1999.
2102-545 Digital ICs
Static CMOS Circuits
61
B.Supmonchai
Summary: Key Definitions
Sutherland,
Sproull
Harris
2102-545 Digital ICs
Static CMOS Circuits
62
B.Supmonchai
Ratioed Logic
Goal: To Reduce the number of devices over complementary CMOS
VDD

• N transistors + Load
 VOH = VDD
•V =V
Resistive
Load
OH
RL

• VOL =
F
In1
In2
In3

DD
VOL = RPDN / (RPDN + RL)
RPN
Asymmetrical
Response
RPN + RL
 Static
Power
• Assymetrical
response
PDN
consumption Plow
• Static power consumption
VSS
2102-545 Digital ICs
N transistors + Load
tpL R=LC0.69
• tpL
= 0.69
L
Static CMOS Circuits
RLCL
63
B.Supmonchai
Ratioed Logic: Active Loads
VVDD
DD
Depletion
Depletion
Load
Load
VVDD
DD
PMOS
PMOS
Load
Load
VVTT<<00
VVSSSS
FF
In11
In
In22
In
In33
In
PDN
PDN
FF
InIn11
InIn22
InIn33
VVSSSS
VVSSSS
depletionload
loadNMOS
NMOS
depletion
2102-545 Digital ICs
PDN
PDN
pseudo-NMOS
pseudo-NMOS
Static CMOS Circuits
64
B.Supmonchai
Pseudo NMOS NAND and NOR
F
F
A
CL
A
B
B
C
D
CL
C
NOR
D

NAND
Psedo-NMOS is useful when area is
most important
 Reduce transistor counts
 Used occasionally for large fan-in gates
2102-545 Digital ICs
Static CMOS Circuits
65
B.Supmonchai
Pseudo-NMOS Inverter Characteristics

Assumptions:
 NMOS resides in linear mode
 VOL is small relative to the gate drive (VDD-VT))
2
2 



VDSATp
VOL
kn VDD VTn VOL 
 0
 k p VDD  VTp  VDSATp 
2 
2 


VOL 

k p VDD  VTp  VDSATp
Plow  VDD Ilow

2102-545 Digital ICs
kn VDD VTn 
nW p

 VDSATp
pW n
2


VDSATp
 VDD  k p VDD  VTp  VDSATp 

2 

Static CMOS Circuits
66
B.Supmonchai
Pseudo-NMOS Inverter VTC
3.0
NMOS size = 0.5 µm/0.25 µm
2.5
Vout (V)
Size
W/Lp = 4
2.0
1.5
VOL
Pstat
tpLH
(V)
(µW)
(ps)
4
0.693
564
14
2
0.273
298
56
1
0.133
160
123
0.5
0.064
80
268
0.25
0.031
41
569
W/Lp = 2
1.0
W/Lp = 0.5
W/Lp = 1
0.5
W/Lp = 0.25
0.0
0.0
0.5
1.0
1.5
2.0
2.5
Vin (V)
Larger pull-up device not only improves performance (delay)
but also increases power dissipation and lowers noise margins
by increasing VOL
2102-545 Digital ICs
Static CMOS Circuits
67
B.Supmonchai
Improved Loads: Adaptive Load

VDD
The idea is to reduce
static power consumption by adjusting the
M1 >> M2
load

Enable
M1
Enable
M1 >> M2
M2
 Load M2 when there are
F
A
B
C
D
CL
 Switch to Load M1
when all inputs are
active (thus require high
amount of current to
drive)
Adaptive Load
2102-545 Digital ICs
not too many inputs
(A, B, C, or D) active
Static CMOS Circuits
68
B.Supmonchai
Improved Loads: DCVSL
VDD
VDD
M1
M2
Out
A
A
B
B
Out
PDN1
PDN2
VSS
VSS
Differential Cascode Voltage Switch Logic (DCVSL)
2102-545 Digital ICs
Static CMOS Circuits
69
B.Supmonchai
DCVSL Example
Out
Out
B
B
A
B
B
A
XOR-NXOR gate
2102-545 Digital ICs
Static CMOS Circuits
70
B.Supmonchai
DCVSL Characteristics

Dual Rail Logic
 Each input is provided in complementary format and each gate
produces complementary output
 Increasing complexity

Rail-to-Rail Swing

No static power dissipation

Sizing of the PMOS relative to PDN is critical to
functionality, not just performance
 PDNs must be strong enough to bring outputs below
VDD - |VTp|
2102-545 Digital ICs
Static CMOS Circuits
71
B.Supmonchai
DCVSL Transient Response
V olta ge [V]
2.5
tin->out = 197 ps
AB
1.5
0.5
-0.5 0
AB
A,B
0.2
tin->out = 321 ps
A,B
0.4
0.6
Time [ns]
0.8
1.0
Transient Response of a 2-input AND/NAND gate. How does it look like?
2102-545 Digital ICs
Static CMOS Circuits
72
B.Supmonchai
NMOS Transistors in Series/Parallel

Primary inputs drive both gate and source/drain terminals

NMOS switch closes when the gate input is high
A
B
X
Y
X = Y if A and B
A
B
X
X = Y if A or B
Y
Remember - NMOS transistors pass a strong 0 but a weak 1
2102-545 Digital ICs
Static CMOS Circuits
73
B.Supmonchai
PMOS Transistors in Series/Parallel

Primary inputs drive both gate and source/drain terminals

PMOS switch closes when the gate input is low
A
B
X
Y
X = Y if A and B = A + B
A
X = Y if A or B = A  B
B
X
Y
Remember - PMOS transistors pass a strong 1 but a weak 0
2102-545 Digital ICs
Static CMOS Circuits
74
B.Supmonchai
VTC of PT AND Gate
B
1.5/0.25
A
1
0.5/0.25
0

B = VDD, A = 0VDD
Vout, (V)
0.5/0.25
2
B
0.5/0.25
A = VDD, B = 0VDD
A = B = 0VDD
F= AB
0
0
1
2
Pure PT logic is not regenerative - the signal
gradually degrades after passing through a number
of PTs (can fix with static CMOS inverter insertion)
2102-545 Digital ICs
Static CMOS Circuits
76
B.Supmonchai
Complementary PT Logic (CPL)
A
A
B
B
A
A
B
B
B
PT Network
Inverse
PT Network
B
B
F
F
F
F
B
B
A
A
A
B
F=A·B B
F=A+B A
A
A
A
F=A·B
B
AND/NAND
2102-545 Digital ICs
F=A+B
B
OR/NOR
Static CMOS Circuits
B
F=AB
F=AB
A
XOR/XNOR
77
B.Supmonchai
CPL Properties

Differential, so complementary data inputs and outputs are
always available (don’t need extra inverters)

Static, since the output defining nodes are always tied to
VDD or GND through a low resistance path

Design is modular; all gates use the same topology, only
the inputs are permuted.

Simple XOR makes it attractive for structures like adders

Fast! (assuming number of transistors in series is small)

Additional routing overhead for complementary signals

Still have static power dissipation problems
2102-545 Digital ICs
Static CMOS Circuits
78
B.Supmonchai
NMOS-Only PT Driving an Inverter
In = VDD
VGS
A = VDD
D
Vx
M2
Out
S
M1
B
Vx does not pull up to VDD, but VDD – VTn

Threshold voltage drop causes static power consumption
(M2 may be weakly conducting forming a path from
VDD to GND)

Notice VTn increases of pass transistor due to body effect
(VSB)
2102-545 Digital ICs
Static CMOS Circuits
79
B.Supmonchai
Voltage Swing of PT Driving an Inverter
3
In = 0  VDD
1.5/0.25
S
VDD
D
X
Out
0.5/0.25
B
0.5/0.25
Voltage (V)
In
2
X = 1.8V
1
Out
0
0
0.5
1
1.5
2
Time (ns)


Body effect – large VSB at X - when pulling high (B is tied
to GND and S charged up close to VDD)
So the voltage drop is even worse
Vx = VDD - (VTn0 + ((|2f| + Vx) - |2f|))
2102-545 Digital ICs
Static CMOS Circuits
80
B.Supmonchai
Cascaded NMOS-Only PTs
A = VDD
B=
VDD
M
1
G
S
x = VDD - VTn1
M1
C = VDD
x
y
Out
M2
G
y
C=
VDD
A = VDD
B=
VDD
M2
Out
S
Swing on y = VDD - VTn1 - VTn2
Swing on y = VDD - VTn1

Pass transistor gates should never be cascaded as on
the left

Logic on the right suffers from static power
dissipation and reduced noise margins
2102-545 Digital ICs
Static CMOS Circuits
81
B.Supmonchai
Solution 1: Level Restorer
Level
Restorer

Full swing on x (due to
Level Restorer) so no static
power consumption by
inverter

No static backward current
path through Level Restorer
and PT since Restorer is
only active when A is high

For correct operation Mr
must be sized correctly
(ratioed)
Mr
B
M2
x
A
Out
Mn
M1
A
B
X
Out
Mr
0
0VDD
0
VDD
OFF
VDD
0VDD
VDD
0
ON
2102-545 Digital ICs
Static CMOS Circuits
82
B.Supmonchai
Transient Level Restorer Circuit Response
W/Ln=0.50/0.25, W/L1=0.50/0.25, W/L2=1.50/0.25
Voltage (V)
3
2
W/Lr=1.75/0.25
node x never goes below
VM of inverter so output
never switches
W/Lr=1.50/0.25
1
W/Lr=1.0/0.25
W/Lr=1.25/0.25
0
0
100
200
300
400
500
Time (ps)

Restorer has speed and power impacts:
 Increases the capacitance at x, slowing down the gate
 Increases tr (but decreases tf)
2102-545 Digital ICs
Static CMOS Circuits
83
B.Supmonchai
Notes on Level Restorer

Pull down must be stronger than restorer (pull up)
to switch node X

If resistance of restorer transistor is too small (too
wide transistor) it is impossible to bring the
voltage at node X below the switching threshold of
the inverter, and the inverter never switches!

Sizing of Mr is critical for DC functionality, not
just performance!!

It belongs to Dynamic Logic Family
2102-545 Digital ICs
Static CMOS Circuits
84
B.Supmonchai
Solution 2: Multiple VT Transistors

Technology solution: Use (near) zero VT devices for
the NMOS PTs to eliminate most of the threshold drop
(body effect still in force preventing full swing to VDD)
In2 = 0V
low VT
transistors
A = 2.5V
on
Out
off but
leaking
In1 = 2.5V
B = 0V
sneak
path
2102-545 Digital ICs
Watch out for subthreshold
current flowing through PTs
Static CMOS Circuits
85
B.Supmonchai
Solution 3: Transmission Gates (TGs)

Most widely used solution
C
C
A
B
A
C
C = GND
A = VDD
C
C = GND
B
A = GND
B
C = VDD
C = VDD

B
Full swing bidirectional switch controlled by the gate
signal C, A = B if C = 1
2102-545 Digital ICs
Static CMOS Circuits
86
B.Supmonchai
Resistance of TG
W/Lp=0.50/0.25
0V
Resistance (k)
30
Rn
25
Rp
Rp
20
Req = Rp || Rn
2.5V
Vout
15
Rn
10
2.5V
Req
5
W/Ln=0.50/0.25
0
0
1
2

TG is not an ideal switch - series resistance

Req is relatively constant ( about 8kohms in this case),
so can assume has a constant resistance
2102-545 Digital ICs
Static CMOS Circuits
87
B.Supmonchai
TG Multiplexer
S
S
S
S
S
F
VDD
In2
F
S
In1
S
F = !(In1  S + In2  S)
GND
In1
2102-545 Digital ICs
Static CMOS Circuits
In2
88
B.Supmonchai
Transmission Gate XOR
off
AB
A
off
6 Transistors
B

F always has a connection to VDD or GND - not dynamic

No voltage drop
2102-545 Digital ICs
Static CMOS Circuits
89
B.Supmonchai
Transmission Gate XOR
VDD (B)
off
A
F = A‘
off
1
B

0 (B’)
When B = 1, the circuit behaves as if it is an inverter,
hence F(B = 1) = A’
2102-545 Digital ICs
Static CMOS Circuits
90
B.Supmonchai
Transmission Gate XOR
0 (B)
on
A
when A = 0
weak 0
F=A
on
weak 1
when A = 1
0
B

VDD (B’)
When B = 0, the circuit acts as a transmission gate,
hence F(B = 0) = A (TG ensures no voltage drop)
2102-545 Digital ICs
Static CMOS Circuits
91
B.Supmonchai
Transmission Gate XOR
A‘ B + A B’
A
B

Combine the results using Shannon’s expansion theorem,
F = B·F(B = 1) + B’·F(B = 0) = A’B+AB’ = A  B
2102-545 Digital ICs
Static CMOS Circuits
92
B.Supmonchai
TG Full Adder
Cin
B
A
Sum
Cout
• 16 Transistors, no more than 2 PTs in series
• Full swing
• Similar delay for Sum and Carry
2102-545 Digital ICs
Static CMOS Circuits
93
B.Supmonchai
Delay of a TG Chain
0
0
Vin
V1
5 C
Vin
Req
C

0
Vi
5 C
V1
0
Req
Vi+1
5C
Vi Req
C
VN
5C
Vi+1
C
Req
VN
C
Delay of the RC chain (N TG’s in series) is
tp(Vn) = 0.69 kCReq = 0.69 CReq (N(N+1))/2  0.35 CReqN2
2102-545 Digital ICs
Static CMOS Circuits
95
B.Supmonchai
Notes on TG Chain Delay


Delay grows quadratically in N (in this case in the
number of TGs in series) and increases rapidly with the
number of switches in the chain.
E.g., for 16 cascaded minimum-sized TG’s, each with an
Req of 8kohms.
 The node capacitance is the sum of the capacitances of two
NMOS and PMOS devices (junctions and drains).
 Capacitance values is approx. 3.6 fF for low to high transitions.

The delay through the chain is
 tp = 0.69 CReq(N(N+1))/2 = 0.69 x 3.6fF x 8kΩ x (16x17)/2
= 2.7ns
2102-545 Digital ICs
Static CMOS Circuits
96
B.Supmonchai
TG Delay Optimization

Can speed it up by inserting buffers every M switches
0
0
0
0
0
0
VN
Vin
5C
5C
5C
5C
5C
5C
M

Delay of buffered chain (M TG’s between buffer)
tp = 0.69 N/M CReq (M(M+1))/2 + (N/M - 1) tpbuf
Mopt = 1.7  (tpbuf/CReq )  3 or 4
2102-545 Digital ICs
Static CMOS Circuits
97
B.Supmonchai
Notes on Delay Optimization

Buffered chain is now linear in N
 Quadratic in M but M should be small

This buffer insertion technique works to speed up the
delay down long wires as well.

Consider 16TG chain example. Buffers = inverters
(making sure correct polarity is output).
 For 0.5micron/0.25micron NMOSs and PMOSs in the TGs,
 simulated delay with 2TG per buffer is 154 ps,
 for 3TGs is 154ps, and for 4TG is 164ps.
 The insertion of buffering inverters reduces the delay by a
factor of almost 2.
2102-545 Digital ICs
Static CMOS Circuits
98