Transcript Chapter 6
Digital Integrated
Circuits
A Design Perspective
Jan M. Rabaey
Anantha Chandrakasan
Borivoje Nikolić
Designing Combinational
Logic Circuits
EE141 Integrated
© Digital
Circuits2nd
1
Combinational Circuits
Combinational vs. Sequential Logic
Combinational
Logic
Circuit
In
In
Out
Out
Combinational
Logic
Circuit
State
Combinational
Output = f(In)
EE141 Integrated
© Digital
Circuits2nd
Sequential
Output = f(In, Previous In)
2
Combinational Circuits
Static CMOS Circuit
At every point in time (except during the switching
transients) each gate output is connected to either
VDD or Vss via a low-resistive path.
The outputs of the gates assume at all times the value
of the Boolean function, implemented by the circuit
(ignoring, once again, the transient effects during
switching periods).
This is in contrast to the dynamic circuit class, which
relies on temporary storage of signal values on the
capacitance of high impedance circuit nodes.
EE141 Integrated
© Digital
Circuits2nd
3
Combinational Circuits
Static Complementary MOS
VDD
In1
In2
PUN
InN
In1
In2
PMOS only (make a connection from
Vdd to F when F(In1,…InN)=1
F(In1,In2,…InN)
PDN
NMOS only (make a connection from
Gnd to F when F(In1,…InN)=0
InN
The PUN and PDN are structured in a mutually exclusive fashion
such that only one of the them is conducting in steady-state
EE141 Integrated
© Digital
Circuits2nd
4
Combinational Circuits
Static Complementary CMOS
Functionally, a transistor can be thought of as a switch. PDN is
on when input signal is high and off when low. PUN is on when
signal is low and off when high.
PDN network is constructed using NMOS while PUN using
PMOS. The primary reason for this is that NMOS produce “strong
zeros” while PMOS generates “strong ones”, why?
EE141 Integrated
© Digital
Circuits2nd
5
Combinational Circuits
Threshold Drops
VDD
PUN
VDD
S
D
VDD
D
0 VDD
VGS
S
CL
VDD 0
PDN
D
VDD
S
CL
CL
VGS
VDD |VTp|
S
EE141 Integrated
© Digital
CL
D
That is why PMOS is used in PUN, and NMOS in PDN
Circuits2nd
0 VDD - VTn
6
Combinational Circuits
NMOS Transistors
in Series/Parallel Connection
Transistors can be thought of as a switch controlled by its gate signal
NMOS switch closes when switch control input is high
A
B
X
X: GND
Y
Y = X if A and B
NAND
Y = X if A OR B
NOR
A
Y: output
X
B
Y
NMOS Transistors pass a “strong” 0 but a “weak” 1
EE141 Integrated
© Digital
Circuits2nd
7
Combinational Circuits
PMOS Transistors
in Series/Parallel Connection
PMOS switch closes when switch control input is low
A
B
X
X: VDD
Y
Y = X if A AND B = A + B
NOR
A
Y: output
X
B
Y
Y = X if A OR B = AB
NAND
PMOS Transistors pass a “strong” 1 but a “weak” 0
EE141 Integrated
© Digital
Circuits2nd
8
Combinational Circuits
Complementary CMOS Logic Style
• Number of transistors required to implement an Ninput logic gate is 2N
EE141 Integrated
© Digital
Circuits2nd
9
Combinational Circuits
Example Gate: NAND
EE141 Integrated
© Digital
Circuits2nd
10
Combinational Circuits
Example Gate: NOR
EE141 Integrated
© Digital
Circuits2nd
11
Combinational Circuits
Complex CMOS Gate
B
A
C
D
OUT = D + A • (B + C)
A
D
B
EE141 Integrated
© Digital
Circuits2nd
C
12
Combinational Circuits
Constructing a Complex Gate
VDD
VDD
C
F
SN4
F
SN1
A
SN3
D
B
C
B
SN2
A
D
A
B
D
C
F
(a) pull-down network
(b) Deriving the pull-up network
hierarchically by identifying
sub-nets
A
D
B
C
(c) complete gate
EE141 Integrated
© Digital
Circuits2nd
13
Combinational Circuits
OAI22 Logic Graph
A
C
B
D
XOR, XNOR ?
X = (A+B)•(C+D)
C
D
A
B
EE141 Integrated
© Digital
Circuits2nd
A
B
C
D
14
Combinational Circuits
Properties of Complementary CMOS Gates
Snapshot
• High noise margins
:
VOH and VOL are at VDD and GND, respectively.
• No static power consumption
:
There never exists a direct path between VDD and
VSS (GND) in steady-state mode.
• Comparable rise and fall times:
(under appropriate sizing conditions)
EE141 Integrated
© Digital
Circuits2nd
15
Combinational Circuits
Complementary MOS Properties
Full rail-to-rail swing; high noise margins
Logic levels not dependent upon the relative
device sizes; ratioless
Always a path to Vdd or Gnd in steady state;
low output impedance
Extremely high input resistance; nearly zero
steady-state input current
No direct path steady state between power
and ground; no static power dissipation
Propagation delay as function of load
capacitance and resistance of transistors
EE141 Integrated
© Digital
Circuits2nd
16
Combinational Circuits
Voltage Transfer Characteristics
Multi-dimensional
plot (can be obtained
using DC sweep analysis)
EE141 Integrated
© Digital
Circuits2nd
17
Combinational Circuits
Delay: Switch Delay Model
Req
A
A
Rp
A
Rp
Rp
B
Rn
Rp
CL
Rn
A
Cint
INV
NAND2
Circuits2nd
Cint
A
A
EE141 Integrated
© Digital
Rp
A
B
Rn
B
CL
Rn
Rn
A
B
CL
NOR2
18
Combinational Circuits
Input Pattern Effects on Delay
Delay is dependent on
the pattern of inputs
Low to high transition
Rp
A
Rp
B
Rn
both inputs go low
– delay is 0.69 Rp/2 CL
CL
one input goes low
B
Rn
– delay is 0.69 Rp CL
Cint
A
High to low transition
both inputs go high
– delay is 0.69 2Rn CL
EE141 Integrated
© Digital
Circuits2nd
19
Combinational Circuits
Delay Dependence on Input Patterns
3
Input Data
Pattern
Delay
(psec)
A=B=01
67
A=1, B=01
60
A= 01, B=1
64
0.5
A=B=10
45
0
A=1, B=10
80
A= 10, B=1
81
A=B=10
2.5
Voltage [V]
2
A=1 0, B=1
1.5
A=1, B=10
1
-0.5
0
100
200
time [ps]
300
400
NMOS = 0.5m/0.25 m
PMOS = 0.75m/0.25 m
CL = 100 fF
The difference between the later two cases of H-L has to
do with the internal node capacitance charging
20
EE141 Integrated Circuits2nd
© Digital
Combinational Circuits
Transistor Sizing
The goal is to size the gate so that it has approximately
the same delay (mostly worst-case delay) as an minimumsize inverter (9λ/2λ,3λ/2λ) Assumes Rp = Rn
Rp
1
A
2
Rp
1
B
B
Rn
2
CL
A
Cint
Rp
Cint
A
B
2 R
n
Rp
2
1 Rn
Rn
A
B
1
First-order estimate neglecting velocity saturation effects
(smaller for stacked transistor) and self-loading
EE141 Integrated
© Digital
Circuits2nd
CL
21
Combinational Circuits
EE141 Integrated
© Digital
Circuits2nd
22
Combinational Circuits
Transistor Sizing a Complex
CMOS Gate
Note: the number for
PMOS is with respect
to PMOS counterpart
3
(27λ/2λ) in minimum size
inverter, and NMOS to
3
NMOS counterpart
B
A
3
C
D
3
OUT = D + A • (B + C)
A
D
EE141 Integrated
© Digital
(6λ/2λ)
1
B
Circuits2nd
2
2C
2
23
Combinational Circuits
Fan-In Considerations
A
B
C
D
A
CL
B
C3
C
C2
D
C1
C1? C2? C3? CL?
C1: CdbD, CsbC, 2CgdD, 2CgsC
EE141 Integrated
© Digital
Circuits2nd
Distributed RC model
(Elmore delay)
tpHL = 0.69 Reqn(C1+2C2+3C3+4CL)
Propagation delay deteriorates
rapidly as a function of fan-in –
quadratically in the worst case.
24
Combinational Circuits
The Elmore Delay
RC Chain
EE141 Integrated
© Digital
Circuits2nd
25
Combinational Circuits
tp as a Function of Fan-In for NAND
1250
quadratic
1000
tp (psec)
750
tpHL
500
Assumes fixed
fan-out
tp
250
tpLH linear
0
2
4
6
8
fan-in
10
12
14
16
Gates with a fan-in greater than 4 should be avoided.
EE141 Integrated
© Digital
Circuits2nd
26
Combinational Circuits
tp as a Function of Fan-Out
tpNAND2
tp (psec)
tpNOR2
2
All gates are
scaled using the
switched delay
model. (why
delay of NOR2
and NAND2 still
lager?)
Intrinsic cap.
tpINV
4
6
8
10
12
14
16
eff. fan-out
EE141 Integrated
© Digital
Circuits2nd
27
Combinational Circuits
tp as a Function of Fan-In and Fan-Out
Fan-in:
quadratic due to increasing
resistance and capacitance
Fan-out:
each additional fan-out gate
adds two gate capacitances to CL
tp = a1FI + a2FI2 + a3FO
EE141 Integrated
© Digital
Circuits2nd
28
Combinational Circuits
Fast Complex Gates:
Design Technique 1
Transistor
sizing
as long as fan-out capacitance dominates
Progressive
InN
sizing (non-uniform sizing)
CL
MN
In3
M3
C3
In2
M2
C2
In1
M1
C1
EE141 Integrated
© Digital
Circuits2nd
Distributed RC line
M1 > M2 > M3 > … > MN
(the transistor closest to the
output is the smallest)
Can reduce delay by more than
20%; decreasing gains as
technology shrinks (due to layout)
29
Combinational Circuits
Fast Complex Gates:
Design Technique 2
An
input signal is called critical if it is
the last signal of all inputs to assume a
stable value
The path through the logic which
determines the ultimate speed of the
structure is called the critical path
Putting the critical path transistors
closer to the output of the gate can
result in a speed up
EE141 Integrated
© Digital
Circuits2nd
30
Combinational Circuits
Fast Complex Gates:
Design Technique 2
Transistor
ordering
critical path
In3
1
In2
1
charged
CL
M3
M2
In1
M1
01
C2 charged
C1 charged
delay determined by time to
discharge CL, C1 and C2
EE141 Integrated
© Digital
Circuits2nd
critical path
01
In1
M3
CL
In2 1 M2
C2 discharged
In3 1 M1
C1 discharged
charged
delay determined by time to
discharge CL
31
Combinational Circuits
Fast Complex Gates:
Design Technique 3
Alternative
logic structures
F = ABCDEFGH
EE141 Integrated
© Digital
Circuits2nd
32
Combinational Circuits
Fast Complex Gates:
Design Technique 4
Isolating
fan-in from fan-out using buffer
insertion (inverter chains)
CL
EE141 Integrated
© Digital
Circuits2nd
CL
33
Combinational Circuits
Sizing Logic Paths for Speed
Frequently, input capacitance of a logic path is constrained
Logic also has to drive some capacitance
Example: ALU load in an Intel’s microprocessor is 0.5pF
How do we size the ALU datapath to achieve maximum
speed?
We have already solved this for the inverter chain – can we
generalize it for any type of logic?
EE141 Integrated
© Digital
Circuits2nd
34
Combinational Circuits
EE141 Integrated
© Digital
Circuits2nd
35
Combinational Circuits
Buffer Example
In
Out
1
2
N
CL
For given N: Ci+1/Ci = Ci /Ci-1
To find N: Ci+1/Ci ~ 4
How to generalize this to any logic path?
EE141 Integrated
© Digital
Circuits2nd
36
Combinational Circuits
Apply to Inverter Chain
In
Out
1
2
N
CL
tp = tp1 + tp2 + …+ tpN
C gin, j 1
t pj ~ Runit Cunit
t pj ~ Runit Cunit 1
C
gin
,
j
N
N
C gin, j 1
, C gin, N 1 C L
t p t p , j t p 0 1
C
j 1
i 1
gin, j
EE141 Integrated
© Digital
Circuits2nd
1 f i
37
Combinational Circuits
Optimal Tapering for Given N
Delay equation has N - 1 unknowns, Cgin,2 – Cgin,N
Minimize the delay, find N - 1 partial derivatives
Result: Cgin,j+1/Cgin,j = Cgin,j /Cgin,j-1
Size of each stage is the geometric mean of two neighbors
C gin, j C gin, j 1C gin, j 1
- each stage has the same effective fanout (Cout/Cin)
- each stage has the same delay
EE141 Integrated
© Digital
Circuits2nd
38
Combinational Circuits
Optimum Delay and Number of
Stages
When each stage is sized by f and has same eff. fanout f:
f N F CL / Cgin,1
Effective fanout of each stage:
f NF
Minimum path delay
t p Nt p 0 1 N F /
EE141 Integrated
© Digital
Circuits2nd
39
Combinational Circuits
Generalized logic path
1
a
b
c
5
How to size this generalized logic path?
EE141 Integrated
© Digital
Circuits2nd
40
Combinational Circuits
Logical Effort
Logical effort is the ratio of input capacitance of a gate to the input
capacitance of an minimum-size inverter gate with the same
output current (considering worst case)
VDD
A
VDD
A
2
2
B
F
2
F
A
A
VDD
B
4
A
4
2
F
1
A
B
Inverter
g=1
EE141 Integrated
© Digital
Circuits2nd
1
B
1
2
2-input NAND
g = 4/3
2-input NOR
g = 5/3
41
Combinational Circuits
Normalizing the delay to inverter
Now, consider to normalize everything to an minimum-size inverter
(Assume = 1). Then, (1) What is the load capacitance in each case for
an effective fan-out of 2 (with respect to one input) for example? (2) What
is the intrinsic delay in each case? (3) what is the external delay?
VDD
A
VDD
A
2
2
B
2
F
6
A
VDD
F
A
2
B
4
A
4
8
F
1
A
B
Inverter
B
1
10
2
2-input NAND
t p,INV k Runit Cunit 1 2 t p , NAND
EE141 Integrated Circuits2nd
© Digital
1
8
k Runit Cunit 2 2
6
2-input NOR
10
t p , NOR k Runit Cunit 2 2
42 6
Combinational Circuits
Delay in a logic gate with minimum size
• So, if we normalize everything to an minimum-size inverter
with ginv =1, pinv = 1 (everything is measured in unit delays tinv
• And assume = 1
• Then we can introduce
CL
t p g f
Delay k Runit Cunit 1
Cin
p – intrinsic delay factor
g – logical effort
f – effective fanout
EE141 Integrated
© Digital
Circuits2nd
43
Combinational Circuits
Logical effort
Gate delay:
d=h+p
effort delay
intrinsic delay
Effort delay:
h=gf
logical
effort
effective fanout (of
each stage) = Cout/Cin
Effective fanout (electrical effort) is a function of load/gate size
EE141 Integrated
© Digital
Circuits2nd
44
Combinational Circuits
Logical Effort
Inverter has the smallest logical effort and intrinsic delay of all
static CMOS gates
Logical effort represents the fact that for a given load, complex
gate has to work harder (in terms of transistor sizes) than an
inverter to get a similar delay).
In another way, complex gives more loading capacitance to the
previous gate when made comparable to inverter after sizing
How much harder? How to measure it?
Logical effort for a complex gate can be computed from the
ratio of its input capacitance to the inverter capacitance when
sized to deliver the same current (Logical effort is a function
of topology, independent of sizing)
Logical effort increases with the gate complexity
EE141 Integrated
© Digital
Circuits2nd
45
Combinational Circuits
Logical Effort
Reference: Sutherland, Sproull, Harris, “Logical Effort, Morgan-Kaufmann, 1999.
EE141 Integrated
© Digital
Circuits2nd
46
Combinational Circuits
Normalized delay (d)
Logical Effort of Gates
t pNAND
g=
p=
d=
t pINV
g=
p=
d=
F(Fan-in)
1
EE141 Integrated
© Digital
Circuits2nd
2
3
4
5
Fan-out (h)
6
7
47
Combinational Circuits
Normalized delay (d)
Logical Effort of Gates
t pNAND
g = 4/3
p=2
d = (4/3)f+2
t pINV
g=1
p=1
d = f+1
F(Fan-in)
1
2
3
4
5
Fan-out (h)
6
7
Intrinsic delay is increased by twice since the intrinsic
capacitance gets two times larger
EE141 Integrated
© Digital
Circuits2nd
48
Combinational Circuits
4/
3;
p
=
2
Logical Effort of Gates
=
D:
g
tN
AN
=
g
r:
e
t
er
1;
p=
1
v
in
3
pu
4
In
2-
Normalized Delay
5
Effort
Delay
2
1
Intrinsic
Delay
1
EE141 Integrated
© Digital
Circuits2nd
2
3
Fanout f
4
5
49
Combinational Circuits