VLSI קורס קווי חיבור

Download Report

Transcript VLSI קורס קווי חיבור

Modeling and Optimization of VLSI Interconnect
049031
Lecture 4: Interconnect modeling
Avinoam Kolodny
Konstantin Moiseev
1
Outline

Delay modeling – continuation


Elmore delay for general input
Model reduction. Asymptotic Waveform Evaluation.
 Moments calculation
 Two-pole model example


Transition time modeling





Delay metrics based on moments.
Central moments and their relation to moments
Representation of response as a probability distribution.
Alpert estimation for receiver slope. Slope expressions based on moments
Explicit output slope calculation for singe RC-stage
Driver-receiver interaction





Driver modeling. K-equations
Admittance and impedance calculations
Representation of load as “PI-model”.
Effective capacitance and algorithm for its calculation.
Two-step delay approximation
2
Asymptotic Waveform Evaluation
(AWE)
 Elmore delay uses first moment to represent system response

Not very accurate
 For more accurate estimation, more moments are required
 AWE uses moments matching to calculate parameters of low-order
model:
ˆ  bˆ s   bˆ s q 1
b
0
1
q 1
Hˆ  s  
1  aˆ1s   aˆq s q
where the reduced order q is much less than the original order n
 AWE flow:
Generate moments from the circuit
Match the first
2q moments to low-order q  pole model
Calculate residues
Obtain time-domain response by inverse Laplace transform
“Asymptotic waveform evaluation for timing analysis”, L. Pillage and R. Rohrer, 1990
3
Calculating moments
 How to calculate moments without knowing transfer function?
 From circuit theory:

RC-tree solution can be represented by:
G  sC X  E0
Conductance and capacitance matrices
 Represent:
 Then
Initial excitation vector
Node voltages (unknowns) vector
X  m0  m1s  m2 s2 
 G  sC   m 0  m1 s  m 2 s 2 
E
0
Matching coefficients
Gm 0  E0
Gm i  Cm i-1 , i  1
4
Calculating moments
Gm 0  E0
 First equation meaning:


Solve system for DC ( s  0 ) with the original
excitation
I.e. all capacitors are open-circuited and node
voltages are calculated
Gm i  Cm i-1 , i  1
G  sC X  E0
 Second equation meaning:

Solve system for DC ( s  0 ) with the original
excitation is set to zero, a new excitation Cmi-1 is
applied instead and node voltages are calculated
 I.e. moments are calculated recursively
For m0 calculation
For mi calculation
5
Moments calculation example
All caps are 1F
All resistances are 1Ω
Input voltage is 1V
Moment 0: all capacitances are open
circuited
m0  1 1 1 1 1
Moment 1: all capacitances are replaced by
current sources and input is grounded
m1  0 4 7 9 10
m2  0 30 56 75 85
m3  0 246 462 622 707
6
Example: two-pole approximation
 Assume we calcuated moments of the circuit:
H  s   m0  m1s  m2 s2 
 Transfer function for reduced model:
 Cross-multiply:
b0  b1s
H s 
1  a1s  a2 s 2
1  a s  a s  m
2
1
2
2

m
s

m
s

0
1
2
b
0
 b1s
 Coefficient match:
Build as many
equations as needed
to resolve all
unknowns
s0 :
b0  m0
s1 :
b1  a1  m1
s 2 : 0  a2  a1m1  m2
s 3 : 0  a2 m1  a1m2  m3
In this case only 4
moments are required
7
Example: two-pole approximation
All caps are 1F
All resistances are 1Ω
Input voltage is 1V
Moments at node 5 are: 1, -10, 85, -707, …
1  10s  85s
b0  1
b1  a1  10
0  a2  10a 1 85
0  10a2  85a1  707
2
 707 s 3 
1  a s  a s   b
2
1
2
0
 b1s
7
s
15
Hˆ  s  
143
31
1
s  s2
15
3
1
8
AWE pros and cons
 Allows simplification of the model


Pretty accurate
Reduced model representation in memory is compact
 No exact formula for roots of polynomial with degree higher than 4

Using numerical methods – computationally expensive
 Higher-order AWE approximations are often unstable


Meaning positive poles
Special methods are needed
9
Some delay expressions based on first
two moments
“An analytical delay model for RLC interconnects”, A.B.Kahng and S.Muddu, 1997

m1
1
2
td  m1  4m2  3m1 ln 1 

2
4m2  3m12



 two pole,
 two moments


“Accurate analytical delay model for VLSI interconnects”, A.B.Kahng and S.Muddu, 1995
td  2m2  m ln  2 
2
1
one pole,
matching second moments
instead of first
“RC delay metrics for performance optimization”, C.J. Alpert, A. Devgan and C.V. Kashyap, 2001
m12
td 
ln  2 
m2
“D2M metric”
pure empirical
10
Central moments
 Elmore was the first who made a comparison between voltage
derivative waveform and PDF
 Keeping this analogy higher-order waveshape terms can be
characterized by central moments
 Central moments of the impulse response are the moments about
the mean:


k    t    h  t  dt
k
0

0   h  t  dt  m0  1

 th  t  dt
0

 h  t  dt

m1
m0
0
0



0

0
0
1    t    h  t  dt   th  t  dt   h t  dt  m1  m0  m1  m1  0



m1
m12
m12
2    t    h  t  dt   t h  t  dt  2  th  t  dt    h  t  dt  2m2  2  m1   2 m0  2m2 
m0
m0
m0
0
0
0
0
2
2
2

6m1m2
m13
3    t  h  h  t  dt   6m3 
2 2
m
m0
0
0
3
11
Understanding central moments
 Zero moment 0 is the area under the
curve. Usually 1
 First moment 1 the mean deviation
around mean, thus it is 0
mode
median
mean
 Second moment 2 is the variance of
the distribution
 Third moment 3 is the skewness of
the distribution

Distribution is symmetric if
3  0
12
Some important properties
 Second and third central moments add under convolution, i.e. for
h t   h1 t  * h2 t 
holds that:
2  h   2  h1    2  h2 
3  h   3  h1   3  h2 

Second and third central moments are always positive
13
Elmore delay for general inputs
 The real input is not a step voltage
 It is modeled usually by saturated ramp

For saturated ramp:  vi ' t  dt  1;
0
2  vi ' t   
tr2
;
12
3  vi ' t    0
 Following results hold:
 For an RC circuit with monotonically increasing, piecewise-smooth
input vi  t  such that vi ' t  is a nonnegatively skewed unimodal
function, Mode  Median  Mean holds for the output response vo  t  at
any node.
 For an RC circuit with monotonically increasing, piecewise-smooth
input vi  t  such that vi ' t  is a symmetric function, then the Elmore
delay of the output response reaches 50% delay as the rise-time of
input signal reaches infinity.
 The area between input and output response equals Elmore delay.
14
Ramp follower
 When input signal transition time is large, transient response is
negligible
 Such kind of response is called ramp follower

In ramp follower, Elmore delay is almost exact delay metric
Real circuit example
15
Another example – lumped RC circuit
R
Vin  t 
C
Vout  t 
tr  0
tr  RC
tr  10RC
16
Transition time modeling
 Transition time: requires calculation of two points

Usually less accurate
 Recall:
 Again Elmore proposal…
Transition time is proportional to
second central moment!!
17
Slew rate for step input
 Assume step response modeled by single time constant:
y t   1  e

t
r
 Impulse response is therefore:
h t  
1
r

e
t
r
 Second central moment for this distribution is:

2    t   
0
2
1
r

e
t
r
dt   r2
 Now, match second central moment of model and actual circuit to
find  r :
 r2  2m2  m12
 r  2m2  m12
“A simple metric for slew rate of RC circuits based on two moments”, Agarwal, Sylvester, Blaauw, 2004
18
Slew rate for step input
 To find 10%-90% slew rate, substitute back to step response and
obtain:
S2M metric
t10%90%  ln 9 2m2  m12
 Works good for far-end nodes
 For near-end nodes single pole apporximation fails
 Better metric is found empirically:
t10%90% 
m1
4
m2
ln 9 2m2  m12
scaled S2M metric
From D2M
Example:
19
Extension for ramp inputs
 We know that for LTI system holds:
y t   x t  * h t 
 According to convolution property:
y ' t   x ' t  * h t 
 And since h  t   s ' t  , we can write:
y ' t   x ' t  * s ' t 
 We have:
s ' t 
x ' t 
y ' t 
is a PDF of step response
is a PDF of input waveform
is a PDF of output waveform
Recall:
second central moments are
added after convolution
of two PDFs !!!
“Closed form expressions for extending Step Delay and Slew Metrics to Ramp Inputs”, Kashyap, Liu, Alpert, Devgan, 2003
20
Extension for ramp inputs
 Therefore
2  y   2  x  2  s 
 2  y    2  x   2  s 
 Taking Elmore assumption that slew rate is proportional to standard
deviation:
tr  x   kx  x 
tr  y   k y  y 
 And therefore:
tr  s   ks  s 
 tr  y    tr  x    tr  s  

  
 

k
k
k
 y   x   s 
2
2
2
 Assuming all constants are equal we get:
tr 2  y   tr 2  x   t r 2  s 
21
Extension for ramp inputs
 We get:
tr  y   tr 2  x   t r 2  s 
 In other words, the output slew rate is the root-mean square of the
input signal slew and step response skew.
 Example: lumped RC circuit
Area 1
Area 2
for
t  0.3RC
Exact calculation gives
Vout  1 
RC
t
t
t  t

  RC
e
 e RC






t20%80%  1.38RC
Vout 
Using PERI:
t20%80% 
  t

1 
t  RC e RC  1 

t 


t
 0.6  0.3RC    RC ln 4  1.397 RC
2
2
22
Using PERI for delay estimation
 Using PERI, the following expression is obtained for 50% delay:
td (ramp)  1    TD  td (step)
where


2
2
m

m
2
1
 
2


t
 
2
2
m

m

 2
1

12







5
Elmore
2
If t   , then   0 and td  ramp   TD
23
Interacting with gate delay models
 Interconnect does not exist standalone
 Usually we want to calculate stage delay / slew rather than pure
interconnect delay / slew
output waveform
 The problem:


Interconnect is linear system
Gate is not !
stage
input waveform
 To model stage delay / slew
we need:


Develop simple and accurate
gate model
Be able to combine gate and
interconnect models without
loss of accuracy
24
Gate modeling
 How to model non-linear gate?
 First option – switch resistor model (i.e. linear model)
Rdr
Cin
Vdr
CL
 Advantage:

Is able to capture the interaction of the gate’s output resistance
and the RC load
 Disadvantages:


Requires calculating a single linear resistor that captures the
switching behavior of CMOS gate
Neglects effecy of the input transition time on delay
25
K-factor equations
 Model delay and output transition time as functions of capacitive
load and input transition time
k-factor equations

td  k  ttr in , CL 


ttr out  k '  ttr in , CL 
 For example:
td   k1  k2C L  ttr in  k3CL3  k4CL  k5
 All parameters are set empirically
Input waveform is modeled by
saturated ramp (single number)
26
Table-lookup model
 Empirical models usually do not scale well
 Preferred approach is pre-characterize each cell for bunch of input
transitions and output loads
 Exact value for specific inputs is found using interpolation
 This approach requires two two-dimensional tables (delay and
output slew) for each in-out cell transition
27
Two–step delay approximation
 For non-linear gate model the delay and slew at stage outputs is
calculated in two steps:
1. Given input slew and driver load, delay and slew at the gate output
are calculated from gate model (either empirical or table-based).
Gate output waveform is approximated by saturated ramp
2. The delay and the slew at the interconnect output are calculated
using interconnect model with given sturated ramp input
Should take into
account gate nonlinearity
Linear interconnect
model
28
Handling cell load
 Cell output is loaded by interconnect tree
 The characterization expects single load capacitance number
 For pure capacitive interconnect the characterization will work as is

Can use
Ctotal
as a load
 It will not work for resistive interconnect !!!
The solution: map complex load to single effective capacitance
Allows using table-lookup model
Ceff
Handles complex interconnect structure
29
Effective capacitance
C0  Ceff  Ctotal
30
Calculating Ceff
 Effective capacitance itself is calculated in two stages:


RC-tree is reduced to two-pole 
 model
Ceff is calculated iteratively by comparing average current drawn
from a source by   model and by single capacitance up to 50%
delay point
31
Representing RC load by
 model
 Recall from circuit theory:
 Input admittance is defined by Yin 
I in
Vin
 Input admittance represents “load” seen by external source
connected to the system input
 Admittance moments:
Yin  s   y0  y1s  y2 s2  y3s3 
32
Representing RC load by
 model
 On the other hand,   model admittance is given by:
s  C1  C2   s 2 RC1C2
Y  s  
1  sRC1
 Matching admittance moments we have:
 y0  0
C  C  y  y RC
 1
2
1
0
1

 RC1C2  y2  y1 RC1
 y3  y2 RC1  0
 Solving system obtain:
y22
C2  y1 
y3
y22
C1 
y3
y32
R 2
y3
33
Constructing Thevenin model
 Thevenin model (non-constant voltage source + resistance) is used
to model input waveform
34
Iterative Ceff calculation process
td  k  ttr in , Ceff 


ttr out  k '  ttr in , Ceff 
t0  f  td , t tr out , Ceff , Rd 


t  f '  ttr out , Ceff , Rd 
I avg  Ceff , t , t0 , Rd   I avg  , t , t0 , Rd 
35
Backup
36