SantaFe - NYU Stern School of Business

Download Report

Transcript SantaFe - NYU Stern School of Business

1
EFFICIENT MARKET
HYPOTHESIS



In its simplest form asserts that excess
returns are unpredictable - possibly
even by agents with special information
Even if this is true for long horizons, it
might not be true at short horizons
Microstructure theory discusses the
transition to efficiency
2
TRANSITION TO EFFICIENCY



Glosten-Milgrom(1985), Easley and
O’Hara(1987), Easley and
O’Hara(1992), Copeland and
Galai(1983) and Kyle(1985)
Two indistinguishable classes of traders
- informed and uninformed
Bid and Ask prices are optimally
updated by market maker until
3
information is incorporated in prices
CONSEQUENCES



Informed traders make excess profits at the
expense of uninformed traders.
The higher the proportion of informed traders,
the faster prices adjust to trades, the wider is
the bid ask spread and the lower are the
profits per informed trader.
In real settings with choice over volumes and
speed of trading, informed traders partly
reveal their identity, reducing profits.
4
INFORMED TRADERS

What is an informed trader?
– Information about true value
– Information about fundamentals
– Information about quantities
– Information about who is informed

Temporary profits from trading but
ultimately will be incorporated into
prices
5
HOW FAST IS THIS
TRANSITION?


Difficult to estimate
Data Problems
– Discreteness of dependent variable
– Bid Ask bounce in transaction prices
– Irregular timing of measurements

Measuring independent variables
– Cannot observe private information trading
– Must infer information events
6
SIMPLE STATISTICS



First order autoregression of transaction
prices (50K observations on IBM) has
coefficient of -.4 with t-stat of -101,
R2=.16
No implication for trading since cannot
buy at the bid price or sell at the ask
Same autoregression for midquote has
coefficient -.26 with t-stat -62 and
7
R2=.07
TIME SERIES PROPERTIES

Both are primarily MA(1) - bid ask
bounce for transactions but why for
midquotes?

Test for autocorrelation after MA(1):
– Transaction prices LB(15)=52 (>>25)
– Midquotes
LB(15)=1106 (>>>>25)
8
THEORY




The higher the proportion of information
traders, the faster prices adjust in trade time
When there is information, there is typically a
higher proportion of information traders
When there is information, traders are in a
hurry so trades are close together
When there is information, prices adjust very
fast in calendar time.
9
MEASURING INFORMATION



When traders are in a hurry, they are more
likely to be informed (short durations)
When trades are large they are more likely to
be informative (except perhaps for block
trades)
When bid ask spreads are wide, it is likely
that the proportion of informed traders is high
10
EMPIRICAL EVIDENCE





Engle, Robert and Jeff Russell,(1998) “Autoregressive
Conditional Duration: A New Model for Irregularly Spaced Data,
Econometrica
Engle, Robert,(2000), “The Econometrics of Ultra-High
Frequency Data”, Econometrica
Dufour and Engle(2000), “Time and the Price Impact of a
Trade”, Journal of Finance, forthcoming
Engle and Lunde, “Trades and Quotes - A Bivariate Point
Process”
Russell and Engle, “Econometric analysis of discrete-valued,
irregularly-spaced, financial transactions data”
http://weber.ucsd.edu/~mbacci/en
gle/
11
APPROACH




Model the time to the next price change
as a random duration (ACD Model)
This is a model of volatility (its inverse)
ACD(2,2) with economic predetermined
variables
Key predictors are transactions/time,
volume/transaction, spread
12
PRICE PATH
Time
Price Duration
13
Model 1
Model 2
Parameter

.2107
(6.14)
.3027
(18.22)
1
.0457
(2.60)
.0507
(2.24)
2
.1731
(5.94)
.1578
(5.19)
1
.0769
(1.00)
.1646
(1.61)
2
.5609
(8.07)
.4600
(5.16)
-.0440
(-12.65)
-.0359
(-13.40)
#Trans/Sec
Spread
Volume/Trans
-.0782
(-15.68)
-.0041
(-4.58)
14
EMPIRICAL EVIDENCE





Engle, Robert and Jeff Russell,(1998) “Autoregressive
Conditional Duration: A New Model for Irregularly Spaced Data,
Econometrica
Engle, Robert,(2000), “The Econometrics of Ultra-High
Frequency Data”, Econometrica
Dufour and Engle(2000), “Time and the Price Impact of a
Trade”, Journal of Finance, forthcoming
Engle and Lunde, “Trades and Quotes - A Bivariate Point
Process”
Russell and Engle, “Econometric analysis of discrete-valued,
irregularly-spaced, financial transactions data”
http://weber.ucsd.edu/~mbacci/en
gle/
15
MODELING VOLATILITY WITH
TRANSACTION DATA



Model the change in midquote from one
transaction to the next
Build GARCH model of volatility per unit
of calendar time
Find that short durations and wide
spreads predict higher volatilities in the
future
16
GARCH(1,1)
VARIABLE
Coef
Std.Err Z-Stat
GARCH&ECON
Coef
Std.Err Z-Stat
MEAN
DURS
-0.008
AR(1)
0.279
MA(1)
-0.656
0.004 -1.892 -0.007
0.002 -4.027
0.023
0.022
12.29
0.186
8.507
0.019 -33.86 -0.570
0.016 -35.70
VARIANCE
C
0.988
0.092
10.74 -0.111
0.047 -2.358
ARCH(1)
0.245
0.020
12.33
0.250
0.013
18.73
GARCH(1)
0.622
0.025
24.70
0.158
0.014
11.71
0.587
0.028
21.27
1/DUR
DUR/EXPDUR
-0.040
0.005 -7.992
LONGVOL(-1)
0.096
0.011
8.801
SPREAD(-1)>>
0.736
0.065
11.29
SIZE>10000
0.193
0.119
1.624
1/EXPDUR
LOGLIK
-112246.3
-107406.4
LB(15)
93.092
0.000
40.810
0.000
LB2(15)
30.422
0.004
169.12
0.000
17
EMPIRICAL EVIDENCE





Engle, Robert and Jeff Russell,(1998) “Autoregressive
Conditional Duration: A New Model for Irregularly Spaced Data,
Econometrica
Engle, Robert,(2000), “The Econometrics of Ultra-High
Frequency Data”, Econometrica
Dufour and Engle(2000), “Time and the Price Impact of a
Trade”, Journal of Finance, forthcoming
Engle and Lunde, “Trades and Quotes - A Bivariate Point
Process”
Russell and Engle, “Econometric analysis of discrete-valued,
irregularly-spaced, financial transactions data”
http://weber.ucsd.edu/~mbacci/en
gle/
18
APPROACH



Measure the time between a trade and
a new price quote
Predict this based on economic
variables correcting for censoring by
intervening trades
Find that information variables predict
quicker price revisions
19
EMPIRICAL EVIDENCE





Engle, Robert and Jeff Russell,(1998) “Autoregressive
Conditional Duration: A New Model for Irregularly Spaced Data,
Econometrica
Engle, Robert,(2000), “The Econometrics of Ultra-High
Frequency Data”, Econometrica
Dufour and Engle(2000), “Time and the Price Impact of a
Trade”, Journal of Finance, forthcoming
Engle and Lunde, “Trades and Quotes - A Bivariate Point
Process”
Russell and Engle, “Econometric analysis of discrete-valued,
irregularly-spaced, financial transactions data”
http://weber.ucsd.edu/~mbacci/en
gle/
20
APPROACH



Extend Hasbrouck’s Vector
Autoregressive measurement of price
impact of trades
Measure effect of time between trades
on price impact
Use ACD to model stochastic process of
trade arrivals
21
Cumulative percentage quote revision after an
unexpected buy
0.08
0.06
0.04
0.02
1/17/91
12/24/90
0
1
3
5
7
9
11
13
15
17
19
21
Transaction Time (t)
22
Cumulative percentagequote revisionafter an
unexpected buy
0.08
1/17/91
0.06
0.04
12/24/90
0.02
20:50
18:45
16:40
14:35
12:30
10:25
08:20
06:15
04:10
02:05
0:00
0
Calendar time(min:sec)
23
SUMMARY



The price impacts, the spreads, the
speed of quote revisions, and the
volatility all respond to information
variables
TRANSITION IS FASTER WHEN
THERE IS INFORMATION ARRIVING
Econometric measures of information
– high shares per trade
– short duration between trades
– sustained wide spreads
24
EMPIRICAL EVIDENCE





Engle, Robert and Jeff Russell,(1998) “Autoregressive
Conditional Duration: A New Model for Irregularly Spaced Data,
Econometrica
Engle, Robert,(2000), “The Econometrics of Ultra-High
Frequency Data”, Econometrica
Dufour and Engle(2000), “Time and the Price Impact of a
Trade”, Journal of Finance, forthcoming
Engle and Lunde, “Trades and Quotes - A Bivariate Point
Process”
Russell and Engle, “Econometric analysis of discrete-valued,
irregularly-spaced, financial transactions data”
http://weber.ucsd.edu/~mbacci/en
gle/
25
Jeffrey R. Russell
Robert F. Engle
University of Chicago
University of California, San Diego
Graduate School of Business
http://gsbwww.uchicago.edu/fac/jeffrey.russell/research/
26
IBM
Transaction Price
105.4
105.3
105.2
105.1
105
104.9
104.8
0
2
4
6
8
10
12
14
Time (Minutes)
27
Goal: Develop an econometric model for discrete-valued,
irregularly-spaced time series data.
Method: Propose a class of models for the joint distribution
of the arrival times of the data and the associated price changes.
Questions: Are returns predictable in the short or long run?
How long is the long run? What factors influence this
adjustment rate?
28
Hausman,Lo and MacKinlay



Estimate Ordered Probit Model,JFE(1992)
States are different price processes
Independent variables
– Time between trades
– Bid Ask Spread
– Volume
– SP500 futures returns over 5 minutes
– Buy-Sell indicator
– Lagged dependent variable
29
A Little Notation
Let ti be the arrival time of the ith transaction where
t0<t1<t2…
A sequence of strictly increasing random variables is
called a simple point process.
N(t) denotes the associated counting process.
Let pi denote the price associated with the ith transaction
and let yi=pi-pi-1 denote the price change associated with
the ith transaction.
Since the price changes are discrete we define yi to take
k unique values.
That is yi is a multinomial random variable.
The bivariate process (yi,ti), is called a marked point process.
30
We take the following conditional joint distribution of the
arrival time ti and the mark yi as the general object of interest:

f yi , ti y i 1 ,t i 1

where y i 1   yi 1 , yi  2 ,...  and t i 1  ti 1 , ti  2 ,...
In the spirit of Engle (1996) we decompose the joint distribution
into the product of the conditional and the marginal distribution:

 


f yi , ti y i 1 ,t i 1  g yi y i 1 ,t i  q ti y i 1 ,t i 1

 
 


?
ACD
Engle and Russell (1998)
31
SPECIFYING THE PROBABILITY STRUCTURE
LET ~
xt and ~t be the kx1 vectors indicating the state
observed and the conditional probability of all k states
respectively.
~
th
That is, xt takes the j column of the kxk identity matrix if
th
the j state occurred.
A first order markov chain
~
~
(1)  t  Pxt 1
links these with a transition probability matrix P with the
properties that
a) all elements are non-negative
b) all columns sum to unity
32
WITH COVARIATES

TRANSITION MATRIX P BECOMES

  
Pt  Pij ( z t )  Pr xt  ei xt 1  e j , z t

where ei is the ith column of identity matrix.

TO INSURE THAT THIS IS A TRANSITION
MATRIX FOR ALL POSSIBLE VALUES OF
THE COVARIATES, USE INVERSE
LOGISTIC TRANSFORMATION
33
 k ~ 
 k

log(  i /  k )  log   Pij x j   log   Pkj ~
xj 
 j 1

 j 1






 log Pij / Pkj ~x j
k
j 1

k 1
 Aij x j  bi
j 1
which implies that:
Pij 
exp[ Aij  bi ]
k 1
1   exp[ Alj  bl ]
l 1
34
Rewriting the k-1 log functions as h() this can be written in simple
form as:
(2)
h( )  Ax  b
where A is an unrestricted (k-1)x(k-1) matrix, b is an unrestricted
(k-1)x1 vector and x is a the (k-1)x1 state vector.
35
MORE GENERALLY

Let matrices have time subscripts and
allow other lagged variables:
h( t )  At xt 1  Bt t 1  Ct h( t 1 )  Dt zt

The likelihood is simply a multinomial
for each observation conditional on the
past
L( x; )   xt ' log(  t )
36
Even more generally, we define the Autoregressive Conditional
Multinomial (ACM) model as:
h i    At , j xi  j   i  j    Bt , j xi  j   Ct , j h i  j   GZi
p
q
r
j 1
j 1
j 1
Where h : ( K 1)  ( K 1) is the inverse logistic function.
Zi might contain ti, a constant term, a deterministic function
of time, or perhaps other weakly exogenous variables.
We call this an ACM(p,q,r) model.
37
The data:
58,944 transactions of IBM stock over the 3 months of Nov.
1990 - Jan. 1991 on the consolidated market. (TORQ)
98.6% of the price changes took one of 5 different values.
70
60
P ercent
50
40
30
20
10
0
-1
0
1
P ric e C ha ng e
38
We therefore
consider a 5
state model
defined as
1,0,0,0 if p < -.125
i

0,1,0,0 if - .125  p < 0
i



xi  0,0,0,0 if p i = 0


0,0,1,0 if 0 < p i  .125


0,0,0,1 if p i > .125

It is interesting to consider the sample cross correlogram of
the state vector xi.
39
Sample cross correlations of x
lag = 1






  
   
  

  
3
    
   


   


    
    
   


   


    
7
8
9
10
    
   


   








    
   


   








    
   


   








14
15
up 2
up 1
down 1
down 2
up 2
up 1
down 1
down 2
2
6
    
   


   














  
   
  

  
11
12
13
    
   


   


    
    
   


   


    
    
   


   


    
4






  
   
  

  
    
   


   


    
5
    
   


   


    
    
   


   


    
40
Parameters are estimated using the joint distribution of arrival
times and price changes.

 


f yi , ti y i 1 ,t i 1  g yi y i 1 ,t i  q ti y i 1 ,t i 1

 
 


ACM
ACD
Initially, we consider simple parameterizations in which
the information set for the joint likelihood consists of the
filtration of past arrival times and past price changes.
41
ACM(p,q,r) specification:
h i      A V
p
1/ 2
j i j
j 2
x
i j
  i  j    B j xi  j   C j h i  j  
q
r
j 1
j 1
ln(  i ) g1   i g 2  ( i  i ) g 3   i g 4
Where  i  ti  ti 1 and gj are symmetric.
ACD(s,t) Engle and Russell (1998) specifies the conditional
probability of the ith event arrival at time ti+ by
 
  I i 1   0   where  i  E  | ti 1 , ti  2, ..., xi 1 , xi  2 ,...
 i  i 
1
v
w
 i j t
ln  i       j
   j ln  i  j     j xi  j    j i2 j
j 1
j 1
j 1
 i  j j 1
s
42
Conditional Variance of Price Changes as a Function of Expected Duration
0.008
0.007
0.006
0.004
0.003
0.002
0.001
4.
9
4.
6
4.
3
4
3.
7
3.
4
3.
1
2.
8
2.
5
2.
2
1.
9
1.
6
1.
3
1
0.
7
0.
4
0
0.
1
Volatility
0.005
Expected Duration
43
Simulations
We perform simulations with spreads, volume, and transaction
rates all set to their median value and examine the long run
price impact of two consecutive trades that push the price
down 1 ticks each.
We then perform simulations with spreads, volume and
transaction rates set to their 95 percentile values, one at a
time, for the initial two trades and then reset them to their
median values for the remainder of the simulation.
44
Price impact of 2 consecutive trades each pushing the price
down by 1 tick.
0
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
-0.05
Dollars
-0.1
-0.15
-0.2
-0.25
-0.3
Transaction
Median
High Transaction Rate
Large Volume
Wide Spread
45
0
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
-0.01
Dollars
-0.02
-0.03
-0.04
-0.05
-0.06
Transaction
High Transaction Rate
Large Volume
Wide Spread
46
Conclusions
1. Both the realized and the expected duration impact the
distribution of the price changes for the data studied.
2. Transaction rates tend to be lower when price are falling.
3. Transaction rates tend to be higher when volatility is higher.
4. Simulations suggest that the long run price impact of a
trade can be very sensitive to the volume but is less
sensitive to the spread and the transaction rates.
47