PPTX

Transcript PPTX

Currency Forecasting using Multiple Kernel
Learning with Financially Motivated Features
Tristan Fletcher, Zakria Hussain and John Shawe- Taylor
Fanghua Lin
Financial Services Analytics
Content
• Motivation
• Empirical Study
• Results
• Conclusion & Contribution
Motivation
A trader can profit from accurate prediction of currency trend:
Three situations (buying price (bid) and selling price (ask) ):
If
𝐵𝑖𝑑
𝑃𝑡+Δ𝑡
> 𝑃𝑡𝐴𝑠𝑘 (> 𝑃𝑡𝐵𝑖𝑑 )
𝑩𝒖𝒚 𝒄𝒖𝒓𝒓𝒆𝒏𝒄𝒚
𝐴𝑠𝑘
𝐵𝑖𝑑
If 𝑃𝑡𝐵𝑖𝑑 > 𝑃𝑡+Δ𝑡
(> 𝑃𝑡+Δ𝑡
)
Sell 𝒄𝒖𝒓𝒓𝒆𝒏𝒄𝒚
𝐵𝑖𝑑
𝐴𝑠𝑘
If 𝑃𝑡+Δ𝑡
< 𝑃𝑡𝐴𝑠𝑘 𝑎𝑛𝑑 𝑃𝑡𝐵𝑖𝑑 < 𝑃𝑡+Δ𝑡
Do nothing
Financially Motivated Features
Price-based features
•
•
•
•
F1 =
F2 =
F3 =
F4 =
EMA𝐿1 , … , EMA𝐿𝑁
MA𝐿1 , … , MA𝐿𝑁 , σ𝑡 𝐿1 , … , σ𝑡 𝐿𝑁
𝑃𝑡 , max𝑡 𝐿1 , … , 𝑚𝑎𝑥𝑡 𝐿𝑁 , min𝑡 𝐿1 , … , min𝑡 𝐿𝑁
⇑𝑡 𝐿1 , … , ⇑𝑡 𝐿𝑁 , ⇓𝑡 𝐿1 , … , ⇓𝑡 𝐿𝑁
Volume-based features
• F5…8 = 𝑉𝑡 ,
𝑉𝑡
,𝑉
𝑉𝑡 1 𝑡
− 𝑉𝑡−1 ,
𝑉𝑡 −𝑉𝑡−1
𝑉𝑡 −𝑉𝑡−1 1
• Experimental Design:
• Κ1:5 = exp(− 𝑥 − 𝑥 , 2 /σ1 2 ), … , exp(− 𝑥 − 𝑥 ,
• Κ 6:10 = (< 𝑥, 𝑥 , > +1)𝑑1 , … , (< 𝑥, 𝑥 , > +1)𝑑5
• Κ11:15 =
2
−1
sin
𝜋
• Κ16 = < 𝒙, 𝒙, >
2𝒙𝑇 1 𝑥 ,
1+2𝑥 𝑇
1𝑥
1+2𝑥 ,𝑇
1
𝑥,
2 /σ 2 )
5
2
, … , 𝜋 sin−1 (
2𝑥 𝑇 5 𝑥 ,
(1+2𝑥 𝑇
5
𝑥)(1+2𝑥 ,𝑇
)
5
𝑥,)
Empirical Study
8*16=128 feature/kernel combinations
F𝑖 Κ𝑗 : the combination of i feature with j kernel
Three SVM are trained on the data:
𝐵𝑖𝑑
SVM 1: 𝑃𝑡+Δ𝑡
> 𝑃𝑡𝐴𝑠𝑘
𝑦𝑡1 = +1, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 𝑦𝑡1 = −1
𝐴𝑠𝑘
SVM 2: 𝑃𝑡𝐵𝑖𝑑 > 𝑃𝑡+Δ𝑡
𝑦𝑡2 = +1, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 𝑦𝑡2 = −1
𝐵𝑖𝑑
𝐴𝑠𝑘
SVM 3: 𝑃𝑡+Δ𝑡
< 𝑃𝑡𝐴𝑠𝑘 𝑎𝑛𝑑 𝑃𝑡𝐵𝑖𝑑 < 𝑃𝑡+Δ𝑡
𝑦𝑡3 = +1, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 𝑦𝑡3 = −1
𝒚𝒕 = 𝑦𝑡1 , 𝑦𝑡2 , 𝑦𝑡3 = ±𝟏, ±𝟏, ±𝟏 ,
𝒚𝒕 is correct, when only one element of 𝒚𝒕 is postive
Empirical Study
Training: 100
Testing:100
Time
100 days shifting
100 days shifting
Training:100
Testing:100
Time
10-fold cross-validation was used to select the three kernels with the highest
predictive accuracy for the dataset, namely F8 Κ16 , F1 Κ1 𝑎𝑛𝑑 F1 Κ 3
Results
Results
Percentage Accuracy of Predictions
Δt: Time Horizon (Prediction)
5
10
20
50
100
200
MKL
94.7
89.9
81.7
67.1
61.1
58.9
F8K16
94.7
89.6
81.3
65.4
51.1
45.0
F1K1
93.0
88.4
79.5
65.5
60.7
28.8
F1K3
92.8
84.6
72.3
61.1
59.9
61.3
Conclusion
The most successful individual kernels are selected by cross-validation are awarded very low
weights by SimpleMKL. This reflects a common feature of trading rules where individual
signals can drastically change their significance in terms of performance when use in
combination. Furthermore, the effective method of combining a set of price and volume
based features in order to correctly forecast the direction of price movements in a manner
similar to a trading rule
Financial Forecasting with Gompertz
Multiple Kernel Learning
Han Qin Dejing Dou Yue Fang
2010 IEEE International Conference on Data Mining
Fanghua Lin
Financial Services Analytics
Content
• Models
• Garch
• Gompertz Function
• Gomperz Multiple Kernel Learning
• Subgradient Descent Algorithm
• Empirical Study
• Conclusion & Contribution
Garch Model
Garch Model:
σ𝑡 2 = 𝛼0 +
𝑞
2
𝛼
R
𝑖=1 𝑖 𝑡−𝑖
+
Return of Stock
𝑝
2
𝛽
σ
𝑖
𝑡−𝑗
𝑗=1
𝑉𝑜𝑙𝑎𝑡𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑆𝑡𝑜𝑐𝑘
i.e.,
Future Volatility = 𝑓(Past Returns, Past Volatilities)
Kernel Function
Gompertz Function
Assigns higher weights to most recent data
Garch Model
Gompertz Function
SVM
Subgradient Descent Algorithm
Gomperz Multiple Kernel Learning
(GMKL)
Difference between LMKL and GMKL
 LMKL :training data and test data have same distributions.
GMKL addresses the non-stationary problem by favoring recent data.
 LMKL :single data source but different kernel functions.
GMKL: different data sources with same kernel function.
 LMKL : discovers which kernel function is better for a certain region of the kernel matrix.
GMKL: assigns the weights to different regions by considering the order of time series data.
Empirical Study
Data
Index
Daily Index Closing Price （eg, General Motors Corporation）
5 major international stock indexes:
Time Period
Goal Comparsion
Dw Jones Industrial Average, S &P 500, FTSE 100, Hengsheng , Nikei 225
Jan 2007 – Dec 2009
Model 1: SVM
Model 2: MKL
Forecasting Accuracy
Model 3: GMKL
Relative Absolute Error (RAE)
Test Many Shifting Periods and Average the Performance
Testing
Training
n day shifting
Time
n day shifting
Training
Testing
Time
Forecasting DJI using DJI and one other index
Forecasting using all 5 indexes
Conclusion & Contribution
Conclusion:
• GMKL performs better than both SVM and MKL.
• GMKL model is more robust than MKL when considering more training data sources
Contribution:
For data mining:
• novel model to integrate multiple financial time series data sources
• Propose a domain specific kernel function to leverage domain knowledge in the mining process
For financial forecasting:
• New method to tackle the international market integration problem
• address the non-stationary of the financial time series data
• Reveal interesting relationships among multiple international stock markets
Start-up: Thought Machine
Thought Machine is building technology to revolutionize the way people do their day to day banking. Using
Machine Learning to analyze transactions, to find patterns and let users better understand and manage
their finances.
CEO: Paul Taylor
• Working Experience: Manager and Technical Lead in Google, Chief Executive Officer in Phonetic Arts,
Visiting Lecture in University of Cambridge
• Education: PhD, Edinburgh University’s Centre for Speech Technology Research

PPTX

Transcript PPTX

Directory