Transcript PPTX

Currency Forecasting using Multiple Kernel
Learning with Financially Motivated Features
Tristan Fletcher, Zakria Hussain and John Shawe- Taylor
Fanghua Lin
Financial Services Analytics
Content
β€’ Motivation
β€’ Empirical Study
β€’ Results
β€’ Conclusion & Contribution
Motivation
A trader can profit from accurate prediction of currency trend:
Three situations (buying price (bid) and selling price (ask) ):
If
𝐡𝑖𝑑
𝑃𝑑+Δ𝑑
> π‘ƒπ‘‘π΄π‘ π‘˜ (> 𝑃𝑑𝐡𝑖𝑑 )
π‘©π’–π’š π’„π’–π’“π’“π’†π’π’„π’š
π΄π‘ π‘˜
𝐡𝑖𝑑
If 𝑃𝑑𝐡𝑖𝑑 > 𝑃𝑑+Δ𝑑
(> 𝑃𝑑+Δ𝑑
)
Sell π’„π’–π’“π’“π’†π’π’„π’š
𝐡𝑖𝑑
π΄π‘ π‘˜
If 𝑃𝑑+Δ𝑑
< π‘ƒπ‘‘π΄π‘ π‘˜ π‘Žπ‘›π‘‘ 𝑃𝑑𝐡𝑖𝑑 < 𝑃𝑑+Δ𝑑
Do nothing
Financially Motivated Features
Price-based features
β€’
β€’
β€’
β€’
F1 =
F2 =
F3 =
F4 =
EMA𝐿1 , … , EMA𝐿𝑁
MA𝐿1 , … , MA𝐿𝑁 , σ𝑑 𝐿1 , … , σ𝑑 𝐿𝑁
𝑃𝑑 , max𝑑 𝐿1 , … , π‘šπ‘Žπ‘₯𝑑 𝐿𝑁 , min𝑑 𝐿1 , … , min𝑑 𝐿𝑁
⇑𝑑 𝐿1 , … , ⇑𝑑 𝐿𝑁 , ⇓𝑑 𝐿1 , … , ⇓𝑑 𝐿𝑁
Volume-based features
β€’ F5…8 = 𝑉𝑑 ,
𝑉𝑑
,𝑉
𝑉𝑑 1 𝑑
βˆ’ π‘‰π‘‘βˆ’1 ,
𝑉𝑑 βˆ’π‘‰π‘‘βˆ’1
𝑉𝑑 βˆ’π‘‰π‘‘βˆ’1 1
β€’ Experimental Design:
β€’ Κ1:5 = exp(βˆ’ π‘₯ βˆ’ π‘₯ , 2 /Οƒ1 2 ), … , exp(βˆ’ π‘₯ βˆ’ π‘₯ ,
β€’ Κ 6:10 = (< π‘₯, π‘₯ , > +1)𝑑1 , … , (< π‘₯, π‘₯ , > +1)𝑑5
β€’ Κ11:15 =
2
βˆ’1
sin
πœ‹
β€’ Κ16 = < 𝒙, 𝒙, >
2𝒙𝑇 1 π‘₯ ,
1+2π‘₯ 𝑇
1π‘₯
1+2π‘₯ ,𝑇
1
π‘₯,
2 /Οƒ 2 )
5
2
, … , πœ‹ sinβˆ’1 (
2π‘₯ 𝑇 5 π‘₯ ,
(1+2π‘₯ 𝑇
5
π‘₯)(1+2π‘₯ ,𝑇
)
5
π‘₯,)
Empirical Study
8*16=128 feature/kernel combinations
F𝑖 Ξšπ‘— : the combination of i feature with j kernel
Three SVM are trained on the data:
𝐡𝑖𝑑
SVM 1: 𝑃𝑑+Δ𝑑
> π‘ƒπ‘‘π΄π‘ π‘˜
𝑦𝑑1 = +1, π‘œπ‘‘β„Žπ‘’π‘Ÿπ‘€π‘–π‘ π‘’ 𝑦𝑑1 = βˆ’1
π΄π‘ π‘˜
SVM 2: 𝑃𝑑𝐡𝑖𝑑 > 𝑃𝑑+Δ𝑑
𝑦𝑑2 = +1, π‘œπ‘‘β„Žπ‘’π‘Ÿπ‘€π‘–π‘ π‘’ 𝑦𝑑2 = βˆ’1
𝐡𝑖𝑑
π΄π‘ π‘˜
SVM 3: 𝑃𝑑+Δ𝑑
< π‘ƒπ‘‘π΄π‘ π‘˜ π‘Žπ‘›π‘‘ 𝑃𝑑𝐡𝑖𝑑 < 𝑃𝑑+Δ𝑑
𝑦𝑑3 = +1, π‘œπ‘‘β„Žπ‘’π‘Ÿπ‘€π‘–π‘ π‘’ 𝑦𝑑3 = βˆ’1
π’šπ’• = 𝑦𝑑1 , 𝑦𝑑2 , 𝑦𝑑3 = ±πŸ, ±πŸ, ±πŸ ,
π’šπ’• is correct, when only one element of π’šπ’• is postive
Empirical Study
Training: 100
Testing:100
Time
100 days shifting
100 days shifting
Training:100
Testing:100
Time
10-fold cross-validation was used to select the three kernels with the highest
predictive accuracy for the dataset, namely F8 Κ16 , F1 Κ1 π‘Žπ‘›π‘‘ F1 Κ 3
Results
Results
Percentage Accuracy of Predictions
Ξ”t: Time Horizon (Prediction)
5
10
20
50
100
200
MKL
94.7
89.9
81.7
67.1
61.1
58.9
F8K16
94.7
89.6
81.3
65.4
51.1
45.0
F1K1
93.0
88.4
79.5
65.5
60.7
28.8
F1K3
92.8
84.6
72.3
61.1
59.9
61.3
Conclusion
The most successful individual kernels are selected by cross-validation are awarded very low
weights by SimpleMKL. This reflects a common feature of trading rules where individual
signals can drastically change their significance in terms of performance when use in
combination. Furthermore, the effective method of combining a set of price and volume
based features in order to correctly forecast the direction of price movements in a manner
similar to a trading rule
Financial Forecasting with Gompertz
Multiple Kernel Learning
Han Qin Dejing Dou Yue Fang
2010 IEEE International Conference on Data Mining
Fanghua Lin
Financial Services Analytics
Content
β€’ Models
β€’ Garch
β€’ Gompertz Function
β€’ Gomperz Multiple Kernel Learning
β€’ Subgradient Descent Algorithm
β€’ Empirical Study
β€’ Conclusion & Contribution
Garch Model
Garch Model:
σ𝑑 2 = 𝛼0 +
π‘ž
2
𝛼
R
𝑖=1 𝑖 π‘‘βˆ’π‘–
+
Return of Stock
𝑝
2
𝛽
Οƒ
𝑖
π‘‘βˆ’π‘—
𝑗=1
π‘‰π‘œπ‘™π‘Žπ‘‘π‘–π‘™π‘–π‘‘π‘¦ π‘œπ‘“ π‘†π‘‘π‘œπ‘π‘˜
i.e.,
Future Volatility = 𝑓(Past Returns, Past Volatilities)
Kernel Function
Gompertz Function
Assigns higher weights to most recent data
Garch Model
Gompertz Function
SVM
Subgradient Descent Algorithm
Gomperz Multiple Kernel Learning
(GMKL)
Difference between LMKL and GMKL
ο‚· LMKL :training data and test data have same distributions.
GMKL addresses the non-stationary problem by favoring recent data.
ο‚· LMKL :single data source but different kernel functions.
GMKL: different data sources with same kernel function.
ο‚· LMKL : discovers which kernel function is better for a certain region of the kernel matrix.
GMKL: assigns the weights to different regions by considering the order of time series data.
Empirical Study
Data
Index
Daily Index Closing Price (eg, General Motors CorporationοΌ‰
5 major international stock indexes:
Time Period
Goal Comparsion
Dw Jones Industrial Average, S &P 500, FTSE 100, Hengsheng , Nikei 225
Jan 2007 – Dec 2009
Model 1: SVM
Model 2: MKL
Forecasting Accuracy
Model 3: GMKL
Relative Absolute Error (RAE)
Test Many Shifting Periods and Average the Performance
Testing
Training
n day shifting
Time
n day shifting
Training
Testing
Time
Forecasting DJI using DJI and one other index
Forecasting using all 5 indexes
Conclusion & Contribution
Conclusion:
β€’ GMKL performs better than both SVM and MKL.
β€’ GMKL model is more robust than MKL when considering more training data sources
Contribution:
For data mining:
β€’ novel model to integrate multiple financial time series data sources
β€’ Propose a domain specific kernel function to leverage domain knowledge in the mining process
For financial forecasting:
β€’ New method to tackle the international market integration problem
β€’ address the non-stationary of the financial time series data
β€’ Reveal interesting relationships among multiple international stock markets
Start-up: Thought Machine
Thought Machine is building technology to revolutionize the way people do their day to day banking. Using
Machine Learning to analyze transactions, to find patterns and let users better understand and manage
their finances.
CEO: Paul Taylor
β€’ Working Experience: Manager and Technical Lead in Google, Chief Executive Officer in Phonetic Arts,
Visiting Lecture in University of Cambridge
β€’ Education: PhD, Edinburgh University’s Centre for Speech Technology Research