19
Adaptive short term forecasting How to approach short term forecasting of multiple interdependent time series and reduce forecasting error twice [spoiler] A. Akimenko Moscow’16

Adaptive short term forecasting

Embed Size (px)

Citation preview

Adaptive short term forecasting How to approach short term forecasting of multiple interdependent

time series and reduce forecasting error twice [spoiler]

A. AkimenkoMoscow’16

Contents

First glance on the data

Forecasting algorithm

Data preparation and new features

Modeling

Final results & summary

First glance on the data

• Daily bucket volumes - 14 time series; • Correlated with the neighbour with some lag;

Example of one of the time series

• White squares are weekends/holidays or were excluded as outliers; • The time series have dual seasonality (weekly and monthly) and trend;

The task is to develop an algorithm which will predict Y for the next month for each time series with Mean

Absolute Percentage Error (MAPE) < 3%.

Forecasting algorithm

Auto-regression models and moving average (ARMA, ARIMA, GARCH)

SSA/Gusenitca

Neural networks (RNN)

Adaptive short term forecasting:• Exponential smoothing; • Seasonal and trend decomposition;• Adaptive auto-regression;

Adaptive model selection & composition

...

Auto-regression models and moving average (ARMA, ARIMA, GARCH)

SSA/Gusenitca

Neural networks (RNN)

Adaptive short term forecasting:• Exponential smoothing; • Seasonal and trend decomposition);• Adaptive auto-regression;

Adaptive model selection & composition

...

Data preparation and new features

Normalization

Outliers

Observation period

Lagged features

Calendar features

Weighing

Modeling

Regular Linear Model

Penalized Linear Model

Accuracy measureScaled Errors:

• Mean absolute error (MAE) or mean absolute deviation (MAD)

• Mean squared error (MSE) or mean squared prediction error (MSPE)

• Root mean squared error (RMSE)

• Average of Errors (E)

Percentage Errors:

• Mean absolute percentage error (MAPE) or mean absolute percentage deviation (MAPD)

Scaled Errors:

• Mean absolute scaled error (MASE)

Other Measures:

• Forecast skill (SS)

Accuracy measureScaled Errors:

• Mean absolute error (MAE) or mean absolute deviation (MAD)

• Mean squared error (MSE) or mean squared prediction error (MSPE)

• Root mean squared error (RMSE)

• Average of Errors (E)

Percentage Errors:

• Mean absolute percentage error (MAPE) or mean absolute percentage deviation (MAPD)

Scaled Errors:

• Mean absolute scaled error (MASE)

Other Measures:

• Forecast skill (SS)

Challenger models0.Dimension reduction (Principal Component Analysis - PCA).

1.Ensembles (Random Forest, Gradient Boosted Models - GBM and XGBoost);

2.Regressions (Linear, Stepwise, Ridge and Lasso);

3.Distance based (k-Nearest Neighbor - kNN);

Final results & summary

As a result of testing, weighted penalized regression was chosen as base algorithm with α=0 (ridge regression) and λ=0.005. Observation period was set as 2 years.

• The proposed algorithm allows to build time series forecasting of multiple interdependent time series;

• Reveals any kind of seasonality;

• Deals with missing values/outliers;

• Removes overfitting/multicollinearity via penalization;

• Scalable for new features (both lagged and calendar), period of forecasting and number of interdependent time series.

Thank you for your attention!

Full research is available here: https://alexakimenko.github.io/time_series/2016/09/18/adaptive-short-term-forecasting/