Site icon R-bloggers

New in forecast 6.0

[This article was first published on Hyndsight » R, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

This week I uploaded a new version of the forecast package to CRAN. As there were a lot of changes, I decided to increase the version number to 6.0.

The changes are all outlined in the ChangeLog file as usual. I will highlight some of the more important changes since v5.0 here.

ETS

One of the most used functions in the package is ets() and it provides a stock forecasting engine for many organizations. The default model selection is now restricted to exclude multiplicative trend models as these often give very poor forecasts due to the extrapolation of exponential trends. Multiplicative trend models can still be fitted if required. I compared the new default settings with the old defaults on the M3 data, and found a considerable difference in forecast accuracy:

MAPE sMAPE MASE
ETS 17.38 13.13 1.43
ETS (old) 18.04 13.36 1.52
AutoARIMA 19.12 13.85 1.47

Here “ETS” denotes the new default approach (without multiplicative trends) and “ETS (old)” is the old default approach including possibly multiplicative trends. For comparison, the results from applying auto.arima to the same data are also shown.

ARIMA

The auto.arima() function is now stricter on near unit-roots. Even if a model can be estimated, it will not be selected if the characteristic AR or MA roots are too close to the unit circle. This prevents occasional numerical instabilities occurring. Previously the roots had to be at least 0.001 away from the unit circle. Now they have to be at least 0.01 from the unit circle.

There is a new allowmean argument in auto.arima which can be used to prevent a mean term being included in a model.

There is a new plot.Arima() function which plots the characteristic roots of an ARIMA model. This is based on a blog post I wrote last year.

It is now possible to easily obtain the fitted model order for use in other functions. The function arimaorder applied to a fitted ARIMA model (such as that returned by auto.arima) will return a numeric vector of the form (p,d,q) for a nonseasonal model and (p,d,q,P,D,Q,m) for a seasonal model. Similarly, as.character applied to the object returned by Arima or auto.arima will give a character string with the fitted model, suitable for use in plotting or reports.

TBATS/BATS

The models returned by tbats and bats were occasionally unstable. This problem has been fixed, again by restricting the roots to be further away from the unit circle.

STL

stlf and forecast.stl combine forecasting with seasonal decomposition. The seasonally adjusted series are forecast, and then the forecasts are re-seasonalized. These functions now have a forecastfunction argument to allow user-specified methods to be used in the forecasting step.

There is a new stlm function and a corresponding forecast.stlm function to allow the model estimation to be separated from the forecasting, thus matching most other forecasting methods in the package. This allows more flexible specification of the model to be used for the seasonally adjusted series.

ACF/PACF

The Acf function replaces the acf function to provide better plots of the autocorrelation function. The horizontal axis now highlights the seasonal lags.

I have added two new functions taperedacf and taperedpacf to implement the estimates and plots proposed in this recent paper.

Seasonality

The fourier() and fourierf() functions produce a matrix of Fourier terms for use in regression models for seasonal time series. These were updated to work with msts objects so that multiple seasonalities can be fitted.

Occasionally, the period of the seasonality may not be known. The findfrequency() function will estimate it. This is based on an earlier version I wrote for this blog post.

Automatic forecasting

The forecast.ts() function takes a time series and returns some forecasts, without the user necessarily knowing what is going on under the hood. It will use ets with default settings (if the data are non-seasonal or the seasonal period is 12 or less) and stlf (if the seasonal period is 13 or more). If the seasonal period is unknown, there is an option (find.frequency=TRUE) to estimate it first.

 

To leave a comment for the author, please follow the link and comment on their blog: Hyndsight » R.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.