Function to create (and/or modify) a c("regarima_spec","TRAMO_SEATS")
class object with the RegARIMA model specification
for the TRAMO-SEATS method. The object can be created from the name (character
) of a predefined 'JDemetra+' model specification,
a previous specification (c("regarima_spec","TRAMO_SEATS")
object) or a TRAMO-SEATS RegARIMA model (c("regarima","TRAMO_SEATS")
).
regarima_spec_tramoseats(
spec = c("TRfull", "TR0", "TR1", "TR2", "TR3", "TR4", "TR5"),
preliminary.check = NA,
estimate.from = NA_character_,
estimate.to = NA_character_,
estimate.first = NA_integer_,
estimate.last = NA_integer_,
estimate.exclFirst = NA_integer_,
estimate.exclLast = NA_integer_,
estimate.tol = NA_integer_,
estimate.eml = NA,
estimate.urfinal = NA_integer_,
transform.function = c(NA, "Auto", "None", "Log"),
transform.fct = NA_integer_,
usrdef.outliersEnabled = NA,
usrdef.outliersType = NA,
usrdef.outliersDate = NA,
usrdef.outliersCoef = NA,
usrdef.varEnabled = NA,
usrdef.var = NA,
usrdef.varType = NA,
usrdef.varCoef = NA,
tradingdays.mauto = c(NA, "Unused", "FTest", "WaldTest"),
tradingdays.pftd = NA_integer_,
tradingdays.option = c(NA, "TradingDays", "WorkingDays", "UserDefined", "None"),
tradingdays.leapyear = NA,
tradingdays.stocktd = NA_integer_,
tradingdays.test = c(NA, "Separate_T", "Joint_F", "None"),
easter.type = c(NA, "Unused", "Standard", "IncludeEaster", "IncludeEasterMonday"),
easter.julian = NA,
easter.duration = NA_integer_,
easter.test = NA,
outlier.enabled = NA,
outlier.from = NA_character_,
outlier.to = NA_character_,
outlier.first = NA_integer_,
outlier.last = NA_integer_,
outlier.exclFirst = NA_integer_,
outlier.exclLast = NA_integer_,
outlier.ao = NA,
outlier.tc = NA,
outlier.ls = NA,
outlier.so = NA,
outlier.usedefcv = NA,
outlier.cv = NA_integer_,
outlier.eml = NA,
outlier.tcrate = NA_integer_,
automdl.enabled = NA,
automdl.acceptdefault = NA,
automdl.cancel = NA_integer_,
automdl.ub1 = NA_integer_,
automdl.ub2 = NA_integer_,
automdl.armalimit = NA_integer_,
automdl.reducecv = NA_integer_,
automdl.ljungboxlimit = NA_integer_,
automdl.compare = NA,
arima.mu = NA,
arima.p = NA_integer_,
arima.d = NA_integer_,
arima.q = NA_integer_,
arima.bp = NA_integer_,
arima.bd = NA_integer_,
arima.bq = NA_integer_,
arima.coefEnabled = NA,
arima.coef = NA,
arima.coefType = NA,
fcst.horizon = NA_integer_
)
A list of class c("regarima_spec","TRAMO_SEATS")
with the following components, each referring to a different part
of the RegARIMA model specification, mirroring the arguments of the function (for details, see the arguments description).
Each lowest-level component (except the span, pre-specified outliers, user-defined variables and pre-specified ARMA coefficients)
is structured within a data frame with columns denoting different variables of the model specification and rows referring to:
first row = the base specification, as provided within the argument spec
;
second row = user modifications as specified by the remaining arguments of the function (e.g.: arima.d
);
and third row = the final model specification, values that will be used in the function regarima
.
The final specification (third row) shall include user modifications (row two) unless they were wrongly specified.
The pre-specified outliers, user-defined variables and pre-specified ARMA coefficients consist of a list
with the Predefined
(base model specification) and Final
values.
a data frame containing Variables referring to: span
- time span to be used for the estimation,
tolerance
- argument estimate.tol
, exact_ml
- argument estimate.eml
, urfinal
- argument esimate.urfinal
.
The final values can be also accessed with the function s_estimate
.
a data frame containing variables referring to: tfunction
- argument transform.function
,
fct
- argument transform.fct
. The final values can be also accessed with the function s_transform
.
a list containing information on the user-defined variables (userdef
), trading.days
effect and easter
effect.
The user-defined part includes: specification
- data frame with the information if pre-specified outliers (outlier
)
and user-defined variables (variables
) are included in the model and if fixed coefficients are used (outlier.coef
and variables.coef
).
The final values can be also accessed with the function s_usrdef
; outliers
- matrixes with the outliers
(Predefined
and Final
). The final outliers can be also accessed with the function s_preOut
;
and variables
- list with the Predefined
and Final
user-defined variables (series
) and its description
(description
) including information on the variable type and values of fixed coefficients.
The final user-defined variables can be also accessed with the function s_preVar
.
The trading.days
data frame variables refer to:
automatic
- argument tradingdays.mauto
,
pftd
- argument tradingdays.pftd
,
option
- argument tradingdays.option
,
leapyear
- argument tradingdays.leapyear
,
stocktd
- argument tradingdays.stocktd
,
test
- argument tradingdays.test
. The final trading.days
values can be also accessed with the function s_td
.
The easter
data frame variables refer to:
type
- argument easter.type
,
julian
- argument easter.julian
,
duration
- argument easter.duration
,
test
- argument easter.test
. The final easter
values can be also accessed with the function s_easter
.
a data frame. Variables referring to:
ao
- argument outlier.ao
,
tc
- argument outlier.tc
,
ls
- argument outlier.ls
,
so
- argument outlier.so
,
usedefcv
- argument outlier.usedefcv
,
cv
- argument outlier.cv
,
eml
- argument outlier.eml
,
tcrate
- argument outlier.tcrate
. The final values can be also accessed with the function s_out
.
a list containing a data frame with the ARIMA settings (specification
) and matrices giving information
on the pre-specified ARMA coefficients (coefficients
). The matrix Predefined
refers to the pre-defined model specification
and matrix Final
, to the final specification. Both matrices contain the values of the ARMA coefficients and the procedure
for its estimation.
In the data frame specification
, the variable enabled
refers to the argument automdl.enabled
and all remaining variables (automdl.acceptdefault, automdl.cancel, automdl.ub1, automdl.ub2, automdl.armalimit,
automdl.reducecv, automdl.ljungboxlimit, automdl.compare, arima.mu, arima.p, arima.d, arima.q, arima.bp, arima.bd,
arima.bq
), to the respective function arguments.
The final values of the specification
can be also accessed with the function s_arima
,
and final pre-specified ARMA coefficients with the function s_arimaCoef
.
a data frame with the forecasting horizon (argument fcst.horizon
).
The final value can be also accessed with the function s_fcst
.
a matrix containing the final time span for the model estimation and outliers' detection.
It contains the same information as the variable span in the data frames estimate and outliers.
The matrix can be also accessed with the function s_span
.
the model specification. It can be the name (character
) of a predefined 'JDemetra+' model specification
(see Details), an object of class c("regarima_spec","TRAMO_SEATS")
or an object of class c("regarima", "TRAMO_SEATS")
.
The default is "TRfull"
.
a logical
to check the quality of the input series and exclude highly problematic series
e.g. the series with a number of identical observations and/or missing values above pre-specified threshold values.
The time span of the series, which is the (sub)period used to estimate the regarima model, is controlled by the following six variables:
estimate.from, estimate.to, estimate.first, estimate.last, estimate.exclFirst
and estimate.exclLast
;
where estimate.from
and estimate.to
have priority over the remaining span control variables,
estimate.last
and estimate.first
have priority over estimate.exclFirst
and estimate.exclLast
,
and estimate.last
has priority over estimate.first
. Default= "All".
a character in format "YYYY-MM-DD" indicating the start of the time span (e.g. "1900-01-01").
It can be combined with the parameter estimate.to
.
a character
in format "YYYY-MM-DD" indicating the end of the time span (e.g. "2020-12-31").
It can be combined with the parameter estimate.from
.
numeric
, the number of periods considered at the beginning of the series.
numeric
, the number of periods considered at the end of the series.
numeric
, the number of periods excluded at the beginning of the series.
It can be combined with the parameter estimate.exclLast
.
numeric
, the number of periods excluded at the end of the series.
It can be combined with the parameter estimate.exclFirst
.
numeric
, the convergence tolerance. The absolute changes in the log-likelihood function are compared to this value
to check for the convergence of the estimation iterations.
logical
, the exact maximum likelihood estimation. If TRUE
, the program performs an exact maximum likelihood estimation.
If FASLE
, the Unconditional Least Squares method is used.
numeric
, the final unit root limit. The threshold value for the final unit root test
for identification of differencing orders. If the magnitude of an AR root for the final model is smaller than this number,
then a unit root is assumed, the order of the AR polynomial is reduced by one and the appropriate order of the differencing
(non-seasonal, seasonal) is increased.
the transformation of the input series: "None"
= no transformation of the series;
"Log"
= takes the log of the series; "Auto"
= the program tests for the log-level specification.
numeric
controlling the bias in the log/level pre-test:
transform.fct
> 1 favours levels, transform.fct
< 1 favours logs.
Considered only when transform.function
is set to "Auto"
.
Control variables for the pre-specified outliers. Said pre-specified outliers are used in the model only when enabled
(usrdef.outliersEnabled=TRUE
) and when the outliers' type (usrdef.outliersType
) and date (usrdef.outliersDate
)
are provided.
logical
. If TRUE
, the program uses the pre-specified outliers.
a vector defining the outliers' type. Possible types are: ("AO"
) = additive,
("LS"
) = level shift, ("TC"
) = transitory change, ("SO"
) = seasonal outlier.
E.g.: usrdef.outliersType= c("AO","AO","LS")
.
a vector defining the outliers' date. The dates should be characters in format "YYYY-MM-DD".
E.g.: usrdef.outliersDate= c("2009-10-01","2005-02-01","2003-04-01")
.
a vector providing fixed coefficients for the outliers. The coefficients can't be fixed if
the parameter transform.function
is set to "Auto"
(i.e. if the series transformation needs to be pre-defined.)
E.g.: usrdef.outliersCoef= c(200,170,20)
.
Control variables for the user-defined variables:
logical
If TRUE
, the program uses the user-defined variables.
a time series (ts
) or a matrix of time series (mts
) containing the user-defined variables.
a vector of character(s) defining the user-defined variables component type.
Possible types are: "Undefined", "Series", "Trend", "Seasonal", "SeasonallyAdjusted", "Irregular", "Calendar"
.
To use the user-defined calendar regressors, the type "Calendar"
must be defined in conjunction with tradingdays.option = "UserDefined"
.
Otherwise, the program will automatically set usrdef.varType = "Undefined"
.
a vector providing fixed coefficients for the user-defined variables. The coefficients can't be fixed if
transform.function
is set to "Auto"
(i.e. if the series transformation needs to be pre-defined).
defines whether the calendar effects should be added to the model manually ("Unused"
) or automatically.
During the automatic selection, the choice of the number of calendar variables can be based on the F-Test ("FTest"
) or the Wald Test ("WaldTest"
);
the model with higher F value is chosen, provided that it is higher than tradingdays.pftd
).
numeric
. The p-value used in the test specified by the automatic parameter (tradingdays.mauto
)
to assess the significance of the pre-tested calendar effects variables and whether they should be included in the RegArima model.
Control variables for the manual selection of calendar effects variables (tradingdays.mauto
is set to "Unused"
):
to choose the trading days regression variables: "TradingDays"
= six day-of-the-week regression variables;
"WorkingDays"
= one working/non-working day contrast variable; "None"
= no correction for trading days and working days effects;
"UserDefined"
= user-defined trading days regressors (regressors must be defined by the usrdef.var
argument
with usrdef.varType
set to "Calendar"
and usrdef.varEnabled = TRUE
).
"None"
must also be chosen for the "day-of-week effects" correction (and tradingdays.stocktd
must be modified accordingly).
logical
. Specifies if the leap-year correction should be included.
If TRUE
, the model includes the leap-year effect.
numeric indicating the day of the month when inventories and other stock are reported (to denote the last day of the month set the variable to 31). Modifications of this variable are taken into account only when tradingdays.option
is set to "None"
.
defines the pre-tests of the trading day effects: "None"
= calendar variables are used in the model without pre-testing;
"Separate_T"
= a t-test is applied to each trading day variable separately and the trading day variables are included in the RegArima model
if at least one t-statistic is greater than 2.6 or if two t-statistics are greater than 2.0 (in absolute terms);
"Joint_F"
= a joint F-test of significance of all the trading day variables. The trading day effect is significant if the F statistic is greater than 0.95.
acharacter
that specifies the presence and the length of the Easter effect:
"Unused"
= the Easter effect is not considered; "Standard"
= influences the period of n
days strictly before Easter Sunday;
"IncludeEaster"
= influences the entire period (n
) up to and including Easter Sunday;
"IncludeEasterMonday"
= influences the entire period (n
) up to and including Easter Monday.
logical
. If TRUE
, the program uses the Julian Easter (expressed in Gregorian calendar).
numeric
indicating the duration of the Easter effect (length in days, between 1 and 15).
logical
. If TRUE
, the program performs a t-test for the significance of the Easter effect.
The Easter effect is considered as significant if the modulus of t-statistic is greater than 1.96.
logical
. If TRUE
, the automatic detection of outliers is enabled in the defined time span.
The time span of the series to be searched for outliers is controlled by the following six variables:
outlier.from, outlier.to, outlier.first, outlier.last, outlier.exclFirst
and outlier.exclLast
;
where outlier.from
and outlier.to
have priority over the remaining span control variables,
outlier.last
and outlier.first
have priority over outlier.exclFirst
and outlier.exclLast
,
and outlier.last
has priority over outlier.first
.
a character in format "YYYY-MM-DD" indicating the start of the time span (e.g. "1900-01-01").
It can be combined with outlier.to
.
a character in format "YYYY-MM-DD" indicating the end of the time span (e.g. "2020-12-31").
It can be combined with outlier.from
.
numeric
specifying the number of periods considered at the beginning of the series.
numeric
specifying the number of periods considered at the end of the series.
numeric
specifying the number of periods excluded at the beginning of the series. It can be combined with outlier.exclLast
.
numeric
specifying the number of periods excluded at the end of the series. It can be combined with outlier.exclFirst
.
logical
. If TRUE
, the automatic detection of additive outliers is enabled (outlier.enabled
must also be set to TRUE
).
logical
. If TRUE
, the automatic detection of transitory changes is enabled (outlier.enabled
must also be set to TRUE
).
logical
. If TRUE
, the automatic detection of level shifts is enabled (outlier.enabled
must also be set to TRUE
).
logical
. If TRUE
, the automatic detection of seasonal outliers is enabled (outlier.enabled
must also be set to TRUE
).
logical
. If TRUE
, the critical value for the outliers' detection procedure is automatically determined
by the number of observations in the outlier detection time span. If FALSE
, the procedure uses the entered critical value (outlier.cv
).
numeric
. The entered critical value for the outliers' detection procedure. The modification of this variable
is only taken in to account when outlier.usedefcv
is set to FALSE
.
logical
for the exact likelihood estimation method. It controls the method applied for a parameter estimation
in the intermediate steps of the automatic detection and correction of outliers. If TRUE
, an exact likelihood estimation method is used.
When FALSE
, the fast Hannan-Rissanen method is used.
numeric
. The rate of decay for the transitory change outlier.
logical
. If TRUE
, the automatic modelling of the ARIMA model is enabled.
If FALSE
, the parameters of the ARIMA model can be specified.
Control variables for the automatic modelling of the ARIMA model (automdl.enabled
is set to TRUE
):
logical
. If TRUE
, the default model (ARIMA(0,1,1)(0,1,1)) may be chosen in the first step
of the automatic model identification. If the Ljung-Box Q statistics for the residuals is acceptable, the default model is accepted
and no further attempt will be made to identify another model.
numeric
, the cancellation limit. If the difference in moduli of an AR and an MA roots (when estimating ARIMA(1,0,1)(1,0,1) models
in the second step of the automatic identification of the differencing orders) is smaller than the cancellation limit, the two roots are assumed equal and canceled out.
numeric
, the first unit root limit. It is the threshold value for the initial unit root test in the automatic differencing procedure.
When one of the roots in the estimation of the ARIMA(2,0,0)(1,0,0) plus mean model, performed in the first step of the automatic model identification procedure,
is larger than first unit root limit in modulus, it is set equal to unity.
numeric
, the second unit root limit. When one of the roots in the estimation of the ARIMA(1,0,1)(1,0,1) plus mean model,
which is performed in the second step of the automatic model identification procedure, is larger than second unit root limit in modulus,
it is checked if there is a common factor in the corresponding AR and MA polynomials of the ARMA model that can be canceled (see automdl.cancel
).
If there is no cancellation, the AR root is set equal to unity (i.e. the differencing order changes).
numeric
, the arma limit. It is the threshold value for t-statistics of ARMA coefficients and the constant term used
for the final test of model parsimony. If the highest order ARMA coefficient has a t-value smaller than this value in magnitude, the order of the model is reduced.
If the constant term has a t-value smaller than the ARMA limit in magnitude, it is removed from the set of regressors.
numeric
, ReduceCV. The percentage by which the outlier critical value will be reduced
when an identified model is found to have a Ljung-Box statistic with an unacceptable confidence coefficient.
The parameter should be between 0 and 1, and will only be active when automatic outlier identification is enabled.
The reduced critical value will be set to (1-ReduceCV)xCV, where CV is the original critical value.
numeric
, the Ljung Box limit, setting the acceptance criterion for the confidence intervals of the Ljung-Box Q statistic.
If the LjungBox Q statistics for the residuals of a final model is greater than Ljung Box limit, then the model is rejected, the outlier critical value is reduced,
and model and outlier identification (if specified) is redone with a reduced value.
logical
. If TRUE
, the program compares the model identified by the automatic procedure to the default model (ARIMA(0,1,1)(0,1,1))
and the model with the best fit is selected. Criteria considered are residual diagnostics, the model structure and the number of outliers.
Control variables for the non-automatic modelling of the ARIMA model (automdl.enabled
is set to FALSE
):
logical
. If TRUE
, the mean is considered as part of the ARIMA model.
numeric
. The order of the non-seasonal autoregressive (AR) polynomial.
numeric
. The regular differencing order.
numeric
. The order of the non-seasonal moving average (MA) polynomial.
numeric
. The order of the seasonal autoregressive (AR) polynomial.
numeric
. The seasonal differencing order.
numeric
. The order of the seasonal moving average (MA) polynomial.
Control variables for the user-defined ARMA coefficients. Such coefficients can be defined for the regular and seasonal autoregressive (AR) polynomials
and moving average (MA) polynomials. The model considers the coefficients only if the procedure for their estimation (arima.coefType
) is provided,
and the number of provided coefficients matches the sum of (regular and seasonal) AR and MA orders (p,q,bp,bq
).
logical
. If TRUE
, the program uses the user-defined ARMA coefficients.
a vector providing the coefficients for the regular and seasonal AR and MA polynomials.
The length of the vector must be equal to the sum of the regular and seasonal AR and MA orders. The coefficients shall be provided in the following order:
regular AR (Phi - p
elements), regular MA (Theta - q
elements), seasonal AR (BPhi - bp
elements)
and seasonal MA (BTheta - bq
elements).
E.g.: arima.coef=c(0.6,0.7)
with arima.p=1, arima.q=0,arima.bp=1
and arima.bq=0
.
avector defining the ARMA coefficients estimation procedure. Possible procedures are:
"Undefined"
= no use of user-defined input (i.e. coefficients are estimated),
"Fixed"
= fixes the coefficients at the value provided by the user,
"Initial"
= the value defined by the user is used as initial condition.
For orders for which the coefficients shall not be defined, the arima.coef
can be set to NA
or 0
or the arima.coefType
can be set to "Undefined"
.
E.g.: arima.coef = c(-0.8,-0.6,NA)
, arima.coefType = c("Fixed","Fixed","Undefined")
.
numeric
, the forecasting horizon. The length of the forecasts generated by the RegARIMA model
in periods (positive values) or years (negative values). By default, the program generates two years forecasts (fcst.horizon
set to -2
).
The available predefined 'JDemetra+' model specifications are described in the table below:
Identifier | | Log/level detection | | Outliers detection | | Calendar effects | | ARIMA | TR0 | | NA | | NA | |
NA | | Airline(+mean) | TR1 | | automatic | | AO/LS/TC | | NA | | Airline(+mean) | TR2 | |
automatic | | AO/LS/TC | | 2 td vars + Easter | | Airline(+mean) | TR3 | | automatic | | AO/LS/TC | | NA | |
automatic | TR4 | | automatic | | AO/LS/TC | | 2 td vars + Easter | | automatic | TR5 | | automatic | |
AO/LS/TC | | 7 td vars + Easter | | automatic | TRfull | | automatic | | AO/LS/TC | | automatic | | automatic |
More information and examples related to 'JDemetra+' features in the online documentation: https://jdemetra-new-documentation.netlify.app/
# \donttest{
myseries <- ipi_c_eu[, "FR"]
myspec1 <- regarima_spec_tramoseats(spec = "TRfull")
myreg1 <- regarima(myseries, spec = myspec1)
# To modify a pre-specified model specification
myspec2 <- regarima_spec_tramoseats(spec = "TRfull",
tradingdays.mauto = "Unused",
tradingdays.option = "WorkingDays",
easter.type = "Standard",
automdl.enabled = FALSE, arima.mu = TRUE)
myreg2 <- regarima(myseries, spec = myspec2)
# To modify the model specification of a "regarima" object
myspec3 <- regarima_spec_tramoseats(myreg1,
tradingdays.mauto = "Unused",
tradingdays.option = "WorkingDays",
easter.type = "Standard", automdl.enabled = FALSE,
arima.mu = TRUE)
myreg3 <- regarima(myseries, myspec3)
# To modify the model specification of a "regarima_spec" object
myspec4 <- regarima_spec_tramoseats(myspec1,
tradingdays.mauto = "Unused",
tradingdays.option = "WorkingDays",
easter.type = "Standard",
automdl.enabled = FALSE, arima.mu = TRUE)
myreg4 <- regarima(myseries, myspec4)
# Pre-specified outliers
myspec1 <- regarima_spec_tramoseats(spec = "TRfull",
usrdef.outliersEnabled = TRUE,
usrdef.outliersType = c("LS", "LS"),
usrdef.outliersDate = c("2008-10-01" ,"2003-01-01"),
usrdef.outliersCoef = c(10, -8), transform.function = "None")
s_preOut(myspec1)
myreg1 <- regarima(myseries, myspec1)
myreg1
s_preOut(myreg1)
# User-defined variables
var1 <- ts(rnorm(length(myseries))*10, start = start(myseries),
frequency = 12)
var2 <- ts(rnorm(length(myseries))*100, start = start(myseries),
frequency = 12)
var <- ts.union(var1, var2)
myspec1 <- regarima_spec_tramoseats(spec = "TRfull",
usrdef.varEnabled = TRUE, usrdef.var = var)
s_preVar(myspec1)
myreg1 <- regarima(myseries,myspec1)
myspec2 <- regarima_spec_tramoseats(spec = "TRfull",
usrdef.varEnabled = TRUE,
usrdef.var = var, usrdef.varCoef = c(17,-1),
transform.function = "None")
myreg2 <- regarima(myseries, myspec2)
# Pre-specified ARMA coefficients
myspec1 <- regarima_spec_tramoseats(spec = "TRfull",
arima.coefEnabled = TRUE, automdl.enabled = FALSE,
arima.p = 2, arima.q = 0, arima.bp = 1, arima.bq = 1,
arima.coef = c(-0.12, -0.12, -0.3, -0.99),
arima.coefType = rep("Fixed", 4))
myreg1 <- regarima(myseries, myspec1)
myreg1
summary(myreg1)
s_arimaCoef(myspec1)
s_arimaCoef(myreg1)
# }
Run the code above in your browser using DataLab