This is a CPOConstructor
to be used to create a
CPO
. It is called like any R function and returns
the created CPO
.
Select columns by type or name. The parameters “type” and “pattern” are additive; if both are given, all column that match either will be returned.
cpoSelectFreeProperties
behaves just as cpoSelect
, with the additional function
that it is treated like a CPO
that removes all data properties from the data.
This disables the internal property check and can be useful when trying to compose CPO
s
that do not have compatible properties.
cpoSelect(type = character(0), index = integer(0), names = character(0),
pattern = NULL, pattern.ignore.case = FALSE, pattern.perl = FALSE,
pattern.fixed = FALSE, invert = FALSE, id, export = "export.default",
affect.type = NULL, affect.index = integer(0),
affect.names = character(0), affect.pattern = NULL,
affect.invert = FALSE, affect.pattern.ignore.case = FALSE,
affect.pattern.perl = FALSE, affect.pattern.fixed = FALSE)cpoSelectFreeProperties(type = character(0), index = integer(0),
names = character(0), pattern = NULL, pattern.ignore.case = FALSE,
pattern.perl = FALSE, pattern.fixed = FALSE, invert = FALSE, id,
export = "export.default", affect.type = NULL,
affect.index = integer(0), affect.names = character(0),
affect.pattern = NULL, affect.invert = FALSE,
affect.pattern.ignore.case = FALSE, affect.pattern.perl = FALSE,
affect.pattern.fixed = FALSE)
[character
]
One or more out of “numeric”, “ordered”, “factor”, “other”.
The type of columns to keep. Default is character(0)
.
[integer
]
Indices of columns to keep. Note that the index counts columns without the target column(s).
This and the next parameter make it possible to re-order columns. While all columns which match either
“type”, “pattern” or “index” remain in the resulting data, the ones
selected by “index” are put at the front in the order specified.
Default is integer(0)
.
[character
]
Names of columns to keep. Matching columns will be kept in order of their names occurring, but after
the columns indicated in “index”.
[character(1)
]
A pattern to match against the column names. Same as in grep
.
Default is NULL
for no matching.
[logical(1)
]
Influences behaviour of “pattern”: Whether to perform case insensitive matching. Same as in grep
.
Default is FALSE
.
[logical(1)
]
Influences behaviour of “pattern”: Should Perl-compatible regexps be used? Same as in grep
.
Default is FALSE
.
[logical(1)
]
Influences behaviour of “pattern”: Whether to use match pattern
as as is. Same as in grep
.
Default is FALSE
.
[logical(1)
]
Invert column selection: Drop the named columns and return the rest, instead of keeping the selected
columns only. Default is FALSE
.
[character(1)
]
id to use as prefix for the CPO's hyperparameters. this
must be used to avoid name clashes when composing two
CPOs of the same type, or with learners or other CPOS
with hyperparameters with clashing names.
[character
]
Either a character vector indicating the parameters to
export as hyperparameters, or one of the special values
“export.all” (export all parameters),
“export.default” (export all parameters that are exported by default),
“export.set” (export all parameters that were set during construction),
“export.default.set” (export the intersection of the “default” and “set” parameters),
“export.unset” (export all parameters that were not set during construction) or
“export.default.unset” (export the intersection of the “default” and “unset” parameters).
Default is “export.default”.
[character
| NULL
]
Type of columns to affect. A subset of “numeric”, “factor”, “ordered”, “other”, or NULL
to not match by column type. Default is NULL
.
[numeric
]
Indices of feature columns to affect. The order of indices given is respected. Target column indices are not counted
(since target columns are always included). Default is integer(0)
.
[character
]
Feature names of feature columns to affect. The order of names given is respected. Default is character(0)
.
[character(1)
| NULL
]
grep
pattern to match feature names by. Default is NULL
(no pattern matching)
[logical(1)
]
Whether to affect all features not matched by other affect.*
parameters.
[logical(1)
]
Ignore case when matching features with affect.pattern
; see grep
. Default is FALSE
.
[logical(1)
]
Use Perl-style regular expressions for affect.pattern
; see grep
. Default is FALSE
.
[logical(1)
]
Use fixed matching instead of regular expressions for affect.pattern
; see grep
. Default is FALSE
.
[CPO
].
This function creates a CPO object, which can be applied to
Task
s, data.frame
s, link{Learner}
s
and other CPO objects using the %>>%
operator.
The parameters of this object can be changed after creation
using the function setHyperPars
. The other
hyper-parameter manipulating functins, getHyperPars
and getParamSet
similarly work as one expects.
If the “id” parameter is given, the hyperparameters will have this id as aprefix; this will, however, not change the parameters of the creator function.
CPO constructor functions are called with optional values of parameters, and additional “special” optional values.
The special optional values are the id
parameter, and the affect.*
parameters. The affect.*
parameters
enable the user to control which subset of a given dataset is affected. If no affect.*
parameters are given, all
data features are affected by default.
Other CPOs: cpoApplyFunRegrTarget
,
cpoApplyFun
, cpoAsNumeric
,
cpoCache
, cpoCbind
,
cpoCollapseFact
,
cpoDropConstants
,
cpoDummyEncode
,
cpoFilterAnova
,
cpoFilterCarscore
,
cpoFilterChiSquared
,
cpoFilterFeatures
,
cpoFilterGainRatio
,
cpoFilterInformationGain
,
cpoFilterKruskal
,
cpoFilterLinearCorrelation
,
cpoFilterMrmr
, cpoFilterOneR
,
cpoFilterPermutationImportance
,
cpoFilterRankCorrelation
,
cpoFilterRelief
,
cpoFilterRfCImportance
,
cpoFilterRfImportance
,
cpoFilterRfSRCImportance
,
cpoFilterRfSRCMinDepth
,
cpoFilterSymmetricalUncertainty
,
cpoFilterUnivariate
,
cpoFilterVariance
,
cpoFixFactors
, cpoIca
,
cpoImpactEncodeClassif
,
cpoImpactEncodeRegr
,
cpoImputeConstant
,
cpoImputeHist
,
cpoImputeLearner
,
cpoImputeMax
, cpoImputeMean
,
cpoImputeMedian
,
cpoImputeMin
, cpoImputeMode
,
cpoImputeNormal
,
cpoImputeUniform
, cpoImpute
,
cpoLogTrafoRegr
, cpoMakeCols
,
cpoMissingIndicators
,
cpoModelMatrix
,
cpoOversample
, cpoPca
,
cpoProbEncode
,
cpoQuantileBinNumerics
,
cpoRegrResiduals
,
cpoResponseFromSE
, cpoSample
,
cpoScaleMaxAbs
,
cpoScaleRange
, cpoScale
,
cpoSmote
, cpoSpatialSign
,
cpoTransformParams
, cpoWrap
,
makeCPOCase
, makeCPOMultiplex