AbbreviationPipe
class is responsible for detecting
the existing abbreviations in the data field of each Instance
.
Identified abbreviations are stored inside the abbreviation field of
Instance
class. Moreover if needed, is able to perform inline
abbreviations replacement.
This class inherits from GenericPipe
and implements the
pipe
abstract function.
bdpar::GenericPipe
-> AbbreviationPipe
new()
Creates a AbbreviationPipe
object.
AbbreviationPipe$new(
propertyName = "abbreviation",
propertyLanguageName = "language",
alwaysBeforeDeps = list("GuessLanguagePipe"),
notAfterDeps = list(),
replaceAbbreviations = TRUE,
resourcesAbbreviationsPath = NULL
)
propertyName
A character
value. Name of the property
associated with the GenericPipe
.
propertyLanguageName
A character
value. Name of the
language property.
alwaysBeforeDeps
A list
value. The dependencies
alwaysBefore (GenericPipes
that must be executed before
this one).
notAfterDeps
A list
value. The dependencies
notAfter (GenericPipes
that cannot be executed after
this one).
replaceAbbreviations
A logical
value. Indicates if
the abbreviations are replaced or not.
resourcesAbbreviationsPath
A character
value. Path
of resource files (in json format) containing the correspondence between
abbreviations and meaning.
pipe()
Preprocesses the Instance
to obtain/replace
the abbreviations. The abbreviations found in the data are added to the
list of properties of the Instance
.
AbbreviationPipe$pipe(instance)
instance
A Instance
value. The Instance
to preprocess.
The Instance
with the modifications that have
occurred in the pipe.
findAbbreviation()
Checks if the abbreviation is in the data.
AbbreviationPipe$findAbbreviation(data, abbreviation)
data
A character
value. The text where abbreviation
will be searched.
abbreviation
A character
value. Indicates the
abbreviation to find.
A logical
value depending on whether the
abbreviation is in the data.
replaceAbbreviation()
Replaces the abbreviation in the data for the extendedAbbreviation.
AbbreviationPipe$replaceAbbreviation(abbreviation, extendedAbbreviation, data)
abbreviation
A character
value. Indicates the
abbreviation to replace.
extendedAbbreviation
A character
value. Indicates the
string to replace for the abbreviations found.
data
A character
value. The text where abbreviation
will be replaced.
The data with the abbreviations replaced.
getPropertyLanguageName()
Gets the name of property language.
AbbreviationPipe$getPropertyLanguageName()
Value of name of property language.
getResourcesAbbreviationsPath()
Gets the path of abbreviations resources.
AbbreviationPipe$getResourcesAbbreviationsPath()
Value of path of abbreviations resources.
setResourcesAbbreviationsPath()
Sets the path of abbreviations resources.
AbbreviationPipe$setResourcesAbbreviationsPath(path)
path
A character
value. The new value of the path of
abbreviations resources.
clone()
The objects of this class are cloneable with this method.
AbbreviationPipe$clone(deep = FALSE)
deep
Whether to make a deep clone.
AbbreviationPipe
class requires the resource files (in json format)
containing the correspondence between abbreviations and meaning. To this end,
the language of the text indicated in the propertyLanguageName should
be contained in the resource file name (ie. abbrev.xxx.json where xxx is the
value defined in the propertyLanguageName ). The location of the
resources should be defined in the "resources.abbreviations.path"
field of bdpar.Options variable.
bdpar.Options
, ContractionPipe
,
File2Pipe
, FindEmojiPipe
,
FindEmoticonPipe
, FindHashtagPipe
,
FindUrlPipe
, FindUserNamePipe
,
GuessDatePipe
, GuessLanguagePipe
,
Instance
, InterjectionPipe
,
MeasureLengthPipe
, GenericPipe
,
ResourceHandler
, SlangPipe
,
StopWordPipe
, StoreFileExtPipe
,
TargetAssigningPipe
, TeeCSVPipe
,
ToLowerCasePipe