This function is a wrapper around iconv()
that allows for converting
character encoding of multiple text files in a corpus folder, preferably
into UTF-8.
change.encoding(corpus.dir = "corpus/", from, to = "utf-8",
keep.original = TRUE, output.dir = NULL)
The function saves reencoded text files.
path to the folder containing the corpus.
original character encoding. See the Details section (below) for some hints on how to get the original encoding.
character encoding to convert into.
shall the original files be stored?
folder for the reencoded files.
Steffen Pielström
Stylo works on UTF-8-enconded texts by default. This function allows you to convert your corpus, if not yet encoded in UTF-8. To check the current encoding of text files in your corpus folder, you can use the function check.encoding()
.
check.encoding