R does not allow non-ASCII characters in preloaded datasets, and linguistic datasets can hardly fit within ASCII. Unicode is, however, allowed in raw data files. They cannot be automatically loaded when soundcorrs
is attached because staged install makes it impossible to use system.file
in this manner, and they cannot be included as a Unicode-escaped output of dput
because Windows does not know how to convert this to its native encoding. This function makes the process of loading as painless as possible.