stringi (version 1.2.4)

stri_unescape_unicode: Unescape All Escape Sequences

Description

Unescapes all known escape sequences

Usage

stri_unescape_unicode(str)

Arguments

str

character vector

Value

Returns a character vector. If an escape sequence is ill-formed, result will be NA and a warning will be given.

Details

Uses ICU facilities to unescape Unicode character sequences.

The following ASCII standard escapes are recognized: \a, \b, \t, \n, \v, \?, \e, \f, \r, \", \', \\.

Moreover, the function understands the following ones: \uXXXX (4 hex digits), \UXXXXXXXX (8 hex digits), \xXX (1-2 hex digits), \ooo (1-3 octal digits), \cX (control-X; X is masked with 0x1F). For \xXX and \ooo beware of non-valid UTF8 byte sequences.

Note that some versions of R on Windows cannot handle characters defined with \UXXXXXXXX. We are working on that.

See Also

Other escape: stri_escape_unicode

Examples

Run this code
# NOT RUN {
stri_unescape_unicode("a\\u0105!\\u0032\\n")

# }

Run the code above in your browser using DataLab