stringi (version 1.1.7)

stri_enc_isutf16be: Check If a Data Stream Is Possibly in UTF16 or UTF32

Description

These functions detect whether a given byte stream is valid UTF-16LE, UTF-16BE, UTF-32LE, or UTF-32BE.

Usage

stri_enc_isutf16be(str)

stri_enc_isutf16le(str)

stri_enc_isutf32be(str)

stri_enc_isutf32le(str)

Arguments

str

character vector, a raw vector, or a list of raw vectors

Value

Returns a logical vector.

Details

These functions are independent of the way R marks encodings in character strings (see Encoding and stringi-encoding). Anyway, most often, you will provide input data as raw vectors here.

Negative answer means that a string is surely not in valid UTF-16 or UTF-32. Positive result does not mean that we should be absolutely sure.

Also, note that sometimes a data stream may be classified as both valid UTF-16LE and UTF-16BE.

See Also

Other encoding_detection: stri_enc_detect2, stri_enc_detect, stri_enc_isascii, stri_enc_isutf8, stringi-encoding