stringi (version 1.1.7)

stri_numbytes: Count the Number of Bytes

Description

Counts the number of bytes needed to store each string in computer's memory.

Usage

stri_numbytes(str)

Arguments

str

character vector or an object coercible to

Value

Returns an integer vector of the same length as str.

Details

This is often not the function you would normally use in your string processing activities. See rather stri_length.

For 8-bit encoded strings, this is the same as stri_length. For UTF-8 strings, the returned values may be greater than the number of code points, as UTF-8 is not a fixed-byte encoding: one code point may be encoded by 1-4 bytes (according to the current Unicode standard).

Missing values are handled properly, as opposed to the built-in nchar(str, "bytes") function call.

The strings do not need to be re-encoded to perform this operation.

The returned values does not of course include the trailing NUL bytes, which are used internally to mark the end of string data (in C).

See Also

Other length: stri_isempty, stri_length, stri_width

Examples

Run this code
# NOT RUN {
stri_numbytes(letters)
stri_numbytes(c('abc', '123', '\u0105\u0104'))

# }
# NOT RUN {
# this used to fail on Windows, as there was no native support for 4-bytes
# Unicode characters; see, however, stri_escape_unicode():
stri_numbytes('\U7fffffff') # compare stri_length('\U7fffffff')
# }
# NOT RUN {
# }

Run the code above in your browser using DataCamp Workspace