
Last chance! 50% off unlimited learning
Sale ends in
This function gives general statistics for a character vector,
e.g., obtained by loading a text file with the
readLines
or stri_read_lines
function,
where each text line' is represented by a separate string.
stri_stats_general(str)
character vector to be aggregated
Returns an integer vector with the following named elements:
Lines
- number of lines (number of
non-missing strings in the vector);
LinesNEmpty
- number of lines with at least
one non-WHITE_SPACE
character;
Chars
- total number of Unicode code points detected;
CharsNWhite
- number of Unicode code points
that are not WHITE_SPACE
s;
... (Other stuff that may appear in future releases of stringi).
None of the strings may contain \r
or \n
characters,
otherwise you will get at error.
Below by `white space` we mean the Unicode binary property
WHITE_SPACE
, see stringi-search-charclass
.
The official online manual of stringi at https://stringi.gagolewski.com/
Other stats:
stri_stats_latex()
# NOT RUN {
s <- c('Lorem ipsum dolor sit amet, consectetur adipisicing elit.',
'nibh augue, suscipit a, scelerisque sed, lacinia in, mi.',
'Cras vel lorem. Etiam pellentesque aliquet tellus.',
'')
stri_stats_general(s)
# }
Run the code above in your browser using DataLab