stringi v1.4.3



Character String Processing Facilities

Fast, correct, consistent, portable, as well as convenient character string/text processing in every locale and any native encoding. Owing to the use of the 'ICU' (International Components for Unicode) library, the package provides 'R' users with platform-independent functions known to 'Java', 'Perl', 'Python', 'PHP', and 'Ruby' programmers. Available features include: pattern searching (e.g., with 'Java'-like regular expressions or the 'Unicode' collation algorithm), random string generation, case mapping, string transliteration, concatenation, Unicode normalization, date-time formatting and parsing, and many more.
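
A few of the features listed above can be sketched as follows. This is a minimal illustration, assuming the package is installed; the input vector and variable names are made up for the example, and only functions exported by stringi are called.

```r
library(stringi)

# Illustrative input (hypothetical data, not taken from the package docs)
x <- c("stringi", "ICU", NA)

# Count the code points in each string; NA stays NA
n_points <- stri_length(x)

# Pattern searching with an ICU regular expression:
# which elements consist solely of uppercase ASCII letters?
all_caps <- stri_detect_regex(x, "^[A-Z]+$")

# Case mapping
upper <- stri_trans_toupper(x)

# Concatenation, vectorized over the arguments
joined <- stri_join("pkg", c("a", "b"), sep = "-")

# Unicode normalization: "a" followed by a combining acute accent
# is composed into a single code point under NFC
composed <- stri_trans_nfc("a\u0301")
n_composed <- stri_length(composed)
```

Missing values propagate through each call rather than raising an error, so results stay aligned with the input vector.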

Functions in stringi

Name Description
stri_enc_set Set or Get Default Character Encoding in stringi
stri_count Count the Number of Pattern Matches
stri_duplicated Determine Duplicated Elements
stri_enc_detect Detect Character Set and Language
stri_length Count the Number of Code Points
stri_enc_isutf16be Check If a Data Stream Is Possibly in UTF-16 or UTF-32
stri_enc_isutf8 Check If a Data Stream Is Possibly in UTF-8
stri_locale_info Query Given Locale
stri_locale_list List Available Locales
stri_opts_fixed Generate a List with Fixed Pattern Search Engine's Settings
stri_list2matrix Convert a List to a Character Matrix
stri_locale_set Set or Get Default Locale in stringi
stri_count_boundaries Count the Number of Text Boundaries
stri_opts_regex Generate a List with Regex Matcher Settings
stri_datetime_add Date and Time Arithmetic
stri_timezone_info Query a Given Time Zone
stri_subset Select Elements that Match a Given Pattern
stri_width Determine the Width of Code Points
stri_datetime_now Get Current Date and Time
stri_locate_all Locate Occurrences of a Pattern
stri_wrap Word Wrap Text to Format Paragraphs
stringi-search String Searching
stri_datetime_symbols List Localizable Date-Time Formatting Data
stri_stats_general General Statistics for a Character Vector
stri_read_raw [DRAFT API] Read Whole Text File as Raw
stri_remove_empty Remove All Empty Strings from a Character Vector
stri_enc_toutf8 Convert Strings To UTF-8
stri_encode Convert Strings Between Given Encodings
stri_extract_all_boundaries Extract Data Between Text Boundaries
stri_flatten Flatten a String
stri_match_all Extract Regex Pattern Matches, Together with Capture Groups
stri_locate_all_boundaries Locate Text Boundaries
stri_split_boundaries Split a String at Text Boundaries
stri_split Split a String By Pattern Matches
stri_stats_latex Statistics for a Character Vector Containing LaTeX Commands
stri_unescape_unicode Un-escape All Escape Sequences
stri_unique Extract Unique Elements
stri_trans_tolower Transform Strings with Case Mapping
stringi-encoding Character Encodings and stringi
stri_trans_char Translate Characters
stringi-locale Locales and stringi
stringi-search-charclass Character Classes in stringi
stringi-search-coll Locale-Sensitive Text Searching in stringi
%s<% Compare Strings with or without Collation
%s+% Concatenate Two Character Vectors
stri_enc_detect2 Detect Locale-Sensitive Character Encoding
stri_datetime_format Date and Time Formatting and Parsing
stri_datetime_fstr Convert strptime-style Format Strings
stri_datetime_create Create a Date-Time Object
stri_enc_fromutf32 Convert From UTF-32
stri_info Query Default Settings for stringi
stri_isempty Determine if a String is of Length Zero
stri_na2empty Replace NAs with Empty Strings
stri_numbytes Count the Number of Bytes
stri_order Ordering Permutation
stri_datetime_fields Get Values for Date and Time Fields
stri_pad_both Pad (Center/Left/Right Align) a String
stri_reverse Reverse Each String
stri_sort Sorting
stri_sub Extract a Substring From or Replace a Substring In a Character Vector
stri_sub_all Extract or Replace Multiple Substrings
stri_trans_nfc Perform or Check For Unicode Normalization
stri_enc_info Query a Character Encoding
stri_detect Detect a Pattern Match
stri_dup Duplicate Strings
stri_trim_both Trim Characters from the Left and/or Right Side of a String
stri_enc_isascii Check If a Data Stream Is Possibly in ASCII
stri_enc_tonative Convert Strings To Native Encoding
stringi-package THE String Processing Package
stri_enc_toutf32 Convert Strings To UTF-32
stringi-search-boundaries Text Boundary Analysis in stringi
stri_escape_unicode Escape Unicode Code Points
stri_enc_list List Known Character Encodings
stri_extract_all Extract Occurrences of a Pattern
stri_rand_lipsum A Lorem Ipsum Generator
stri_rand_shuffle Randomly Shuffle Code Points in Each String
stri_replace_all Replace Occurrences of a Pattern
stri_replace_na Replace Missing Values in a Character Vector
stri_timezone_list List Available Time Zone Identifiers
stri_enc_mark Get Declared Encodings of Each String
stri_join Concatenate Character Vectors
stri_timezone_get Set or Get Default Time Zone in stringi
stri_write_lines [DRAFT API] Write Text Lines to a Text File
stri_join_list Concatenate Strings in a List
stringi-arguments Passing Arguments to Functions in stringi
stri_opts_brkiter Generate a List with BreakIterator Settings
stri_opts_collator Generate a List with Collator Settings
stri_rand_strings Generate Random Strings
stri_read_lines [DRAFT API] Read Text Lines from a Text File
stri_split_lines Split a String Into Text Lines
stri_startswith Determine if the Start or End of a String Matches a Pattern
stri_trans_general General Text Transforms, Including Transliteration
stri_trans_list List Available Text Transforms and Transliterators
stringi-search-fixed Locale-Insensitive Fixed Pattern Matching in stringi
stringi-search-regex Regular Expressions in stringi
stri_compare Compare Strings with or without Collation
stri_enc_toascii Convert To ASCII

Date 2019-03-12
SystemRequirements ICU4C (>= 52, optional)
Type Package
Biarch TRUE
License file LICENSE
RoxygenNote 6.1.1
NeedsCompilation yes
Packaged 2019-03-12 21:12:07 UTC; gagolews
License_is_FOSS yes
Repository CRAN
Date/Publication 2019-03-12 23:20:03 UTC
