Learn R Programming

⚠️There's a newer version (1.8.7) of this package.Take me there.

stringi (version 1.1.6)

Character String Processing Facilities

Description

Allows for fast, correct, consistent, portable, as well as convenient character string/text processing in every locale and any native encoding. Owing to the use of the ICU library, the package provides R users with platform-independent functions known to Java, Perl, Python, PHP, and Ruby programmers. Available features include: pattern searching (e.g., with ICU Java-like regular expressions or the Unicode Collation Algorithm), random string generation, case mapping, string transliteration, concatenation, Unicode normalization, date-time formatting and parsing, etc.

Copy Link

Version

Install

install.packages('stringi')

Monthly Downloads

1,258,679

Version

1.1.6

License

file LICENSE

Maintainer

Marek Gagolewski

Last Published

November 17th, 2017

Functions in stringi (1.1.6)

stri_datetime_create

[DRAFT API] Create a Date-Time Object
stri_datetime_format

[DRAFT API] Date and Time Formatting and Parsing
stri_datetime_fields

[DRAFT API] Get Values for Date and Time Fields
stri_count

Count the Number of Pattern Matches
%s<%

Compare Strings with or without Collation
%s+%

Concatenate Two Character Vectors
stri_datetime_fstr

[DRAFT API] Convert strptime-style Format Strings
stri_compare

Compare Strings with or without Collation
stri_count_boundaries

Count the Number of Text Boundaries
stri_datetime_symbols

[DRAFT API] List Localizable Date-Time Formatting Data
stri_datetime_add

[DRAFT API] Date and Time Arithmetic
stri_enc_detect2

[DRAFT API] Detect Locale-Sensitive Character Encoding
stri_enc_set

Set or Get Default Character Encoding in stringi
stri_duplicated

Determine Duplicated Elements
stri_enc_toascii

Convert To ASCII
stri_datetime_now

[DRAFT API] Get Current Date and Time
stri_enc_info

Query a Character Encoding
stri_enc_isutf16be

Check If a Data Stream Is Possibly in UTF16 or UTF32
stri_detect

Detect a Pattern Match
stri_enc_fromutf32

Convert From UTF-32
stri_enc_isutf8

Check If a Data Stream Is Possibly in UTF-8
stri_enc_isascii

Check If a Data Stream Is Possibly in ASCII
stri_length

Count the Number of Code Points
stri_enc_list

List Known Character Encodings
stri_list2matrix

Convert a List to a Character Matrix
stri_enc_mark

Get Declared Encodings of Each String
stri_enc_detect

[DRAFT API] Detect Character Set and Language
stri_locale_info

Query Given Locale
stri_enc_toutf8

Convert Strings To UTF-8
stri_locale_list

List Available Locales
stri_encode

Convert Strings Between Given Encodings
stri_join

Concatenate Character Vectors
stri_locate_all_boundaries

Locate Specific Text Boundaries
stri_join_list

Concatenate Strings in a List
stri_match_all

Extract Regex Pattern Matches, Together with Capture Groups
stri_escape_unicode

Escape Unicode Code Points
stri_extract_all

Extract Occurrences of a Pattern
stri_pad_both

Pad (Center/Left/Right Align) a String
stri_locale_set

Set or Get Default Locale in stringi
stri_rand_lipsum

A Lorem Ipsum Generator
stri_locate_all

Locate Occurrences of a Pattern
stri_read_lines

[DRAFT API] Read Text Lines from a Text File
stri_info

Query Default Settings for stringi
stri_read_raw

[DRAFT API] Read Whole Text File as Raw
stri_isempty

Determine if a String is of Length Zero
stri_subset

Select Elements that Match a Given Pattern
stri_numbytes

Count the Number of Bytes
stri_timezone_info

[DRAFT API] Query a Given Time Zone
stri_opts_brkiter

Generate a List with BreakIterator Settings
stri_extract_all_boundaries

Extract Text Between Text Boundaries
stri_dup

Duplicate Strings
stri_flatten

Flatten a String
stri_enc_tonative

Convert Strings To Native Encoding
stri_enc_toutf32

Convert Strings To UTF-32
stri_opts_collator

Generate a List with Collator Settings
stri_stats_general

General Statistics for a Character Vector
stri_timezone_list

[DRAFT API] List Available Time Zone Identifiers
stri_opts_fixed

Generate a List with Fixed Pattern Search Engine's Settings
stri_timezone_get

[DRAFT API] Set or Get Default Time Zone in stringi
stri_opts_regex

Generate a List with Regex Matcher Settings
stri_rand_shuffle

Randomly Shuffle Code Points in Each String
stri_rand_strings

Generate Random Strings
stri_order

Ordering Permutation and Sorting
stri_stats_latex

Statistics for a Character Vector Containing LaTeX Commands
stri_startswith

Determine if the Start or End of a String Matches a Pattern
stri_sub

Extract a Substring From or Replace a Substring In a Character Vector
stri_unescape_unicode

Unescape All Escape Sequences
stri_replace_all

Replace Occurrences of a Pattern
stri_unique

Extract Unique Elements
stri_replace_na

Replace Missing Values in a Character Vector
stringi-package

THE String Processing Package
stri_width

Determine the Width of Code Points
stri_reverse

Reverse Each String
stri_wrap

Word Wrap Text to Format Paragraphs
stringi-search-fixed

Locale-Insensitive Fixed Pattern Matching in stringi
stri_split

Split a String By Pattern Matches
stringi-search-regex

Regular Expressions in stringi
stri_split_boundaries

Split a String at Specific Text Boundaries
stri_trans_nfc

Perform or Check For Unicode Normalization
stri_split_lines

Split a String Into Text Lines
stri_trim_both

Trim Characters from the Left and/or Right Side of a String
stri_trans_general

General Text Transforms, Including Transliteration
stringi-locale

Locales and stringi
stringi-encoding

Character Encodings and stringi
stringi-search-coll

Locale-Sensitive Text Searching in stringi
stri_trans_list

List Available Text Transforms and Transliterators
stringi-search

String Searching
stringi-search-boundaries

Text Boundary Analysis in stringi
stri_write_lines

[DRAFT API] Write Text Lines to a Text File
stringi-arguments

Passing Arguments to Functions in stringi
stri_trans_tolower

Transform String with Case Mapping
stri_trans_char

Translate Characters
stringi-search-charclass

Character Classes in stringi