quanteda (version 2.1.2)

tokens_tortl: [Experimental] Change direction of words in tokens

Description

This function adds a Unicode direction mark to tokens types for punctuations and symbols to correct how right-to-left languages (e.g. Arabic, Hebrew, Persian, and Urdu) are printed in HTML-based consoles (e.g. R Studio). This is an experimental function subject to future change.

Usage

tokens_tortl(x)

char_tortl(x)

Arguments

x

the input object whose punctuation marks will be modified by the direction mark