This function preprocesses article text for further analysis.
preprocess_text(
text_data,
text_column = "abstract",
remove_stopwords = TRUE,
custom_stopwords = NULL,
stem_words = FALSE,
min_word_length = 3,
max_word_length = 50
)A data frame with processed text and extracted terms.
A data frame containing article text data (title, abstract, etc.).
Name of the column containing text to process.
Logical. If TRUE, removes stopwords.
Character vector of additional stopwords to remove.
Logical. If TRUE, applies stemming to words.
Minimum word length to keep.
Maximum word length to keep.