Train a full model pipeline including text preprocessing, TF-IDF vectorization, random forest tuning, and training.
A list containing the trained TF-IDF model, vectorizer, random forest model, and test accuracy.
A named list for custom slang replacements (optional).
Maximum number of features for TF-IDF vectorizer (default 10000).
Minimum document frequency for TF-IDF (default 2).
Maximum document frequency for TF-IDF (default 0.8).
Grid of values for `mtry` parameter to tune in random forest (default: c(5, 10, 20)).
Grid of values for `ntree` parameter to tune in random forest (default: c(100, 200, 300)).
Path to the stopwords RDS file (default: "final_stopwords.rds").
Path to save the trained vectorizer (default: "trained_vectorizer.rds").
Path to save the trained TF-IDF model (default: "trained_tfidf_model.rds").
Path to save the trained random forest model (default: "trained_rf_ranger_model.rds").
Path to cache the training data frame (default: "train_df_cached.rds").