tm_clean

A Meeting Query dataset in the form of a data frame.

data

A character vector accepting either <code>"words"</code> or <code>"ngrams"</code>,
determining type of tokenisation to return.

token

A single-column data frame labelled <code>'word'</code> containing
custom stopwords to remove.

stopwords

This function processes the <code>Subject</code> column in a Meeting Query by applying
tokenisation using<code>tidytext::unnest_tokens()</code>, and removing any stopwords
supplied in a data frame (using the argument <code>stopwords</code>). This is a
sub-function that feeds into <code>tm_freq()</code>, <code>tm_cooc()</code>, and <code>tm_wordcloud()</code>.
The default is to return a data frame with tokenised counts of words or
ngrams.

Opinionated functions that enable easier and faster
analysis of Workplace Analytics data. There are three main types of functions in 'wpa':
(i) Standard functions create a 'ggplot' visual or a summary table based on a specific
Workplace Analytics metric; (2) Report Generation functions generate HTML reports on
a specific analysis area, e.g. Collaboration; (3) Other miscellaneous functions cover
more specific applications (e.g. Subject Line text mining) of Workplace Analytics data.
This package adheres to 'tidyverse' principles and works well with the pipe syntax.
'wpa' is built with the beginner-to-intermediate R users in mind, and is optimised for
simplicity.

Martin Chan

Tools for Analysing and Visualising Workplace Analytics Data

Carlos Morales

tm_clean: Clean subject line text prior to analysis

Description

Usage

Arguments

Value

See Also

Examples