corpus_analytics

Produces a table of corpus analytics including numbers of complete observations at each step, word counts, lexical diversity (e.g., TTR), stopword ratios, etc. Granularity of the summary statistics are guided by the user (e.g., by conversation, by conversation and speaker, collapsed all)

Imports conversation transcripts into R, concatenates them into a single dataframe appending event identifiers, cleans and formats the text, then yokes user-specified psycholinguistic database values to each word.  'ConversationAlign' then computes alignment indices between two interlocutors across each transcript for >40 possible semantic, lexical, and affective dimensions. In addition to alignment, 'ConversationAlign' also produces a table of analytics (e.g., token count, type-token-ratio) in a summary table describing your particular text corpus.

Jamie Reilly

ConversationAlign

Process Text and Compute Linguistic Alignment in Conversation
Transcripts

Virginia Ulichney

Ben Sacks

Sarah Weinstein

Chelsea Helion

Gus Cooney

corpus_analytics function

<dl><dt>dat_prep</dt>
<dd>takes dataframe produced from the df_prep() function</dd></dl>

Arguments

corpus_analytics — corpus_analytics

<dl>

<dt>dat_prep</dt>
<dd>takes dataframe produced from the df_prep() function</dd>

</dl>

corpus_analytics: corpus_analytics

Description

Usage

Value

Arguments