calculate_bigram_probabilities

Internal helper that uses dplyr to build bigrams from the input data and
calculate their joint probabilities, together with the marginal
(individual token) probabilities.
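
The computation described above can be sketched roughly as follows. This is a minimal illustration and not the package's actual implementation: the toy data, the column names (`doc_id`, `token_id`, `type`), and the output names (`x`, `y`, `p_xy`, `p_x`, `p_y`) are assumptions made for the example. Marginals are taken here over bigram positions, which may differ from how the helper defines them.

```r
library(dplyr)

# Toy corpus in long format: one row per token (hypothetical column names).
corpus <- data.frame(
  doc_id   = c(1, 1, 1, 2, 2, 2),
  token_id = c(1, 2, 3, 1, 2, 3),
  type     = c("the", "cat", "sat", "the", "cat", "ran"),
  stringsAsFactors = FALSE
)

# Pair each token (x) with the token that follows it (y) in the same document.
bigrams <- corpus %>%
  arrange(doc_id, token_id) %>%
  group_by(doc_id) %>%
  mutate(x = type, y = lead(type)) %>%
  ungroup() %>%
  filter(!is.na(y))  # the last token of a document starts no bigram

n_bigrams <- nrow(bigrams)

bigram_probs <- bigrams %>%
  count(x, y, name = "n_xy") %>%
  # Joint probability: share of all bigrams that are exactly (x, y).
  mutate(p_xy = n_xy / n_bigrams) %>%
  # Marginal probabilities: sum the joint probability over the other slot.
  group_by(x) %>%
  mutate(p_x = sum(p_xy)) %>%
  group_by(y) %>%
  mutate(p_y = sum(p_xy)) %>%
  ungroup()

print(bigram_probs)
```

Keeping the joint and marginal probabilities side by side in one table makes it straightforward to derive association measures from them in a later step.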

internal

Support package for the textbook "An Introduction to
Quantitative Text Analysis for Linguists: Reproducible Research Using
R" (Francom, 2024) <doi:10.4324/9781003393764>. Includes functions to
acquire, clean, and analyze text data as well as functions to document
and share the results of text analysis. The package is designed to be
used in conjunction with the book, but can also be used as a standalone
package for text analysis.

Jerid Francom

qtkit

Quantitative Text Kit

calculate_bigram_probabilities function

Arguments

<dl><dt>data</dt>
<dd>A data frame containing the corpus</dd>
<dt>doc_index</dt>
<dd>Column name for document index</dd>
<dt>token_index</dt>
<dd>Column name for token position</dd>
<dt>type</dt>
<dd>Column name for the actual tokens/terms</dd></dl>
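
The arguments imply a long-format corpus with one row per token. A small base R sketch of such a data frame is shown here for orientation. The texts, the whitespace tokenization, and the column names `doc_id`, `token_id`, and `type` are all hypothetical. The source does not state whether column names are passed quoted or unquoted, so no call to the helper is shown.

```r
# Two short example documents (invented for illustration).
texts <- c(d1 = "the cat sat", d2 = "the cat ran")

# Naive whitespace tokenization; a real corpus would need proper tokenization.
tokens <- strsplit(texts, " ", fixed = TRUE)
n_tokens <- unname(lengths(tokens))

corpus <- data.frame(
  doc_id   = rep(names(tokens), n_tokens),     # would map to `doc_index`
  token_id = sequence(n_tokens),               # would map to `token_index`
  type     = unlist(tokens, use.names = FALSE), # would map to `type`
  stringsAsFactors = FALSE
)

print(corpus)
```

The token position restarts at 1 in each document, so the document index and the token position together identify every row.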

Calculate Probabilities for Bigrams — calculate_bigram_probabilities

