Learn R Programming

minorparties (version 1.0.0)

process_platform_emphasis: Calculate platforms' issue-area emphasis scores

Description

process_platform_emphasis() takes a tibble of platforms, splits each platform into sentences, and calculates issue-area emphasis scores for each sentence and for the platform as a whole using the ManifestoBERTA model. These issue-area emphasis scores, respectively, represent the probability that each sentence is discussing each issue-area and the proportion of the platform that is devoted to each issue-area.

Usage

process_platform_emphasis(tibble, cleaning = TRUE)

Value

Tibble. The input tibble with two additional list columns (if a platform cannot be processed due to a lack of text, the function will return an empty list for that platform):

  • sentence_emphasis_scores: List column. A list per sentence in the platform (in order), containing:

    • sentence: Character. The sentence.

    • scores: Tibble. The sentence's emphasis score on each issue-area, containing:

      • issue: Character column. The issue-area name.

      • score: Numeric column. The sentence's score for that issue-area (summing to 1).

    • overall_emphasis_scores: List column. A tibble with the platform's overall emphasis scores, containing:

      • issue: Character column. The issue-area name.

      • score: Numeric column. The platform's score for that issue-area.

Arguments

tibble

Tibble. One row per platform, containing, at minimum:

  • text: Character column. The full text of each platform.

cleaning

Logical. Whether to apply basic text cleaning before processing each platform. Defaults to TRUE.

Examples

Run this code
if (FALSE) { # interactive()
tibble <- minorparties::sample_data
processed_tibble <- process_platform_emphasis(tibble)
}

Run the code above in your browser using DataLab