Learn R Programming

vald.extractor (version 0.1.1)

classify_sports: Automated Sports Taxonomy Mapping

Description

Applies regex-based pattern matching to standardize inconsistent sport/team naming conventions into a clean categorical variable. This is the core "value-add" for multi-sport organizations where team names may vary (e.g., "Football", "Soccer", "FSI" all map to "Football").

Usage

classify_sports(
  data,
  group_col = "all_group_names",
  output_col = "sports_clean"
)

Value

Data frame with an additional column containing standardized sports categories.

Arguments

data

Data frame containing athlete metadata.

group_col

Character. Name of the column containing group/team names. Default is "all_group_names".

output_col

Character. Name for the new standardized sports column. Default is "sports_clean".

Details

Classify Sports from Group Names

Examples

Run this code
# \donttest{
if (FALSE) {
  metadata <- standardize_vald_metadata(profiles, groups)
  metadata <- classify_sports(metadata)
  table(metadata$sports_clean)
}
# }

Run the code above in your browser using DataLab