Provides a robust and reproducible pipeline for extracting, cleaning, and analyzing athlete performance data generated by 'VALD' 'ForceDecks' systems. The package supports batch-oriented data processing for large datasets, standardized data transformation workflows, and visualization utilities for sports science research and performance monitoring. It is designed to facilitate reproducible analysis across multiple sports with comprehensive documentation and error handling.
The vald.extractor package extends the valdr package by providing a fault-tolerant, production-ready pipeline for extracting, cleaning, and visualizing VALD ForceDecks data across multiple sports. It implements chunked batch processing to prevent timeout errors, OAuth2 authentication for metadata enrichment, and automated sports taxonomy mapping.
**Data Extraction:**
fetch_vald_batch: Chunked trial extraction with fault tolerance
fetch_vald_metadata: OAuth2 authentication for profiles and groups
**Data Cleaning:**
standardize_vald_metadata: Unnest groups and create unified athlete records
classify_sports: Automated sports taxonomy mapping
patch_metadata: Fix missing demographics from external files
**Data Transformation:**
split_by_test: Split by test type with suffix removal
**Analysis & Visualization:**
summary_vald_metrics: Generate summary statistics
plot_vald_trends: Longitudinal trend visualization
plot_vald_compare: Cross-sectional group comparisons
Fault-tolerant chunked extraction prevents API timeout errors
Automated sports classification saves hours of manual categorization
Generic programming with suffix removal enables DRY analysis code
Publication-ready visualizations with professional themes
Maintainer: Praveen D Chougale praveenmaths89@gmail.com
Authors:
Usha Ananthakumar usha@som.iitb.ac.in
Useful links: