Generates summary statistics for climate, environmental and health data
air_pollution_descriptive_stats(
data,
env_labels = c(pm25 = "PM2.5 (µg/m³)", tmax = "Max Temperature (°C)", precipitation
= "Precipitation (mm)", humidity = "Humidity (%)", wind_speed = "Wind Speed (m/s)"),
save_outputs = FALSE,
output_dir = NULL,
moving_average_window = 3L,
plot_corr_matrix = FALSE,
correlation_method = "pearson",
plot_dist = FALSE,
plot_na_counts = FALSE,
plot_scatter = FALSE,
plot_box = FALSE,
plot_seasonal = FALSE,
plot_regional = FALSE,
plot_total = FALSE,
detect_outliers = FALSE,
calculate_rate = FALSE
)Invisibly returns the national data with moving averages
Dataframe containing a daily time series of climate, environmental and health data
Named vector. Labels for environmental variables with units.
Logical. Whether to save outputs. Defaults to FALSE.
Character. Directory to save descriptive statistics. Defaults to NULL.
Numeric. Window size for moving average calculations. Defaults to 3 (3-day moving average).
Logical. Whether to plot correlation matrix. Defaults to FALSE.
Character. Correlation method. One of 'pearson', 'spearman', 'kendall'.
Logical. Whether to plot distribution histograms. Defaults to FALSE.
Logical. Whether to plot NA counts. Defaults to FALSE.
Logical. Whether to plot scatter plots. Defaults to FALSE.
Logical. Whether to plot box plots. Defaults to FALSE.
Logical. Whether to plot seasonal trends. Defaults to FALSE.
Logical. Whether to plot regional trends. Defaults to FALSE.
Logical. Whether to plot total health outcomes per year. Defaults to FALSE.
Logical. Whether to detect outliers. Defaults to FALSE.
Logical. Whether to calculate rate per 100k people.. Defaults to FALSE.