jwutil

This is a set of simple utilities for various data manipulation and caching tasks. The goal is to use base functions well, without bringing in any dependencies. The main areas of interest are data frame manipulation, such as converting factors into multiple binary indicator columns, and disk caching of data frames (optionally by date range). There are also testing functions that extend testthat by permuting the arguments to function calls.
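To make the factor-to-indicator idea concrete, here is a minimal base R sketch. The helper name `factor_to_indicators` and the `field_level` column naming are assumptions made for illustration; they are not taken from jwutil, whose own functions may behave differently.

```r
# Hypothetical helper, not part of jwutil: expand one factor column of a
# data frame into logical indicator columns, one per factor level.
factor_to_indicators <- function(df, field) {
  stopifnot(is.data.frame(df), is.factor(df[[field]]))
  lv <- levels(df[[field]])
  # One logical vector per level; an NA in the factor stays NA in each column.
  ind <- lapply(lv, function(l) df[[field]] == l)
  names(ind) <- paste(field, lv, sep = "_")
  # Drop the original factor column and append the indicator columns.
  cbind(df[setdiff(names(df), field)], data.frame(ind, check.names = FALSE))
}

pets <- data.frame(id = 1:3, kind = factor(c("cat", "dog", "cat")))
res <- factor_to_indicators(pets, "kind")
print(res)
```

Only base R is used here, in keeping with the package's stated goal of avoiding dependencies.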

Install from CRAN

install.packages("jwutil")

Install from GitHub

library(devtools)
install_github("jackwasey/jwutil")

Monthly Downloads

5

Version

1.2.3

License

GPL-3

Repository

https://github.com/jackwasey/jwutil

Maintainer

Jack O Wasey

Last Published

May 6th, 2019

Functions in jwutil (1.2.3)

%in%/match equivalent for two environment arguments

drop_rows_with_na

drops rows with NA values in specified fields
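As a rough illustration of what dropping rows with NA in specified fields can look like in base R, consider the sketch below. The function name `drop_na_rows` and its `fields` argument are hypothetical; the actual signature of `drop_rows_with_na` is not shown on this page.

```r
# Hypothetical helper, not the jwutil implementation: keep only the rows
# that have no NA in the named fields. Other columns may still contain NA.
drop_na_rows <- function(df, fields = names(df)) {
  stopifnot(is.data.frame(df), all(fields %in% names(df)))
  df[stats::complete.cases(df[fields]), , drop = FALSE]
}

d <- data.frame(a = c(1, NA, 3), b = c("x", "y", NA))
kept <- drop_na_rows(d, "a")
print(kept)
```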

add_time_to_date

convert separate lists of dates and times to POSIXlt objects

countNonNaCumulative

running totals of number of non-NA values in consecutive fields

update a set of data frame field names

env_to_vec_flip

return a new environment with names and values swapped

countNotNumeric

count non-numeric elements

fastIntToString

Fast convert integer vector to character vector

fastIntToStringRcpp

Convert integers to strings as quickly as possible

all unique combinations of a vector and all its non-zero subsets

check if a time is valid in 24h clock

Which elements of a character vector are numeric

get_numeric_char_field_names

Find columns which are numeric

opt_binary_brute

selects columns from a data frame using an optimization function

inverse of %in%

Determine whether a value is, or should be, `NA`

minimal basic pre-processing metrics

get items or numerics that would be dropped in a merge

count NA in vector

get_factor_fields

get names of the factor fields in a data frame

Fix NA-like strings to be NA (or other value of choice)

is the object a Date

list all functions in a package

Summarize objects

Return percentage string to given significant figures

show largest objects

List all items in a package

filter data with diagnostics

fillMissingCombs

fill out missing combinations of factors with NA

determine whether a list is nested

get NA field names from data frame

Extract code from knitr vignette and source it

Take clipboard contents, and write sorted character vector back

update_github_pkgs

Update github_install packages

is every row sorted?

mimic the R CMD check test

Make a list using input argument names as names

as_numeric_nowarn

convert factor or vector to numeric without warnings

as_char_no_warn

convert to character vector without warning

platformIsLinux

Are we running on Linux, Mac or Windows?

Build with current Makevars, but with clang scan-build static analysis

Tools for testing and data manipulation not found elsewhere

random_test_dates

generate random Dates or POSIXlt test datetimes

logical_to_binary

Convert logical columns of data frame to 0s and 1s
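The conversion described above can be sketched in base R as follows. The name `logical_cols_to_binary` is an assumption for illustration, and the choice of integer (rather than numeric) 0s and 1s is likewise assumed, not confirmed by the source.

```r
# Hypothetical helper, not the jwutil implementation: replace every logical
# column of a data frame with integer 0/1 values, leaving other columns alone.
logical_cols_to_binary <- function(df) {
  stopifnot(is.data.frame(df))
  is_lgl <- vapply(df, is.logical, logical(1))
  if (any(is_lgl)) {
    # as.integer maps TRUE to 1L, FALSE to 0L, and keeps NA as NA.
    df[is_lgl] <- lapply(df[is_lgl], as.integer)
  }
  df
}

flags <- data.frame(id = c("a", "b"), smoker = c(TRUE, FALSE),
                    stringsAsFactors = FALSE)
out <- logical_cols_to_binary(flags)
print(out)
```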

vec_to_env_true

create environment from vector

trim null or empty values from a list

read_xlsx_linux

read .xlsx file, interpret as CSV, and return a data frame

random_test_numbers

create extreme random numbers

Find minimum R version required for package

return proportion of NA values per field

count numeric elements

proportion of non-descending rows in matrix

Load packages with library, installing any which are missing

recursive remove

trim null or empty values from a list

Proportion of NA values in a vector

dput_expect_equal

dput a testthat test

strip whitespace from ends of each string in given character vector

stringr does this, but here we have a small amount of base R code

return all matches for regular expression

two_cat_to_logical

Take dataframe, and convert any columns with just two categories into logical

Match across columns for multiple lookup values

read file from zip at URL

str_multi_match

return the actual matches from a bracketed regex

Match pairs of strings to get named vector

Zero NA values in a data.frame

merge lists by names

permuteWithRepeats

Generate all permutations of input, reusing values in each result row
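One way to enumerate such rows in base R is with `expand.grid`. This is a sketch under the assumption that each result row has the same length as the input, giving n^n rows for n values; the function name is hypothetical and the row order may differ from what `permuteWithRepeats` returns.

```r
# Hypothetical sketch, not the jwutil implementation: every row is a
# sequence of length(x) values drawn from x, with values allowed to repeat.
permute_with_repeats_sketch <- function(x) {
  grids <- rep(list(x), length(x))
  out <- as.matrix(expand.grid(grids, stringsAsFactors = FALSE))
  dimnames(out) <- NULL
  out
}

p <- permute_with_repeats_sketch(c("a", "b"))
print(p)
```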

Fast Factor Generation

Convert factor into a data.frame of logicals

Print integers with percentage of total rounded to integer

Compile and test package with and without profile guided optimization

numbers_to_long_and_float

convert numbers to long and float types

Convert a number into rounded integer percentage string

strip all whitespace

strip_for_formula

strip a string so that it can be used as a variable name in a formula.

Generate all permutations of input

save_in_data_dir

Save given variable in package data directory

Shuffle a vector

unzip a single file from URL

unzip_to_data_raw

Unzip file to data-raw

Get the first/only argument of the last run command

bad input data for tests

expect_that_combine_all_args

alternative to expect_that from testthat which permutes all the inputs to a function; every ordering should give the same result when there are at least two arguments and the function is commutative.
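The idea of checking a commutative function under every argument ordering can be sketched without testthat. Both helper names below are hypothetical, and this is an illustration of the technique in base R, not the package's implementation.

```r
# Hypothetical helper: all orderings of 1..n as rows of a matrix, built by
# inserting n at every position of each ordering of 1..(n - 1).
all_orderings <- function(n) {
  if (n <= 1L) return(matrix(seq_len(n), nrow = 1L))
  prev <- all_orderings(n - 1L)
  do.call(rbind, lapply(seq_len(n), function(pos) {
    t(apply(prev, 1L, function(r) append(r, n, after = pos - 1L)))
  }))
}

# Hypothetical helper: call f with its arguments in every order and stop
# with an error if any ordering gives a result different from `expected`.
expect_same_for_all_orders <- function(f, args, expected) {
  ord <- all_orderings(length(args))
  for (i in seq_len(nrow(ord))) {
    got <- do.call(f, unname(args[ord[i, ]]))
    if (!isTRUE(all.equal(got, expected))) {
      stop("argument order ", paste(ord[i, ], collapse = ","),
           " gave a different result")
    }
  }
  invisible(TRUE)
}

ok <- expect_same_for_all_orders(sum, list(1, 2, 3), 6)
```

A non-commutative function such as subtraction fails this check, which is what makes it useful as a test.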

binary_col_names

names of fields which are numeric, binary or combinations thereof

build simple linear formula from variable names

extreme_numbers

extreme numbers

drop_duplicate_fields

Drop fields with duplicate data