predictrace (version 2.0.1)

first_names_race: First names and the probability of each race for that first name

Description

A dataset containing over 4 thousand first names and, for each first name, the probability that a person with that name belongs to each race. Citation for this data: Tzioumis, Konstantinos (2018). Demographic aspects of first names. Scientific Data, 5:180025 [dx.doi.org/10.1038/sdata.2018.25].

Usage

first_names_race

Format

A data frame with 4,251 rows and 8 variables:

name

First name

likely_race

The most likely race based on the probability of each race

probability_american_indian

Probability that a person with the first name is American Indian

probability_asian

Probability that a person with the first name is Asian

probability_black

Probability that a person with the first name is Black

probability_hispanic

Probability that a person with the first name is Hispanic

probability_white

Probability that a person with the first name is White

probability_2races

Probability that a person with the first name is two or more races

...
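
Examples

The relationship between likely_race and the probability columns can be sketched with a small stand-in data frame. This is only an illustration under stated assumptions: the names and probabilities below are invented, the toy object is not the packaged dataset, and the labels the package actually stores in likely_race (and how it breaks ties) may differ from the ones derived here.

```r
# Toy data frame that mimics the documented column names.
# The names and probabilities are made up for illustration only.
toy <- data.frame(
  name                        = c("example_a", "example_b"),
  probability_american_indian = c(0.01, 0.00),
  probability_asian           = c(0.02, 0.70),
  probability_black           = c(0.10, 0.05),
  probability_hispanic        = c(0.05, 0.05),
  probability_white           = c(0.80, 0.15),
  probability_2races          = c(0.02, 0.05),
  stringsAsFactors = FALSE
)

# All columns holding a per-race probability.
prob_cols <- grep("^probability_", names(toy), value = TRUE)

# Column index of the largest probability in each row
# (the first column wins on ties; this tie rule is an assumption).
best <- max.col(as.matrix(toy[prob_cols]), ties.method = "first")

# Strip the "probability_" prefix to get a label for the winning column.
toy$likely_race <- sub("^probability_", "", prob_cols[best])

print(toy[c("name", "likely_race")])
```

With these invented values, the first row is labeled "white" and the second "asian". To inspect the real data, load the package and look at first_names_race directly, as shown under Usage.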