A data frame with 2233 observations on the following 24 variables.
Word: a factor with 2284 words.
CelS: numeric vector with log-transformed lemma frequency in the CELEX lexical database.
Fdif: numeric vector with the logged ratio of written frequency (CELEX) to spoken frequency (British National Corpus).
Vf: numeric vector with log morphological family size.
Dent: numeric vector with derivational entropy.
Ient: numeric vector with inflectional entropy.
NsyS: numeric vector with the log-transformed count of synonym sets in WordNet in which the word is listed.
NsyC: numeric vector with the log-transformed count of synonym sets in WordNet in which the word is listed as part of a compound.
Len: numeric vector with length of the word in letters.
Ncou: numeric vector with orthographic neighborhood density.
Bigr: numeric vector with mean log bigram frequency.
InBi: numeric vector with log frequency of initial diphone.
spelV: numeric vector with type count of orthographic neighbors.
spelN: numeric vector with token count of orthographic neighbors.
phonV: numeric vector with type count of phonological neighbors.
phonN: numeric vector with token count of phonological neighbors.
friendsV: numeric vector with type counts of consistent words.
friendsN: numeric vector with token counts of consistent words.
ffV: numeric vector with type count of forward inconsistent words.
ffN: numeric vector with token count of forward inconsistent words.
fbV: numeric vector with type count of backward inconsistent words.
fbN: numeric vector with token count of backward inconsistent words.
ffNonzero: a numeric vector with the count of forward inconsistent words with nonzero frequency.
NVratio: a numeric vector with the logarithmically transformed ratio of the noun and verb frequencies.
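
Two of the variables above, Fdif and NVratio, are logged frequency ratios. The sketch below illustrates how such a value could be computed and how its sign reads. It is an illustration only: the function name `log_ratio`, the example counts, the use of the natural logarithm, and the absence of any smoothing constant are all assumptions, since the description does not state the log base or how zero frequencies were handled.

```python
import math


def log_ratio(numerator_freq, denominator_freq):
    """Return log(numerator_freq / denominator_freq), natural log (assumed)."""
    if numerator_freq <= 0 or denominator_freq <= 0:
        # A plain log ratio is undefined for zero counts; the source does not
        # say how such cases were handled, so this sketch simply rejects them.
        raise ValueError("frequencies must be positive")
    return math.log(numerator_freq) - math.log(denominator_freq)


# Hypothetical counts, not taken from the data set.
written_freq, spoken_freq = 200, 50
fdif_like = log_ratio(written_freq, spoken_freq)  # > 0: more frequent in writing

noun_freq, verb_freq = 30, 120
nvratio_like = log_ratio(noun_freq, verb_freq)  # < 0: more frequent as a verb
```

Under these assumptions a value of zero means the two frequencies are equal, and values of the same size with opposite signs correspond to reciprocal ratios.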