The Toronto Word Pool consists of 1080 words in various grammatical classes together with a variety of normative variables.
The TWP contains high frequency nouns, adjectives, and verbs taken
originally from the Thorndike-Lorge (1944) norms.
This word pool has been used in hundreds of studies at Toronto and elsewhere.
data(TWP)A data frame with 1093 observations on the following 12 variables.
itmnoitem number
wordthe word
imageryimagery rating
concretenessconcreteness rating
lettersnumber of letters
frequencyword frequency, from the Kucera-Francis norms
foaa measure of first order approximation to English. In a first-order approximation, the probability of generating any string of letters is based on the frequencies of occurrence of individual letters in the language.
soaa measure of second order approximation to English, bawsed on bigram frequencies.
onrOrthographic neighbor ratio, taken from Landauer and Streeter (1973). It is the ratio of the frequency of the word in Kucera and Francis (1967) count divided by the sum of the frequencies of all its orthographic neighbors.
dictcodedictionary codes, a factor indicating the collection of grammatical classes, 1-5, for a given word form
nounpercent noun usage. Words considered unambiguous based on dictcode
are listed as 0 or 100; other items were rated in a judgment task.
canadiana factor indicating an alternative Canadian spelling of a given word
The last 13 words in the list are alternative Canadian spellings of words
listed earlier, and have duplicate itmno values.
Kucera and Francis, W.N. (1967). Computational Analysis of Present-Day American English. Providence: Brown University Press.
Landauer, T. K., & Streeter, L. A. Structural differences between common and rare words: Failure of equivalent assumptions for theories of word recognition. Journal of Verbal Learning and Verbal Behavior, 1973, 11, 119-131.
# NOT RUN {
data(TWP)
str(TWP)
summary(TWP)
# select low imagery, concreteness and frequency words
R <- list(imagery=c(1,5), concreteness=c(1,4), frequency=c(0,30))
pickList(TWP, R)
# }
Run the code above in your browser using DataLab