Learn R Programming

SMPracticals (version 1.3-1)

bard: Shakespeare's Word Type Frequencies

Description

These are the frequencies with which Shakespeare used word types. There are 846 word types which appear more than 100 times in his total works, giving an overall total of 31534 word types.

Usage

data(bard)

Arguments

source

Efron, B. and Thisted, R. (1976) Estimating the number of unseen species: How many words did Shakespeare know? Biometrika, 63, 435--448.

Thisted, R. and Efron, B. (1987 ) Did Shakespeare write a newly-discovered poem? Biometrika, 74, 445--455.

Details

The canon of Shakespeare's accepted works contains 884,647 words, with 31,534 distinct word types. A word type is a distinguishable arrangement of letters, so `king' is different from `kings' and `alehouse' different from both `ale' and `house'.

References

Davison, A. C. (2003) Statistical Models. Cambridge University Press. Page 629.