# HistData

Data Sets from the History of Statistics and Data Visualization

The HistData package provides a collection of small data sets that are interesting and important in the history of statistics and data visualization. The goal of the package is to make these available, both for instructional use and for historical research. Some of these present interesting challenges for graphics or analysis in R.

Some of the data sets have examples which reproduce an historical graph or analysis. These are meant mainly as starters for more extensive re-analysis or graphical elaboration.
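As a starting point of this kind, the sketch below loads one of the listed data sets, `Galton`, and draws a basic scatterplot with a least-squares line. It assumes the HistData package is already installed and that `Galton` has the columns `parent` and `child`. It is only an illustration, not a reproduction of Galton's original figure or of the example shipped with the package.

```r
# Assumes HistData is installed, e.g. via install.packages("HistData")
library(HistData)

# Load Galton's data on the heights of parents and their children
data(Galton)
str(Galton)

# Plot child height against mid-parent height
# (columns `parent` and `child` are assumed here)
plot(child ~ parent, data = Galton,
     xlab = "Mid-parent height (in)",
     ylab = "Child height (in)")

# Add a least-squares regression line as a first step toward re-analysis
fit <- lm(child ~ parent, data = Galton)
abline(fit, col = "red")
```

The heights are recorded in binned values, so many points overlap. Wrapping the variables in `jitter()` or using `sunflowerplot()` are possible next steps for graphical elaboration.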

The data sets are part of a program of research called statistical historiography (Friendly, 2007; Friendly & Denis, 2001; Friendly et al., 2016), meaning the use of statistical methods to study problems and questions in the history of statistics and graphics. They are also used in a forthcoming book on the history of data visualization (Friendly & Wainer, 2018).

## References

Friendly, M. (2007). A Brief History of Data Visualization. In Chen, C., Härdle, W. & Unwin, A. (eds.), Handbook of Computational Statistics: Data Visualization, Springer-Verlag, III, Ch. 1, 1-34.

Friendly, M. & Denis, D. (2001). Milestones in the history of thematic cartography, statistical graphics, and data visualization. http://datavis.ca/milestones/

Friendly, M., Sigal, M. & Harnanansingh, D. (2016). The Milestones Project: A Database for the History of Data Visualization. In Kostelnick, C. & Kimball, M. (eds.), Visible Numbers: The History of Data Visualization, Ashgate Press, Chapter 10.

Friendly, M. & Wainer, H. (2018). The Origin of Graphical Species. Harvard University Press.

## Functions in HistData

| Name | Description |
| --- | --- |
| Cholera | William Farr's Data on Cholera in London, 1849 |
| GaltonFamilies | Galton's data on the heights of parents and their children, by child |
| Guerry | Data from A.-M. Guerry, "Essay on the Moral Statistics of France" |
| Fingerprints | Waite's data on Patterns in Fingerprints |
| Galton | Galton's data on the heights of parents and their children |
| Michelson | Michelson's Determinations of the Velocity of Light |
| Macdonell | Macdonell's Data on Height and Finger Length of Criminals, used by Gosset (1908) |
| Snow | John Snow's Map and Data on the 1854 London Cholera Outbreak |
| HistData-package | Data sets from the History of Statistics and Data Visualization |
| HalleyLifeTable | Halley's Life Table |
| Jevons | W. Stanley Jevons' data on numerical discrimination |
| Minard | Data from Minard's famous graphic map of Napoleon's march on Moscow |
| SnowMap | Draw John Snow's Map of Cholera in London |
| OldMaps | Latitudes and Longitudes of 39 Points in 11 Old Maps |
| Pyx | Trial of the Pyx |
| Yeast | Student's (1906) Yeast Cell Counts |
| Quarrels | Statistics of Deadly Quarrels |
| Prostitutes | Parent-Duchatelet's time-series data on the number of prostitutes in Paris |
| ZeaMays | Darwin's Heights of Cross- and Self-fertilized Zea May Pairs |
| PolioTrials | Polio Field Trials Data |
| Virginis | John F. W. Herschel's Data on the Orbit of the Twin Stars $\gamma$ Virginis |
| Langren | van Langren's Data on Longitude Distance between Toledo and Rome |
| Wheat | Playfair's Data on Wages and the Price of Wheat |
| Nightingale | Florence Nightingale's data on deaths from various causes in the Crimean War |
| PearsonLee | Pearson and Lee's data on the heights of parents and children classified by gender |
| EdgeworthDeaths | Edgeworth's Data on Death Rates in British Counties |
| Cavendish | Cavendish's Determinations of the Density of the Earth |
| ChestSizes | Chest measurements of 5738 Scottish Militiamen |
| Bowley | Bowley's data on values of British and Irish trade, 1855-1899 |
| Dactyl | Edgeworth's counts of dactyls in Virgil's Aeneid |
| DrinksWages | Elderton and Pearson's (1910) data on drinking and wages |
| Arbuthnot | Arbuthnot's data on male and female birth ratios in London from 1629-1710 |
| CushnyPeebles | Cushny-Peebles Data: Soporific Effects of Scopolamine Derivatives |
| Armada | La Felicisima Armada |