R Dataset / Package HistData / Galton
Attachment: dataset30705.csv (19.56 KB)
Documentation 

On this Picostat.com statistics page, you will find information about the Galton data set, which pertains to Galton's data on the heights of parents and their children. The Galton data set is found in the HistData R package. After attaching the package with library(HistData), you can load the data set in R by issuing the command data("Galton") at the console. This will load the data into a variable called Galton. If R says the Galton data set is not found, you can try installing the package with install.packages("HistData") and then attempt to reload the data. If you need to download R, you can go to the R project website. You can also download a CSV (comma-separated values) version of the Galton R data set. The size of this file is about 20,032 bytes.

Galton's data on the heights of parents and their children

Description
Galton (1886) presented these data in a table, showing a crosstabulation of 928 adult children born to 205 fathers and mothers, by their height and their midparent's height. He visually smoothed the bivariate frequency distribution and showed that the contours formed concentric and similar ellipses, thus setting the stage for correlation, regression and the bivariate normal distribution.

Usage
data(Galton)

Format
A data frame with 928 observations on 2 variables, parent and child.
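As a minimal sketch of the loading steps described above, the following R snippet installs HistData only when it is missing, loads the data, and checks the shape stated under Format. It assumes a CRAN mirror is reachable; the name galton_table is illustrative and not part of the package.

```r
# Install HistData only if it is not already available
# (assumes a CRAN mirror is configured and reachable).
if (!requireNamespace("HistData", quietly = TRUE)) {
  install.packages("HistData")
}

# Load the data set into the current environment as `Galton`.
data("Galton", package = "HistData")

# Inspect the shape and the column names of the data frame.
dim(Galton)
names(Galton)

# Rebuild a crosstabulation of child height by midparent height.
# `galton_table` is a hypothetical name chosen for this sketch.
galton_table <- table(child = Galton$child, parent = Galton$parent)
sum(galton_table)  # total number of children counted in the table
```

Passing package = "HistData" to data() avoids attaching the whole package just to read one data set; calling library(HistData) followed by data(Galton) should work equally well.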
Details
The data are recorded in class intervals of width 1.0 in. Galton used non-integer values for the center of each class interval because of the strong bias toward integral inches. All of the heights of female children were multiplied by 1.08 before tabulation to compensate for sex differences. See Hanley (2004) for a reanalysis of Galton's raw data questioning whether this was appropriate.

Source
Galton, F. (1886). Regression Towards Mediocrity in Hereditary Stature. Journal of the Anthropological Institute, 15, 246-263.

References
Friendly, M. & Denis, D. (2005). The early origins and development of the scatterplot. Journal of the History of the Behavioral Sciences, 41, 103-130.
Galton, F. (1869). Hereditary Genius: An Inquiry into its Laws and Consequences. London: Macmillan.
Hanley, J. A. (2004). "Transmuting" Women into Men: Galton's Family Data on Human Stature. The American Statistician, 58, 237-243. See http://www.medicine.mcgill.ca/epidemiology/hanley/galton/ for source materials.
Stigler, S. M. (1986). The History of Statistics: The Measurement of Uncertainty before 1900. Cambridge, MA: Harvard University Press, Table 8.1.
Wachsmuth, A. W., Wilkinson, L., & Dallal, G. E. (2003). Galton's bend: A previously undiscovered nonlinearity in Galton's family stature regression data. The American Statistician, 57, 190-192. http://www.cs.uic.edu/~wilkinson/Publications/galton.pdf

See Also
Examples

data(Galton)

###########################################################################
# sunflower plot with regression line and data ellipses and lowess smooth
###########################################################################
with(Galton, {
  sunflowerplot(parent, child, xlim=c(62,74), ylim=c(62,74))
  reg <- lm(child ~ parent)
  abline(reg)
  lines(lowess(parent, child), col="blue", lwd=2)
  if (require(car)) {
    dataEllipse(parent, child, xlim=c(62,74), ylim=c(62,74), plot.points=FALSE)
  }
})

Dataset imported from https://www.r-project.org.
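The plot above draws the fitted line but does not print its coefficients. As a hedged sketch, the same regression of child on parent can be inspected numerically as follows. The names fit, slope and r are illustrative and do not come from the package documentation.

```r
# Load the data without attaching the whole package
# (assumes HistData is already installed).
data("Galton", package = "HistData")

# Fit the same linear model used in the plotting example.
fit <- lm(child ~ parent, data = Galton)

# Extract the slope on midparent height as a plain number.
slope <- unname(coef(fit)["parent"])
slope

# Correlation between midparent and child heights.
r <- cor(Galton$parent, Galton$child)
r
```

A slope between 0 and 1 would indicate that children of unusually tall or short parents tend, on average, to be closer to the overall mean than their parents, which is the pattern named in the title of Galton's 1886 paper.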