Background on the long-term study of diabetes among the Pima in Arizona. Data-set cleaning. (Note that although about half the records contain physically impossible values, this data set is routinely used in computer science as an example of one with no missing values.) Fitting logistic regression for a binary outcome. Calculations with fitted logistic regression models. Separating associations between variables from significant predictors. Model-checking.
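To make the cleaning step and the "calculations with fitted models" step concrete, here is a minimal sketch in Python. Everything specific in it is an assumption for illustration: the handful of records, the choice of glucose and BMI as predictors, the rule that a zero is an impossible value standing in for a missing one, and above all the coefficients, which are made up and not estimates from the actual Pima data.

```python
import math

# Hypothetical records: (glucose, bmi, diabetic). A zero for glucose or BMI
# is physically impossible, so it is treated here as a missing-value code.
records = [
    (148, 33.6, 1),
    (85, 26.6, 0),
    (0, 23.3, 1),    # impossible glucose
    (137, 0.0, 1),   # impossible BMI
    (116, 25.6, 0),
]


def is_complete(rec):
    """True if neither predictor holds an impossible zero."""
    glucose, bmi, _ = rec
    return glucose > 0 and bmi > 0


# Keep only the records with physically possible values.
clean = [r for r in records if is_complete(r)]

# Hypothetical coefficients (intercept, glucose, bmi), for illustration only.
beta = (-8.0, 0.04, 0.08)


def predicted_probability(glucose, bmi, beta=beta):
    """Inverse logit of the linear predictor."""
    eta = beta[0] + beta[1] * glucose + beta[2] * bmi
    return 1.0 / (1.0 + math.exp(-eta))


def odds_ratio(coef, delta=1.0):
    """Multiplicative change in the odds for a delta-unit increase."""
    return math.exp(coef * delta)


if __name__ == "__main__":
    print(len(clean), "of", len(records), "records kept")
    for glucose, bmi, y in clean:
        print(glucose, bmi, y, round(predicted_probability(glucose, bmi), 3))
    print("odds ratio per 10 units of glucose:",
          round(odds_ratio(beta[1], 10), 3))
```

The point of the second half is that a logistic regression coefficient acts additively on the log-odds scale, so a fixed change in a predictor multiplies the odds by a constant factor, while the change in probability depends on where you start.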
Posted at March 30, 2011 23:03