2021 06

Books to Read While the Algae Grow in Your Fur, June 2021

Attention conservation notice: I have no taste, and no qualifications to opine on cryptozoology, folklore, economics, or humanistic geography.

Anne Perry, The Cater Street Hangman: Mind candy historical mystery. Enjoyable, but I fail to see why this should have sparked a series of dozens of books over decades.
Benjamin Radford and Joe Nickell, Lake Monster Mysteries: Investigating the World's Most Elusive Creatures: Shorter: There are no lake monsters, just logs, otters, and stories about lake monsters.; Longer: Mostly this is an account of the authors' travels to various lakes which are claimed to have monsters, and the authors' (very tame) adventures debunking the stories, i.e., providing mundane accounts of what could have caused sightings or what's really in particular photographs. They are very fond of invoking logs, tree stumps, and otters. (I am persuaded about the timber and open-minded about the otters.) This is pretty standard fare, of the kind I have enjoyed since I was a boy and my mother would buy me issues of Skeptical Inquirer.; There is also a not-quite-fully-articulated theory of lake monsters hinted at here. If I try to draw this out explicitly, it'd be something like this: lake monsters are a modern myth, originating with Loch Ness in the 1930s, with the idea being that lakes are inhabited by surviving plesiosaurs, or something near enough. (One ancestor of the myth is thus the genre of "lost world" adventure stories.) Pre-modern stories about strange creatures in lakes get invoked by the myth as "evidence", regardless of their content or context; occasionally accounts of pre-modern stories are fabricated as needed. When people who know the myth see strange things on lakes, which is common enough, knowledge of the myth provides an interpretation for an ambiguous experience, and an opportunity for recounting the myth with an additional report attached. (It is enough for these purposes that the people be able to say "I don't know what I saw, but I saw something".) The myth spreads from lake to lake, partly through natural diffusion, and partly through the efforts of local chambers of commerce to drum up tourism.; As I said, the theory of lake monsters in the previous paragraph is me trying to articulate Radford and Nickell's hints by stringing their scattered remarks together with bits of Dan Sperber and Pascal Boyer. The authors themselves repeatedly refer to a work by an actual folklorist (Michel Meuger's 1988 Lake Monster Traditions: A Cross-Cultural Analysis) in ways which make me eager to track down a copy.
Jeff Lemire and Dean Ormston, Black Hammer: Secret Origins
Alex Robinson's Lower Regions
Rick Remender, Eric Nguyen et al., Strange Girl
Kel Symons and Mathew Reynolds, The Mercenary Sea: Comic book mind candy, assorted.
Pierro Sraffa, Production of Commodities by Means of Commodities: Preliude to a Critique of Economic Theory: This is a little book drafted in the 1920s and published in 1960, which became the subject of a huge literature. I have read a lot about it over the years, since it became a touchstone for some strands of heterodox economics, but never actually read it until this month. Having done so I find it very strange, not least because I feel like it could have be shortened still further, and yet clarified, if Sraffa had just used some basic theory for directed graphs and invoked the Frobenius-Perron theorem. (It's possible that the theory about directed graphs didn't exist when he first wrote, and even that the Frobenius-Perron theorem was then too obscure, but by 1960?) I am in fact tempted to re-write it doing just that, but I presume somebody out there in neo-Ricardian / post-Keynesian / post-Marxist land has done so, and I call upon the LazyWeb for a reference.; (Thanks to Z. M. Shalizi for lending me his copy.)
Yi-Fu Tuan, Dominance and Affection: The Making of Pets [JSTOR]: This is a beautifully-written and thought-provoking, perhaps even disturbing, book. It's an examination across history and time of the ways people make others --- plants, animals, and indeed other people --- into playthings, into objects which they can manipulate, and consequently bestow affection upon. I am sure there are people who can read it without coming to look at their own affections in a different light, but I'd prefer not to know them.; This book is part of a loose series that Tuan wrote, looking at what one might call the moral psychology of different aspects of humans' experience of their environments --- Segmented Worlds and Self, Landscapes of Fear, Escapism, Cosmos and Hearth, etc. These are all marked by the same virtues as this book: vast learning worn lightly, smooth-flowing writing, and an acute ethical sensitivity that is never preachy. I recommend them all very highly indeed.; (Thanks to Jan Johnson for the gift of this book.)
Norbert Wiener, The Fourier Integral and Certain of Its Applications: Recommended purely for historical interest. If you already are familiar with Fourier analysis and are curious to see it at any earlier stage in its development, this is interesting work from a pioneer. (And it's full of curious sidelights, such as the fact that Wiener in 1933 doesn't have the word "convolution" in its modern mathematical-English sense, but uses the German Faltung for lack of any translation.) But I don't think there are insights or techniques which aren't fully assimilated into the modern mainstream.
Glenn C. Loury, The Anatomy of Racial Inequality: Re-read for course prep. If it was in print I'd probably make it a required text; as it is I expect to assign passages from chapters 2 ("Racial Stereotypes") and 3 ("Racial Stigma") in the unit on mechanisms that create and perpetuate inequalities.

Books to Read While the Algae Grow in Your Fur; Scientifiction and Fantastica; Mathematics; Pleasures of Detection, Portraits of Crime; Tales of Our Ancestors; The Dismal Science; Commit a Social Science; Philosophy; Psychoceramics

Posted at June 30, 2021 23:59 | permanent link

Course Announcement: "Statistics of Inequality and Discrimination" (36-313)

Attention conservation notice: Advertisement for a course you won't take, at a university you don't attend. Even if the subject is of some tangential interest, why not check back in a few months to see if the teacher has managed to get himself canceled, and/or produced anything worthwhile?

In the fall I will, again, be teaching something new:

36-313, Statistics of Inequality and Discrimination
9 units
Time and place: Tuesdays and Thursdays, 1:25 -- 2:45 pm, location TBA
Description: Many social questions about inequality, injustice and unfairness are, in part, questions about evidence, data, and statistics. This class lays out the statistical methods which let us answer questions like Does this employer discriminate against members of that group?, Is this standardized test biased against that group?, Is this decision-making algorithm biased, and what does that even mean? and Did this policy which was supposed to reduce this inequality actually help? We will also look at inequality within groups, and at different ideas about how to explain inequalities between groups. The class will interweave discussion of concrete social issues with the relevant statistical concepts.
Prerequisites: 36-202 ("Methods for Statistics and Data Science") (and so also 36-200, "Reasoning with Data")

This is a class I've been wanting to teach for some years now, and I'm very happy to finally get the chance to ~~feel my well-intentioned but laughably inadequate efforts crushed beneath massive and justified opprobrium evoked from all sides~~ ~~bore and perplex some undergrads who thought they were going to learn something interesting in stats. class for a change~~ try it out.

Tentative topic schedule

About one week per.

"Recall": Reminders about probability and statistics: populations, distribution within a population, distribution functions, joint and conditional probability; samples and inference from samples. Reminders (?) about social concepts: ascriptive and attained social categories; status, class, race, caste, sex, gender, income, wealth.
Income and wealth inequality: What does the distribution of income and wealth look like within a population? How do we describe population distributions, especially when there is an extreme range of values (a big difference between the rich and poor)? Where does the idea of "the 1%" wealthy elite come from? How has income inequality changed over recent decades?
Statistical tools: measures of central tendency (median, mode, mean), of dispersion, and of skew; the concept of "heavy tails" (the largest values being orders of magnitude larger than typical values); log-normal and power law distributions; fitting distributions to existing data; positive feedback, multiplicative growth and "cumulative advantage" processes.
Income disparities: How does income (and wealth) differ across groups? How do we compare average or typical values? How do we compare entire distributions? How have income inequalities by race and sex changed over recent decades?
Statistical tools: permutation tests for differences in mean (and other measures of the average); two-sample tests for differences in distribution; inverting tests to find the range of differences compatible with the data; the "analysis of variance" method of comparing populations; the "relative distribution" method of comparing populations
Detecting discrimination in hiring: Do employers discriminate in hiring (or schools in admission, etc.)? How can we tell? When are differences in hiring rates evidence for discrimination? How do statistical perspectives on this question line up with legal criteria for "disparate treatment" and "disparate impact"?
Statistical tools: tests for differences in proportions or probabilities; adjusting for applicant characteristics; deciding what to adjust for
Detecting discrimination in policing: Do the police discriminate against members of particular racial groups? When do differences in traffic stops, arrests, or police-caused deaths indicate discrimination? Does profiling or "statistical discrimination" make sense for the police? Can groups be simultaneously be over- and under- policed?
Statistical tools: test for differences in proportions; signal detection theory; adjusting for systematically missing data; self-reinforcing equilibria
Algorithmic bias: Can predictive or decision-making algorithms be biased? What would that even mean? Do algorithms trained on existing data necessarily inherit the biases of the world? What notions of fairness or unbiased can we actually implement for algorithms? What trade-offs are involved in enforcing different notions of fairness? Are "risk-prediction instruments" fair?
Statistical tools: Methods for evaluating the accuracy of predictions; differential error rates across groups; decision trees; optimization and multi-objective optimization.
Standardized tests: Are standardized tests for school admission biased against certain racial groups? What does it mean to measure qualifications, and how would we know whether tests really are measuring qualifications? What does it mean for a measurement to be biased? When do differences across groups indicate biases? (Disparate impact again.) Why correlating outcomes with test scores among admitted students may not make sense. The "compared to what?" question.
Statistical tools: Predictive validity; differential prediction; "conditioning on a collider"
Intelligence tests: Are intelligence tests biased? How do we measure latent attributes? How do we know the latent attributes even exist? What would it mean for there to be such a thing as "general intelligence", that could be measured by tests? What, if anything, do intelligence tests measure? What rising intelligence test results (the Flynn Effect) tell us?
Statistical tools: correlation between test scores; factor models as an explanation of correlations; estimating factor values from tests; measurement invariance; alternatives to factor models
Implicit bias: Do "implicit association tests" measure unconscious biases? Again on measurement, as well as what it would mean for a bias to be "implicit" or "unconscious". What, if anything, do implicit association tests measure?
Statistical tools: Approaches to "construct validity".
Interventions on implicit bias: Can trainings or other interventions reduce implicit bias? How do we investigate the effectiveness of interventions? How do we design a good study an intervention? How do we pool information from multiple studies. Do implicit bias interventions change behavior? Does having a chief diversity officer increase faculty diversity?
Statistical tools: Experimental design: selecting measurements of outcomes, and the importance of randomized studies; meta-analytic methods for combining information.
Explaining, or explaining away, inequality: To what extent can differences in outcomes between groups be explained by differences in their attributes (e.g., explaining differences in incomes by differences in marketable skills)? How should we go about making such adjustments? Is it appropriate to treat discrimination as the "residual" left unexplained? When does adjusting or controlling for a variable contribute to an explanation, and when is it "explaining away" discrimination? What would it mean to control for race, sex or gender?
Statistical tools: Observational causal inference; using regression to "control for" multiple variables at once; using graphical models to represent causal relations between variables; how to use graphical models to decide what should and what should not be controlled for; the causal model implicit in decisions about controls.
Self-organizing inequalities and "structural" or "systematic" inequalities: Models of how inequalities can perpetuate themselves even when nobody is biased. Models of how inequalities can appear even when nobody is biased. The Schelling model of spatial segregation as a "paradigm". How relevant are Schelling-type models to actual, present-day inequalities?
Statistical tools: Agent-based models; models of social learning and game theory.
Statistics and its history: The development of statistics in the 19th and early 20th century was intimately tied to the eugenics movement, which was deeply racist and even more deeply classist, but also often anti-sexist. The last part of the course will cover this history, and explain how many of the intellectual tools we have gone over to document, and perhaps to help combat, inequality and discrimination were invented by people who wanted to use them for quite different purposes. The twin learning objectives for this section are for students to grasp something of this history, and to grasp why the "genetic fallacy", of judging ideas by where they come from (their "genesis") is, indeed, foolish and wrong.
Statistical tools: N/A.

Evaluation

There will be one problem set per week; each of these homeworks will involve some combination of (very basic) statistical theory, (possibly less basic) calculations using the theory we've gone over, and analysis of real data sets using the methods discussed in class. There will also be readings for each class session, and a short-answer quiz after each session will combine questions based on lecture content with questions based on the readings.

There will not be any exams.

My usual policy is to drop a certain number of homeworks, and a certain number of lecture/reading questions, no questions asked. The number of automatic drops isn't something I'll commit to here and now (similarly, I won't make any promises here about the relative weight of homework vs. lecture-related questions).

Textbook, Lecture Notes

There is, unfortunately, no one textbook which covers the material we'll go over at the required level. You will, instead, get very detailed lecture notes after each lecture. There will also be a lot of readings from various books and articles. (I will not agree with every reading I assign.)

Teaching: Statistics of Inequality and Discrimination; Corrupting the Young; Enigmas of Chance; Commit a Social Science

Posted at June 03, 2021 23:59 | permanent link

Three-Toed Sloth

June 30, 2021

Books to Read While the Algae Grow in Your Fur, June 2021

June 03, 2021

Course Announcement: "Statistics of Inequality and Discrimination" (36-313)

Tentative topic schedule

Evaluation

Textbook, Lecture Notes