September 30, 2024

Books to Read While the Algae Grow in Your Fur, September 2024

Attention conservation notice: I have no taste, and no qualifications to opine on world history, or even on random matrix theory. Also, most of my reading this month was done at odd hours and/or while chasing after a toddler, so I'm less reliable and more cranky than usual.

Marc Potters and Jean-Philippe Bouchaud, A First Course in Random Matrix Theory: for Physicists, Engineers and Data Scientists, doi:10.1017/9781108768900
I learned of random matrix theory in graduate school; because of my weird path, it was from May's Stability and Complexity in Model Ecosystems, which I read in 1995--1996. (I never studied nuclear physics and so didn't encounter Wigner's ideas about random Hamiltonians.) In the ensuing nearly-thirty-years, I've been more or less aware that it exists as a subject, providing opaquely-named results about the distributions of eigenvectors of matrices randomly sampled from various distributions. It has, however, become clear to me that it's relevant to multiple projects I want to pursue, and since I don't have one student working on all of them, I decided to buckle down and learn some math. Fortunately, nowadays this means downloading a pile of textbooks; this is the first of my pile which I've finished.
The thing I feel most confident in saying about the book, given my confessed newbie-ness, is that Potters and Bouchaud are not kidding about their subtitle. This is very, very much physicists' math, which is to say the kind of thing mathematicians call "heuristic" when they're feeling magnanimous *. I am still OK with this, despite years of using and teaching probability theory at a rather different level of rigor/finickiness, but I can imagine heads exploding if those with the wrong background tried to learn from this book. (To be clear, I think more larval statisticians should learn to do physicists' math, because it is really good heuristically.)
To say just a little about the content, the main tool in here is the "Stieljtes transform", which for an $N\times N$ matrix $\mathbf{A}$ with eigenvalues $\lambda_1, \ldots \lambda_N$ is a complex-valued function of a complex argument $z$, \[ g^{\mathbf{A}}_N(z) = \frac{1}{N}\sum_{i=1}^{N}{\frac{1}{z-\lambda_i}} \] This can actually be seen as a moment-generating function, where the $k^{\mathrm{th}}$ "moment" is the normalized trace of $\mathbf{A^k}$, i.e., $N^{-1} \mathrm{tr}{\mathbf{A}^k}$. (Somewhat unusually for a moment generating function, the dummy variable is $1/z$, not $z$, and one takes the limit of $|z| \rightarrow \infty$ instead of $\rightarrow 0$.)
The hopes are that (i) $g_N$ will converge to a limiting function as $N\rightarrow\infty$, \[ g(z) = \int{\frac{\rho(d\lambda)}{z-\lambda}} \] and (ii) the limiting distribution $\rho$ of eigenvalues can be extracted from $g(z)$. The second hope is actually less problematic mathematically **. Hope (i), the existence of a limiting function, is just assumed here. At a very high level, Potters and Bouchaud's mode of approach is to derive an expression for $g_N(z)$ in terms of $g_{N-1}(z)$, and then invoke the assumption (i), to get a single self-consistent equation for the limiting $g(z)$. There are typically multiple solutions to these equations, but also usually only one that makes sense, so the others are ignored ***.
At this very high level, Potters and Bouchaud derive limiting distributions of eigenvalues, and in some cases eigenvectors, for a lot of distributions of matrices with random entries: symmetric matrices with IID Gaussian entries, Hermitian matrices with complex Gaussian entries, sample covariance matrices, etc. They also develop results for deterministic matrices perturbed by random noise, and a whole alternate set of derivations based on the replica trick from spin glass theory, which I do not feel up to explaining. These are then carefully applied to topics in estimating sample covariance matrices, especially in the high-dimensional limit where the number of variables grows with the number of observations. This in turn feeds in to a final chapter on designing optimal portfolios when covariances have to be estimated by mortals, rather than being revealed by the Oracle.
My main dis-satisfaction with the book is that I left it without any real feeling for why the eigenvalue density of symmetric Gaussian matrices with standard deviation $\sigma$ approaches $\rho(x) = \frac{\sqrt{4\sigma^2 - x^2}}{2\pi \sigma^2}$, but other ensembles have different limiting distributions. (E.g., why is the limiting distribution only supported on $[-2\sigma, 2\sigma]$, rather than having, say, unbounded support with sub-exponential tails?) That is, for all the physicists' tricks used to get solution, I feel a certain lack of "physical insight" into the forms of the solutions. Whether any further study will make me happier on this score, I couldn't say. In the meanwhile, I'm glad I read this, and I feel more prepared to tackle the more mathematically rigorous books in my stack, and even to make some headway on my projects. §
*: As an early example, a key step in deriving a key result (pp. 21--23) is to get the asymptotic expected value of such-and-such a random variable. Using a clever trick for computing the elements of an inverse matrix in terms of sub-matrices, they get a formula for the expected value of the reciprocal of that variable. They then say (eq. 2.33 on p. 22) that this is clearly the reciprocal of the desired limiting expected value, because after all fluctuations must be vanishing. ^
**: We consider $z$ approaching the real axis from below, say $z=x-i\eta$ for small $\eta$. Some algebraic manipulation then makes the imaginary part of $g(x-i\eta)$ look like the convolution of the eigenvalue density $\rho$ with a Cauchy kernel of bandwidth $\eta$. A deconvolution argument then gives $\lim_{\eta \downarrow 0}{\mathrm{Im}(gx-i\eta)} = \pi \rho(x)$. This can be approximated with a finite value of $N$ and $\eta$ (p. 26 discusses the numerical error). ^
***: There is an interesting question about physicists' math here, actually. Sometimes we pick and choose among options that, as sheer mathematics, seem equally good, we "discard unphysical solutions". But sometimes we insist that counter-intuitive or even bizarre possibilities which are licensed by the math have to be taken seriously, physically (not quite "shut up and calculate" in its original intention, but close). I suspect that knowing when to do one rather than the other is part of the art of being a good theoretical physicist... ^
Fernand Braudel, The Perspective of the World, volume 3 of Civilization and Capitalism, 15th--18th Century
This is the concluding volume of Braudel's trilogy, where he tries (as the English title indicates) to give a picture of how the world-as-a-whole worked during this period. It's definitely the volume I find least satisfying. Braudel organizes everything around a notion of "world economies" borrowed from Immanuel Wallerstein (an unfortunate choice of guide), postulating that these are always centered on a single dominating city, and spends a lot of his time tracking the shifts of what he says is the dominating city of the European world economy. But by his own definition of world economy, I don't see how there was more than one during his period, because all his other "world economies (East Asia, India, sub-Saharan Africa, the Americas, etc., etc.) were all tied in to the same economic system as western-and-central Europe. In fact, Braudel goes on at great length about these ties! (At most, Australia and Oceania might have been outside the world economy during this period.) This is also the volume where the, let us say, eccentricity of Braudel's economic thought began to press on me *. It was his discussions of cycles, "the conjuncture" and time-series decomposition which however truly irritated me. Or, rather, it made me want to sit him down and give him a lecture on the Yule-Slutsky effect, because I am quite certain he was smart enough to grasp it **. --- All these remarks are, of course, the height of presumption on my part. §
Previously.
*: After quoting a detailed passage from Ricardo about how both Portugal and England are better off if the former grows wine and the latter grows wheat and they exchange, Braudel spends many pages going over how Portuguese wine-growers came to rely on credit from English merchants. Stipulating that this is all true, and even stipulating that in some sense those English merchants dominated the Portuguese vintners, it does not refute Ricardo! (The cooperative socialist commonwealth will care very much about comparative advantage.) Or, again, Braudel repeatedly talks about how certain cities or countries were dis-advantaged by their high wages, without ever considering that some employers there must have felt those wages were worth paying. Indeed many employers there must have, or those would not have been the prevailing wages. --- In general, I sympathize with wanting to rescue older perspectives, here those of the mercantilists, from the condescension of posterity, but I think Braudel takes that too far, to the detriment of his understanding of his material. ^
**: To be fair, there are some hints in those passages that Braudel might have been happy to accept Slutsky's perspective on the effect. Namely: the appearance of low-frequency cycles is the natural consequence of high-frequency noise (what Braudel would call "events") whose effects just take time to work their way through the economic system. (This reminds me that I need to actually read Barnett's biography of Slutsky one of these years.) ^
Tamim Ansary, The Invention of Yesterday: A 50,000-Year History of Human Culture, Conflict, and Connection
It's not quite true to say that this is an attempt to write Marshall Hodgson's never-completed world history as a volume of pop history. This is not quite true because it is also, and equally, inspired by McNeill and McNeill's The Human Web. The result is extremely engaging, and while I didn't particularly learn from it, I daresay most of the prospective audience will not, in fact, have read as widely in Ansary's sources as I happen to have done. §
Errata: When describing Mesopotamian civilizations, Ansary repeatedly refers to Sumerian as a Semitic language, which is wrong. This is not particularly consequential, and I didn't notice any other errors of fact.
Disclaimer: My grandfather and Ansary's father were friends, so he's a family connection.

Books to Read While the Algae Grow in Your Fur; Writing for Antiquity; The Great Transformation; Mathematics; Enigmas of Chance; The Dismal Science

Posted at September 30, 2024 23:59 | permanent link

Three-Toed Sloth