Notebooks
http://bactra.org/notebooks
Cosma's NotebooksenPower Law Distributions, 1/f Noise, Long-Memory Time Series
http://bactra.org/notebooks/2021/10/26#power-laws
<P>Why do physicists care about power laws so much?
<P>I'm probably not the best person to speak on behalf of our tribal obsessions
(there was a long debate among the faculty at my thesis defense as to whether
"this stuff is really physics"), but I'll do my best. There are two parts
to this: power-law decay of correlations, and power-law size distributions.
The link is tenuous, at best, but they tend to get run together in our heads,
so I'll treat them both here.
<P>The reason we care about power law correlations is that we're conditioned to
think they're a sign of something interesting and complicated happening. The
first step is to convince ourselves that in boring situations, we don't see
power laws. This is fairly easy: there are pretty good and rather generic
arguments which say that systems in thermodynamic equilibrium, i.e. boring
ones, should have correlations which decay exponentially over space and time;
the reciprocals of the decay rates are the correlation length and the
correlation time, and say how big a typical fluctuation should be. This is
roughly first-semester graduate statistical mechanics. (You can find those
arguments in, say, volume one of Landau and Lifshitz's <cite>Statistical
Physics.)</cite>
<P>Second semester graduate stat. mech. is where those arguments break down ---
either for systems which are far from equilibrium
(e.g., <a href="turbulence.html">turbulent flows</a>), or in equilibrium but
very close to a critical point (e.g., the transition from a solid to liquid
phase, or from a non-magnetic phase to a magnetized
one). <a href="phase-transitions.html">Phase transitions</a> have fluctuations
which decay like power laws, and many non-equilibrium systems do too. (Again,
for phase transitions, Landau and Lifshitz has a good discussion.) If you're a
statistical physicist, phase transitions
and <a href="noneq-sm.html">non-equilibrium</a> processes <em>define</em> the
terms "complex" and "interesting" --- especially phase transitions, since we've
spent the last forty years or so developing a very successful theory of
critical phenomena. Accordingly, whenever we see power law correlations, we
assume there must be something complex and interesting going on to produce
them. (If this sounds like the fallacy of affirming the consequent, that's
because it is.) By a kind of transitivity, this makes power laws interesting
in themselves.
<P>Since, as physicists, we're generally more comfortable working in the
frequency domain than the time domain, we often transform the autocorrelation
function into the Fourier spectrum. A power-law decay for the correlations as
a function of time translates into a power-law decay of the spectrum as a
function of frequency, so this is also called "1/f noise".
<P>Similarly for power-law distributions. A simple use of the Einstein
fluctuation formula says that thermodynamic variables will have Gaussian
distributions with the equilibrium value as their mean. (The usual version of
this argument is <em>not</em> very precise.) We're also used to seeing
exponential distributions, as the probabilities of microscopic states. Other
distributions weird us out. Power-law distributions weird us out even more,
because they seem to say there's no typical scale or size for the variable,
whereas the exponential and the Gaussian cases both have natural scale
parameters. There is a connection here with fractals, which also lack typical
scales, but I don't feel up to going into that, and certainly a lot of the
power laws physicists get excited about have no obvious connection to any kind
of (approximate) fractal geometry. And there are lots of power law
distributions in all kinds of data, especially social data --- that's why
they're also called Pareto distributions, after the sociologist.
<P>Physicists have devoted quite a bit of time over the last two decades to
seizing on what look like power-laws in various non-physical sets of data, and
trying to explain them in terms we're familiar with, especially phase
transitions. (Thus "<a href="soc.html">self-organized criticality</a>".) So
badly are we infatuated that there is now a huge, rapidly growing literature
devoted to <a href="tsallis.html">"Tsallis statistics" or "non-extensive
thermodynamics"</a>, which is a recipe for modifying normal statistical
mechanics so that it produces power law distributions; and this, so far as I
can see, is its <em>only</em> good feature. (I will not attempt, here, to
support that sweeping negative verdict on the work of many people who have more
credentials and experience than I do.) This has not been one of our more
successful undertakings, though the basic motivation --- "let's see what we can
do!" --- is one I'm certainly in sympathy with.
<P>There have been two problems with the efforts to explain all power laws
using the things statistical physicists know. One is that (to mangle Kipling)
there turn out to be nine and sixty ways of constructing power laws,
and <em>every single one of them is right</em>, in that it does indeed produce
a power law. Power laws turn out to result from a kind of central limit
theorem for multiplicative growth processes, an observation which apparently
dates back to Herbert Simon, and which has been rediscovered by a number of
physicists (for instance, Sornette). Reed and Hughes have established an even
more deflating explanation (see below). Now, just because these simple
mechanisms exist, doesn't mean they explain any particular case, but
it <em>does</em> mean that you can't legitimately argue "My favorite mechanism
produces a power law; there is a power law here; it is very unlikely there
would be a power law if my mechanism were not at work; therefore, it is
reasonable to believe my mechanism is at work here." (Deborah Mayo would say
that finding a power law does not constitute a <a
href="../reviews/error/">severe test</a> of your hypothesis.) You need to do
"differential diagnosis", by identifying other, <em>non</em>-power-law
consequences of your mechanism, which other possible explanations don't share.
This, we hardly ever do.
<P>Similarly for 1/f noise. Many different kinds of stochastic process, with
no connection to critical phenomena, have power-law correlations.
Econometricians and <a href="time-series.html">time-series analysts</a> have
studied them for quite a while, under the general heading of "long-memory"
processes. You can get them from things as simple as a superposition of
Gaussian autoregressive processes. (We have begun to awaken to this fact,
under the heading of "fractional Brownian motion".)
<P>The other problem with our efforts has been that a lot of the power-laws
we've been trying to explain are not, in fact, power-laws. I should perhaps
explain that statistical physicists are called that, not because we know a lot
of statistics, but because we study the large-scaled, aggregated effects of the
interactions of large numbers of particles, including, specifically, the
effects which show up as fluctuations and noise. In doing this we learn,
basically, nothing about <a href="statistics.html">drawing inferences from
empirical data</a>, beyond what we may remember about curve fitting and
propagation of errors from our undergraduate lab courses. Some of us,
naturally, <em>do</em> know a lot of statistics, and
even <a href="http://www.stat.cmu.edu/~cshalizi/">teach</a> it --- I might
mention Josef Honerkamp's superb <cite>Stochastic Dynamical Systems</cite>.
(Of course, that book is out of print and hardly ever cited...)
<P>If I had, oh, let's say fifty dollars for every time I've seen a slide (or a
preprint) where one of us physicists makes a log-log plot of their data, and
then reports as the exponent of a new power law the slope they got from doing a
least-squares linear fit, I'd at least not grumble. If my colleagues had gone
to statistics textbooks and looked up how to estimate the parameters of a
Pareto distribution, I'd be a happier man. If any of them had actually tested
the hypothesis that they had a power law against alternatives like stretched
exponentials, or especially log-normals, I'd think
the <a href="millenarian.html">millennium</a> was at hand. (If you want to
know how to do these things, please
<a href="http://arxiv.org/abs/0706.1062">read this paper</a>, whose merits are
entirely due to my co-authors.) The situation for 1/f noise is not so dire,
but there have been and still are plenty of abuses, starting with the fact that
simply taking the fast Fourier transform of the autocovariance function
does <em>not</em> give you a reliable estimate of the power
spectrum, <em>particularly</em> in the tails. (On that point, see, for
instance, Honerkamp.)</P>
<P>See also:
<a href="chaos.html">Chaos and Dynamical Systems</a>;
<a href="complex-networks.html">Complex Networks</a>;
<a href="soc.html">Self-Organized Criticality</a>;
<a href="time-series.html">Time Series</a>;
<a href="tsallis.html">Tsallis Statistics</a>
<ul>Recommended, bigger picture:
<li>Michael Mitzenmacher, "A Brief History of Generative Models for
Power Law and Lognormal Distributions", <cite>Internet Mathematics</cite>
<strong>1</strong> (2003): 226--251
[<a
href="http://www.internetmathematics.org/volumes/1/2/pp226_251.pdf">PDF</a>]
<li>M. E. J. Newman, "Power laws, Pareto distributions and Zipf's
law", <a href="http://arxiv.org/abs/cond-mat/0412004">cond-mat/0412004</a> [If
you read one other thing on power laws, read this]
<li>Manfred Schroeder, <cite>Fractals, Chaos, Power Laws: Minutes from
an Infinite Paradise</cite>
</ul>
<ul>Recommended, more technical or more specialized:
<li>Robert J. Adler, Raise E. Feldman and Murad S. Taqqu
(eds.), <cite>A Practical Guide to Heavy Tails</cite> [Presumes that you
already know something about statistics and stochastic processes, so not
suitable for beginners.]
<li>Barry C. Arnold, <cite>Pareto Distributions</cite> [Fine guide
to the statistical literature, as it was in 1983; still valuable, though
many things which were nasty computations then are easy now.]
<li>Ayan Bhattacharya, Bohan Chen, Remco van der Hofstad, Bert Zwart, "Consistency of the PLFit estimator for power-law data", <a href="http://arxiv.org/abs/2002.06870">arxiv:2002.06870</a> [That is, of the Clauset et al. 2009 estimator]
<li>Arijit Chakrabarty, "Effect of truncation on large deviations for heavy-tailed random vectors", <a href="http://arxiv.org/abs/1107.2476">arxiv:1107.2476</a>
<li>Aaron Clauset, Maxwell Young, and Kristian Skrede Gleditsch, "Scale
Invariance in the Severity of
Terrorism", <a href="http://arxiv.org/abs/physics/0606007">physics/0606007</a>
[Surprising, but well-supported]
<li>F. Clementi, T. Di Matteo, M. Gallegati, "The Power-law Tail
Exponent of Income Distributions", <a href="http://dx.doi.org/10.1016/j.physa.2006.04.027"><cite>Physica
A</cite> <strong>370</strong> (2006): 49--53</a>, <a
href="http://arxiv.org/abs/physics/0603061">physics/0603061</a>
[An interesting way to improve
the accuracy of Hill-type (tail-conditional maximum likelihood) estimates of
the scaling parameter. Written with few concessions to those who are neither
statisticians nor econometricians. Not directly suitable for determining
the <em>range</em> of the scaling region. Income distribution is used only as
an example.]
<li>Andrew M. Edwards, Richard A. Phillips, Nicholas W. Watkins, Mervyn
P. Freeman, Eugene J. Murphy, Vsevolod Afanasyev, Sergey V. Buldyrev,
M. G. E. da Luz, E. P. Raposo, H. Eugene Stanley and Gandhimohan
M. Viswanathan, "Revisiting Lévy flight search patterns of wandering
albatrosses, bumblebees and
deer", <a href="http://dx.doi.org/10.1038/nature06199"><cite>Nature</cite>
<strong>449</strong> (2007): 1044--1048</a>
<li>Paul Embrechts and Makoto Maejima, <cite>Selfsimilar
Processes</cite>
<li>Robert P. Freckleton and William J. Sutherland [thanks to Nick Watkins for pointing these out]
<ul>
<li>"Do in-hospital waiting lists show self-regulation?", <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1279500/"><cite>Journal of the Royal Society of Medicine</cite> <strong>95</strong> (2002): 164</a>
<li>"Hospital waiting-lists (Communication arising): Do power laws imply self-regulation", <a href="http://dx.doi.org/10.1038/35096646"><cite>Nature</cite> <strong>413</strong> (2001): 382</a>
</ul>
<li>Alexander Gnedin, Ben Hansen, Jim Pitman, "Notes on the occupancy problem with infinitely many boxes: general asymptotics and power laws",
<cite>Probability Surveys</cite> <strong>4</strong> (2007):
146--171, <a href="http://arxiv.org/abs/math.PR/0701718">arxiv:math.PR/0701718</a>
<li>Michel L. Goldstein, Steven A. Morris and Gary G. Yen, "Fitting to
the Power-Law Distribution", <a
href="http://arxiv.org/abs/cond-mat/0402322">cond-mat/0402322</a> [Pedestrian,
but accurate, exposition in terms physicists and engineers are likely to
understand. Insufficiently sourced to the statistical literature; e.g., their
calculation of the maximum likelihood estimator was first published in 1952.]
<li>Timothy Graves, Robert B. Gramacy, Christian Franzke, Nicholas Watkins, "A brief history of long memory", <a href="http://arxiv.org/abs/1406.6018">arxiv:1406.6018</a>
<li>Vladimir Hlasny, "Parametric representation of the top of income distributions: Options, historical evidence, and model selection",
<a href="https://doi.org/10.1111/joes.12435"><cite>Journal of Economic Surveys</cite> <strong>35</strong> (2021): 1217--1256</a>
<li>Josef Honerkamp, <cite>Stochastic Dynamical Systems: Concepts,
Numerical Methods, Data Analysis</cite>
<li>Yuji Ijiri and Herbert Simon, <cite>Skew Distributions and the
Sizes of Business Firms</cite> [Collects Simon and co.'s pioneering papers on
power laws and related distributions --- including "On a Class of Skew
Distribution Functions", below --- as well as considering the limitations,
alternatives, modifications to match data, statistical issues, the connection
to Bose-Einstein statistics, the importance of going beyond just staring at
distributional plots if you want to learn about mechanisms, etc., etc. This
was all published in 1977...]
<li>A. James and M. J. Plank, "On fitting power laws to ecological
data", <a href="http://arxiv.org/abs/0712.0613">arxiv:0712.0613</a>
<li>Raya Khanin and Ernst Wit, "How Scale-Free Are Biological
Networks?",
<a href="http://dx.doi.org/"><cite>Journal of Computational Biology</cite>
<strong>13</strong> (2006): 810--818</a> [Ans.: not very scale-free at all.]
<li>Joel Keizer, <cite>Statistical Thermodynamics of Nonequilibrium
Processes</cite> [Has a good discussion of critical fluctuations in chapter
8. <a href="../reviews/keizer/">Review: Molecular Fluctuations for Fun and
Profit</a>]
<li>Paul Krugman, <cite>The Self-Organizing Economy</cite> [Has a nice
discussion of power-law size distributions in economics. <a
href="../reviews/self-organizing-economy/">Review</a>]
<li>Michael LaBarbera, "Analyzing Body Size as a Factor in Ecology and
Evolution", <cite>Annual Review of Ecology and
Systematics</cite> <strong>20</strong> (1989): 91--117 [Statistical problems in
many studies of power-law scaling in biology, their effects on the conclusions
of those studies (ranging from "wrong, but correctable" to "meaningless"), and
how to do it right. <a
href="http://www.jstor.org/pss/2097086">JSTOR</a>]
<li>J. Laherrère and D. Sornette, "Stretched exponential
distributions in nature and economy: 'fat tails' with characteristic scales",
<cite>The European Physical Journal B</cite> <strong>2</strong> (1998):
525--539
<li>L. D. Landau and E. M. Lifshitz, <cite>Statistical Physics</cite>
[For the theory of fluctuations in statistical mechanics, and for critical
phenomena in equilibrium]
<li>Adrián López García de Lomana, Qasim K. Beg,
G. de Fabritiis and Jordi Villà-Freixa, "Statistical Analysis of Global Connectivity and Activity Distributions in Cellular Networks", <a href="http://arxiv.org/abs/1004.3138">arxiv:1004.3138</a>
<li>R. Dean Malmgren, Daniel B. Stouffer, Adilson E. Motter, Luis A.N. Amaral, "A Poissonian explanation for heavy-tails in e-mail communication",
<a href="http://dx.doi.org/10.1073/pnas.0800332105"><cite>Proceedings of the
National Academy of Sciences</cite> (USA) <strong>105</strong> (2008): 18153--18158</a>, <a href="http://arxiv.org/abs/0901.0585">arxiv:0901.0585</a>
<li>Elliott W. Montroll and Michael F. Shlesinger, "On 1/f noise and
other distributions with long
tails", <a href="http://www.pnas.org/content/79/10/3380.abstract"><cite>Proceedings
of the National Academy of Sciences (USA)</cite> <strong>79</strong> (1982):
3380--3383</a>
<li>V. F. Pisarenko and D. Sornette, "New statistic for financial
return distributions: power-law or exponential?", <a
href="http://arxiv.org/abs/physics/0403075">physics/0403075</a> [Actually, two
new statistics: one converges to a constant if the distribution you're sampling
from is an exponential, independent of the exponent, and the other converges to
a constant if the distribution is a power law, independent of the power. They
even have some indications of the sampling distributions, so you can at least
gauge the statistical signifcance, i.e., the probability of deviations from the
ideal value, even though the distribution really is of the appropriate type. I
don't recall anything about the power of these statistics, however (i.e., the
probability that a power law will look like an exponential, or vice-versa).]
<li>William J. Reed and Barry D. Hughes, "From Gene Families and Genera
to Incomes and Internet File Sizes: Why Power Laws are so Common in
Nature", <a href="http://dx.doi.org/10.1103/PhysRevE.66.067103"><cite>Physical
Review E</cite> <strong>66</strong> (2002): 067103</a> [This is, as I said,
perhaps the most deflating possible explanation for power law size
distributions. Imagine you have some set of piles, each of which grows,
multiplicatively, at a constant rate. New piles are started at random times,
with a constant probability per unit time. (This is a good model of my
office.) Then, at any time, the age of the piles is exponentially distributed,
and their size is an exponential function of their age; the two exponentials
cancel and give you a power-law size distribution. The basic combination of
exponential growth and random observation times turns out to work even if it's
only the <em>mean</em> size of piles which grows exponentially.]
<li> M. V. Simkin and V. P. Roychowdhury, "Re-inventing Willis",
<a href="http://arxiv.org/abs/physics/0601192">physics/0601192</a> [The comical,
yet pathetic, history of the innumerable re-inventions of basic mechanisms
which plague this area]
<li>Herbert Simon, "On a Class of Skew Distribution Functions",
<cite>Biometrika</cite> <strong>42</strong> (1955): 425--440 [<a href="http://www.jstor.org/pss/2333389">JSTOR</a>]
<li>Didier Sornette
<ul>
<li>"Multiplicative Processes and Power Laws" <a
href="http://arxiv.org/abs/cond-mat/9708231">cond-mat/9708231</a>
= <cite>Physical Review E</cite> <strong>57</strong> (1998): 4811--4813
<li>"Mechanism for Powerlaws without Self-Organization"
<a href="http://arxiv.org/abs/cond-mat/0110426">cond-mat/0110426</a>
</ul>
<li>Stilian A. Stoev, George Michailidis, and Murad S. Taqqu,
"Estimating heavy-tail exponents through max self-similarity", <a
href="http://arxiv.org/abs/math.ST/0609163">math.ST/0609163</a>
<li>Bruce J. West and Bill Deering, <citE>The Lure of Modern Science:
Fractal Thinking</cite> [Despite the <em>painful</em> title, this is actually a
very good book. I disagree with some of the more philosophical positions they
take, but on the actual science and math they're quite sound.]
<li>Wei Biao Wu, Yinxiao Huang and Wei Zheng, "Covariances estimation
for long-memory
processes", <a href="http://projecteuclid.org/euclid.aap/1269611147"><cite>Advances
in Applied Probability</cite> <strong>42</strong> (2010): 137--157</a> [How big
are the errors in your covariance estimates?]
<li>Damian H. Zanette, "Zipf's law and the creation of musical
context", <a href="http://arxiv.org/abs/cs.CL/0406015">cs.CL/0406015</a> [This
<em>sounds</em> bizarre, and I'd not have bothered to even note it if I didn't
know Zanette's work in other areas, which shows him to be a good and careful
scientist. And this is actually an interesting and meaningful little paper,
which has something non-trivial to say about music. It's worth noting,
perhaps, that the distribution he actually ends up fitting isn't a pure power
law, but a modification inspired by Simon's paper. Thanks to John Burke for
prodding me to actually read it.]
</ul>
<ul>Not altogether recommended (without being actively dis-recommended either):
<li>R. Alexander Bentley, Paul Ormerod, Michael Batty, "An evolutionary
model of long tailed distributions in the social
sciences", <a href="http://arxiv.org/abs/0903.2533">arxiv:0903.2533</a> [This
is a minor modification of the classical Yule/Simon mechanism for random
growth, with the main advantage being that (with the right parameter tweaking)
it allows for more turn-over of which values are most common. Unsurprisingly,
this is done by adding extra parameters, and so the family of distributions is
more flexible. But they use bad statistical procedures, and the finding that
the estimated power law exponent grows as the amount of data held in the tail
shrinks is simply explained: the tails aren't power laws.]
</ul>
<ul>Recommended, of a not entirely serious character:
<li>Mason Porter's <a href="http://www.cafepress.com/ThePowerLawShop">Power Law Shop</a>
</ul>
<ul>Modesty forbids me to recommend:
<li>Aaron Clauset, CRS and M. E. J. Newman, "Power-law distributions in
empirical data", <cite>SIAM Review</cite> <strong>51</strong> (2009): 661--703, <a href="http://arxiv.org/abs/0706.1062">arxiv:0706.1062</a> [with commentary
by <a href="http://cs.unm.edu/~aaron/blog/archives/2007/06/power_laws_and.htm">Aaron</a>
and <a href="http://bactra.org/weblog/491.html">myself</a>]
</ul>
<ul>Pride compels me to recommend:
<li>Georg M. Goerg, "Lambert W random variables: A new family of generalized skewed distributions with applications to risk estimation",
<a href="http://dx.doi.org/10.1214/11-AOAS457"><cite>Annals of Applied
Statistics</cite>
<strong>5</strong> (2011):
2197--2230</a>, <a href="http://arxiv.org/abs/0912.4554">arxiv:0912.4554</a>
[Done while George was my student, but entirely independent work]
</ul>
<ul>To read:
<li>Eduardo G. Altmann and Holger Kantz, "Recurrence time analysis,
long-term correlations, and extreme events", <a
href="http://arxiv.org/abs/physics/0503056">physics/0503056</a>
<li>J. A. D. Aston, "Modeling macroeconomic time series via heavy
tailed distributions", <a
href="http://arxiv.org/abs/math.ST/0702844">math.ST/0702844</a>
<li>Stefan Aulbach and Michael Falk, "Testing for a generalized Pareto
process", <a href="http://dx.doi.org/10.1214/12-EJS728"><cite>Electronic
Journal of Statistics</cite> <strong>6</strong> (2012): 1779--1802</a>
<li>Katarzyna Bartkiewicz, Adam Jakubowski, Thomas Mikosch, Olivier Wintenberger, "Infinite variance stable limits for sums of dependent random variables", <a href="http://arxiv.org/abs/0906.2717">arxiv:0906.2717</a>
<li>Michael Batty, "Rank Clocks", <a
href="http://dx.doi.org/10.1038/nature05302"><cite>Nature</cite>
<strong>444</strong> (2006): 592--596</a>
<li>Marco Bee, Massimo Riccaboni and Stefano Schiavo, "Pareto versus lognormal: A maximum entropy test", <a href="http://dx.doi.org/10.1103/PhysRevE.84.026104"><citE>Physical
Review E</cite> <strong>84</strong> (2011); 026104</a>
<li>Jan Beran, Bikramjit Das, Dieter Schell, "On robust tail index estimation for linear long-memory processes", <a href="http://dx.doi.org/10.1111/j.1467-9892.2011.00774.x"><cite>Journal of Time Series Analysis</cite> <strong>33</strong> (2012): 406--423</a>
<li>Patrice Bertail, Stéphan Clémençon, and
Jessica Tressou, "Regenerative block-bootstrap confidence intervals for tail
and extremal indexes", <a href="http://dx.doi.org/10.1214/13-EJS807"><cite>Electronic Journal of Statistics</cite> <strong>7</strong> (2013): 1224--1248</a>
<li>P. Besbeas and B. J. T. Morgan, "Improved estimation of the stable
laws", <a href="http://dx.doi.org/10.1007/s11222-008-9050-6"><cite>Statistics
and Computing</cite> <strong>18</strong> (2008): 219--231</a>
<li>Eric Beutner, Henryk Zähle, "Continuous mapping approach to the asymptotics of U- and V-statistics", <a href="http://arxiv.org/abs/1203.1112">arxiv:1203.1112</a>
<li>Danny Bickson, Carlos Guestrin, "Linear Characteristic Graphical Models: Representation, Inference and Applications", <a href="http://arxiv.org/abs/1008.5325">arxiv:1008.5325</a> [<a href="graphical-models.html">Graphical models</a> with heavy-tailed latent variables]
<li>Thierry Bochud and Damien Challet, "Optimal approximations of
power-laws with exponentials", <a
href="http://arxiv.org/abs/physics/0605149">physics/0605149</a> ["We propose an
explicit recursive method to approximate a power-law with a finite sum of
weighted exponentials. Applications to moving averages with long memory are
discussed in relationship with stochastic volatility models." The last part
sounds like a rediscovery of Granger.]
<li>Stéphane Boucheron and Maud Thomas, "Tail index estimation, concentration and adaptivity", <a href="http://arxiv.org/abs/1503.05077">arxiv:1503.05077</a>
<li>Christian Brownlees, Emilien Joly, Gabor Lugosi, "Empirical risk minimization for heavy-tailed losses", <a href="http://arxiv.org/abs/1406.2462">arxiv:1406.2462</a>
<li><a href="http://ssrn.com/author=75695">Laurent E. Calvet</a> and
Adlai J. Fisher, <cite>Multifractal Volatility: Theory, Forecasting, and
Pricing</cite> [Thanks to Prof. Calvet for bringing this to my attention]
<li>Anna Carbone and Giuliano Castelli, "Scaling Properties of
Long-Range Correlated Noisy Signals,"
<a href="http://arxiv.org/abs/cond-mat/0303465">cond-mat/0303465</a>
<li>Alexandra Carpentier, Arlene K.H. Kim, "Adaptive and minimax optimal estimation of the tail coefficient", <a href="http://arxiv.org/abs/1309.2585">arxiv:1309.2585</a>
<li>C. Cattuto, V. Loreto and V. D. P. Servedio, "A Yule-Simon process
with memory", <a
href="http://arxiv.org/abs/cond-mat/0608672">cond-mat/0608672</a> [Memo
to self: compare this to the auto-correlated Yule-Simon process in
Ijiri and Simon's book.]
<li>Arijit Chakrabarty, "Central Limit Theorem and Large Deviations for truncated heavy-tailed random vectors", <a href="http://arxiv.org/abs/1003.2159">arxiv:1003.2159</a>
<li>Arijit Chakrabarty, Gennady Samorodnitsky, "Understanding heavy tails in a bounded world or, is a truncated heavy tail heavy or not?", <a href="http://arxiv.org/abs/1001.3218">arxiv:1001.3218</a>
<li>Anirban Chakraborti, Marco Patriarca, "A Variational Principle for
Pareto's power
law", <a href="http://arxiv.org/abs/cond-mat/0605325">cond-mat/0605325</a>
<li>Ali Chaouche and Jean-Noel Bacro, "Statistical Inference for
the Generalized Pareto Distribution: Maximum Likelihood Revisited", <a href="http://dx.doi.org/10.1080/03610920500501429"><citE>Communications in Statistics: Theory and Methods</cite> <strong>35</strong>
(2006): 785--802</a>
<li>F. Clementi, M. Gallegati, "Pareto's Law of Income Distribution:
Evidence for Germany, the United Kingdom, and the United
States", <a href="http://arxiv.org/abs/physics/0504217">physics/0504217</a>
<li>Cline, heavy-tailed noise, 1983 (?)
<li>B. Conrad and M. Mitzenmacher, "Power Laws for Monkeys Typing
Randomly: The Case of Unequal Probabilities", <cite>IEEE Transactions on
Information Theory</cite> <strong>50</strong> (2004): 1403--1414
<li>J. Danielsson, L. de Haan, L. Peng and C. G. de Vries, "Using a Bootstrap Method to Choose the Sample Fraction in Tail Index Estimation",
<a href="http://dx.doi.org/10.1006/jmva.2000.1903"><cite>Journal of Multivariate Analysis</cite> <strong>76</strong> (2001): 236--248</a>
<li>Bikramjit Das and Siddney I. Resnick, "QQ plots, Random sets and
data from a heavy tailed distribution", <a
href="http://arxiv.org/abs/math.PR/0702551">math.PR/0702551</a>
<li>Anirban Dasgupta, John Hopcroft, Jon Kleinberg and
Mark Sandler, "On Learning Mixtures of Heavy-Tailed Distributions"
<li>Nima Dehghani, Nicholas G. Hatsopoulos, Zach D. Haga, Rebecca A. Parker, Bradley Greger, Eric Halgren, Sydney S. Cash, Alain Destexhe, "Avalanche analysis from multi-electrode ensemble recordings in cat, monkey and human cerebral cortex during wakefulness and sleep", <a href="http://arxiv.org/abs/1203.0738">arxiv:1203.0738</a> [Ummm, we explain why you can't use $R^2$ that way in the paper you cite...]
<li>T. Di Matteo, T. Aste and M. Gallegati, "Innovation flow through
social networks: Productivity distribution", <a
href="http://arxiv.org/abs/physics/0406091">physics/0406091</a> [Those look an
awful lot like log-normals to me.]
<li>Paul Doukhan, George Oppenheim and Murad S. Taqqu
(eds.), <cite>Theory and Applications of Long-Range Dependence</cite>
<li>Rick Durrett and Jason Schweinsberg, "Power laws for family sizes
in a duplication model", <a
href="http://arxiv.org/abs/math.PR/0406216">math.PR/0406216</a>
<cite>Annals of Probability</cite> <strong>33</strong> (2005): 2094--2126
<li>R. Fox and M. S. Taqqu
<ul>
<li>"Noncentral Limit Thorems for Quadratic Forms in Random
Variables Having Long-Range Dependence," <a href="http://projecteuclid.org/euclid.aop/1176993001"><cite>Annals of Probability</cite>
<strong>13</strong> (1985) 428--446</a>
<li>"Central Limit Theorems for Quadratic Forms in Random
Variables Having Long-Range Dependence," <cite>Probability Theory and Related
Fields</cite> <strong>74</strong> (1987): 213--240
</ul>
<li>G. Frenkel, E. Katzav, M. Schwartz and N. Sochen, "Distribution of
Anomalous Exponents of Natural Images", <a
href="http://dx.doi.org/10.1103/PhysRevLett.97.103902"><cite>Physical Review
Letters</cite> <strong>97</strong> (2006): 103902</a>
<li>U. Frisch and D. Sornette, "Extreme Deviations and Applications",
<cite>J. Phys. I France</cite> <strong>7</strong> (1997): 1155--1171
<li>Akihiro Fujihara, Toshiya Ohtsuki and Hiroshi Yamamoto
<ul>
<li>"Power-law tails in nonstationary stochastic processes with
asymmetrically multiplicative interactions", <a
href="http://dx.doi.org/10.1103/PhysRevE.70.031106"><citE>Physical Review
E</cite> <strong>70</strong> (2004): 031106</a> = <a
href="http://arxiv.org/abs/cond-mat/0506785">cond-mat/0506785</a>
<li>"Similarity and Probability Distribution Functions in
Many-body Stochastic Processes with Multiplicative Interactions", <a
href="http://arxiv.org/abs/cond-mat/0508615">cond-mat/0508615</a>
</ul>
<li>Akihiro Fujihara, Satoshi Tanimoto, Toshiya Ohtsuki, Hiroshi
Yamamoto, "Log-normal distribution in growing systems with weighted
multiplicative interactions", <a
href="http://arxiv.org/abs/cond-mat/0511625">cond-mat/0511625</a>
<li>Yoshi Fujiwara, Corrado Di Guilmi, Hideaki Aoyama, Mauro Gallegati,
Wataru Souma, "Do Pareto-Zipf and Gibrat laws hold true? An analysis with
European
Firms", <a href="http://arxiv.org/abs/cond-mat/0310061">cond-mat/0310061</a>
<li>Xavier Gabaix, "Power Laws in Economics and Finance"
[<a href="http://pages.stern.nyu.edu/~xgabaix/papers/pl-ar.pdf">PDF preprint</a>]
<li>Michael Golosovsky and Sorin Solomon, "Stochastic Dynamical Model
of a Growing Citation Network Based on a Self-Exciting Point
Process", <a href="http://dx.doi.org/10.1103/PhysRevLett.109.098701"><cite>Physical
Review Letters</cite> <strong>109</strong> (2012): 098701</a>
<li>M. Ivette Gomes, M. Isabel Fraga Alves, Paulo Araujo Santos,
"PORT Hill and Moment Estimators for Heavy-Tailed Models",
<a href="http://dx.doi.org/10.1080/03610910802050910"><cite>Communications in Statistics:
Simulation and Computation</cite> <strong>37</strong>
(2008): 1281--1306</a>
<li>Alexander Gnedin, Ben Hansen, Jim Pitman, "Notes on the occupancy
problem with infinitely many boxes: general asymptotics and power
laws", <a href="http://arxiv.org/abs/math.PR/0701718">math.PR/0701718</a>
<li>J. A. Gubner, "Theorems and Fallacies in the Theory of
Long-Range-Dependent Processes", <a
href="http://dx.doi.org/10.1109/TIT.2004.842768"><cite>IEEE Transactions on
Information Theory</cite> <strong>51</strong> (2005): 1234--1239</a>
<li>Alexandra Guerrero and Leonard A. Smith, "A maximum likelihood
estimator for long-range persistence", <a
href="http://dx.doi.org/10.1016/j.physa.2005.03.002"><cite>Physica
A</cite> <strong>355</strong> (2005): 619--632</a>
<li>Rudolf Hanel and Stefan Thurner, "On the Derivation of power-law
distributions within standard statistical mechanics", <a
href="http://arxiv.org/abs/cond-mat/0412016">cond-mat/0412016</a>
<li>Bruce M. Hill and Michael Woodroofe, "Stronger Forms of Zipf's
Law", <cite>Journal of the American Statistical
Association</cite> <strong>70</strong> (1975): 212--219 [Deriving Zipf's law
from Bose-Einstein
statistics. <a href="http://www.jstor.org/stable/2285406">JSTOR</a>]
<li>Byoung Hee Hong, Kyoung Eun Lee, Jae Woo Lee, "Power Law in Firms
Bankruptcy", <a href="http://arxiv.org/abs/physics/0701302">physics/0701302</a>
<li>Y. Hosoya
<ul>
<li>"The quasi-likelihood approach to statistical inference on
multiple time-series with long-range dependence," <cite>Journal of
Econometrics</cite> <strong>73</strong> (1996): 217--236
<li>"A limit theory for long-range dependence and statistical
inference on related models," <a href="http://projecteuclid.org/euclid.aos/1034276623"><cite>Annals of Statistics</cite>
<strong>25</strong> (1997): 105--137</a>
</ul>
<li>Henrik Hult and Gennady Samorodnitsky, "Large deviations for point processes based on stationary sequences with heavy tails", <a href="http://projecteuclid.org/euclid.jap/1269610814"><cite>Journal of Applied Probability</cite> <strong>47</strong> (2010): 1--40</a>
<li>Takashi Ichinomiya, "Power-law distribution in Japanese racetrack
betting", <a href="http://arxiv.org/abs/physics/0602165">physics/0602165</a>
<li>Sanja Janicevic, Lasse Laurson, Knut Jorgen Maloy, Stephane Santucci, and Mikko J. Alava, "Interevent Correlations from Avalanches Hiding Below the Detection Threshold", <a href="http://dx.doi.org/10.1103/PhysRevLett.117.230601"><cite>Physical Review Letters</cite> <strong>117</strong> (2016): 230601</a>
<li>Milton Jara, Tomasz Komorowski and Stefano Olla,
"Limit theorems for additive functionals of a Markov chain", <a href="http://arxiv.org/abs/0809.0177">arxiv:0809.0177</a> [Convergence to alpha-stable
distributions]
<li>Predrag R. Jelenkovic, Jian Tan, "Modulated Branching Processes,
Origins of Power Laws and Queueing
Duality", <a href="http://arxiv.org/abs/0709.4297">0709.4297</a>
<li>Junghyo Jo, Jean-Yves Fortin, M. Y. Choi, "Weibull-type limiting distribution for replicative systems", <cite>Physical Review E</cite> <strong>83</strong> (2011): 031123, <a href="http://arxiv.org/abs/1103.3038">arxiv:1103.3038</a>
<li>Taisei Kaizoji, "Power laws and market
crashes", <a href="http://arxiv.org/abs/physics/0603138">physics/0603138</a>
<li>Imen Kammoun, Vernoique Billat and Jean-Marc Bardet, "A new
stochastic process to model Heart Rate series during exhaustive run and an
estimator of its fractality
parameter", <a href="http://arxiv.org/abs/0803.3675">arxiv:0803.3675</a>
[Includes statistical criticism of the common, but deeply unsatisfying,
"detrended fluctuation analysis" method of estimating the Hurst exponent.]
<li>B. Kaulakys and J. Ruseckas, "Stochastic nonlinear
differential equation generating 1/f noise", <cite>Physical Review E</cite>
<strong>70</strong> (2004): 020101 = <a
href="http://arxiv.org/abs/cond-mat.0408507">cond-mat.0408507</a>
<li>K. Kiyani, S. C. Chapman and B. Hnat, "A method for extracting the
scaling exponents of a self-affine, non-Gaussian process from a finite length
timeseries", <a href="http://arxiv.org/abs/physics/0607238">physics/0607238</a>
<li>K. H. Kiyani, S. C. Chapman, N. W. Watkins, "Pseudo-nonstationarity in the scaling exponents of finite interval time series", <cite>Physical
Review E</cite> <strong>79</strong> (2009): 036109, <a href="http://arxiv.org/abs/0808.2036">arxiv:0808.2036</a>
<li>Henry Lam, Jose Blanchet, Damian Burch, Martin Z. Bazant, "Corrections to the Central Limit Theorem for Heavy-tailed Probability Densities",
<a href="http://dx.doi.org/0.1007/s10959-011-0379-y"><citE>Journal of Theoretical Probability</cite>
<strong>24</strong> (2011): 895--927</a>, <a href="http://arxiv.org/abs/1103.4306">arxiv:1103.4306</a>
<li>Francois M. Longin, "The Asymptotic Distribution of Extreme Stock
Market Returns", <cite>The Journal of Business</cite> <strong>69</strong>
(1996): 383--408
[<a
href="http://www.jstor.org/stable/2353373">JSTOR</a>]
<li>Fotis Loukissas, "Precise Large Deviations for Long-Tailed Distributions", <a href="http://dx.doi.org/10.1007/s10959-011-0367-2"><cite>Journal of Theoretical Probability</cite> <strong>25</strong> (2012): 913--924</a>
<li>Bruce D. Malamud, James D. A. Millington and George L. W. Perry,
"Characterizing wildfire regimes in the United States", <a
href="http://dx.doi.org/10.1073/pnas.0500880102"><cite>Proceedings of the
National Academy of Sciences</cite> (USA) <strong>102</strong> (2005):
4694--4699</a>
<li>Y. Malevergne, V.F. Pisarenko, D. Sornette, "Empirical
Distributions of Log-Returns: between the Stretched Exponential and the Power
Law?", <a href="http://arxiv.org/abs/physics/0305089">physics/0305089</a>
<li>Alon Manor and Nadav M. Shnerb, "Multiplicative Noise and Second Order Phase Transitions", <a href="http://dx.doi.org/10.1103/PhysRevLett.103.030601"><cite>Physical
Review Letters</cite> <strong>103</strong> (2009): 030601</a>
<li>Natalia Markovich, <cite>Nonparametric Analysis of Univariate
Heavy-Tailed Data: Research and Practice</cite>
<li>Matteo Marsili, "On the concentration of large deviations for fat tailed distributions", <a href="http://arxiv.org/abs/1201.2817">arxiv:1201.2817</a>
<li>Yosef E. Maruvka, David A. Kessler, Nadav M. Shnerb, "The
Birth-Death-Mutation process: a new paradigm for fat tailed
distributions", <a href="http://arxiv.org/abs/1011.4110">arxiv:1011.4110</a> [I
suspected from the abstract that this was Yet Another Rediscovery of the
Yule-Simon mechanism. However, after actually looking through the paper
(prompted by Dr. Shnerb), I see that they are in fact doing something more, and
that I was just wrong. I still need to read it properly, however, before
deciding what I think about the actual proposal.]
<li>Joseph L. McCauley, Gemunu H. Gunaratne, Kevin E. Bassler, "Hurst
Exponents, Markov Processes, and Fractional Brownian
motion", <a href="http://arxiv.org/abs/cond-mat/0609671">cond-mat/0609671</a>
<li>Richard Metzler, "Comment on 'Power-law correlations in the
southern-oscillation-index fluctuations characterizing El
Nino'", <a href="http://dx.doi.org/10.1103/PhysRevE.67.018201"><cite>Physical
Review E</cite> <strong>67</strong> (2003): 018201</a>
<li>Salvatore Miccich`, "Modeling long-range memory with stationary Markovian processes", <a href="http://dx.doi.org/10.1103/PhysRevE.79.031116"><cite>Physical
Review E</cite> <strong>79</strong> (2009): 031116</a>, <a href="http://arxiv.org/abs/arxiv:0806.0722">arxiv:arxiv:0806.0722</a>
<li>Thomas Mikosch, Sidney Resnick, Holger Rootzén, and Alwin Stegeman. "Is Network Traffic Appriximated by Stable Lévy Motion or Fractional Brownian Motion?", <a href="http://projecteuclid.org/euclid.aoap/1015961155"><cite>Annals of Applied Probability</cite> <strong>12</strong> (2002): 23--68</a>
<li>Thomas Mikosch, Olivier Wintenberger, "Precise large deviations for dependent regularly varying sequences", <a href="http://arxiv.org/abs/1206.1395">arxiv:1206.1395</a>
<li>Edoardo Milotti, "Model-based fit procedure for power-law-like
spectra", <a href="http://arxiv.org/abs/physics/0510011">physics/0510011</a>
<li>Mariusz Mirek, "Heavy tail phenomenon and convergence to stable laws for iterated Lipschitz maps", <a href="http://dx.doi.org/10.1007/s00440-010-0312-9"><cite>Probability Theory and Related Fields</cite> <strong>151</strong> (2011): 705--734</a>
<li>Elliott W. Montroll and Michael Shlesinger, "Maximum entropy
formalism, fractals, scaling phenomena and 1/f noise: A tale of tails",
<cite>Journal of Statistical Physics</cite> <strong>32</strong> (1983):
209--230
<li>Eric Moulines, Francois Roueff, Murad S. Taqqu, "A Wavelet Whittle
estimator of the memory parameter of a non-stationary Gaussian time
series", <a href="http://arxiv.org/abs/math/0601070">math/0601070</a>
<li>Newton J. Moura Jr. and Marcelo B. Ribeiro, "Zipf Law for Brazilian
Cities", <a href="http://arxiv.org/abs/physics/0511216">physics/0511216</a>
<li>J. F. Muzy, E. Bacry and A. Kozhemyak, "Extreme values and fat
tails of multifractal
fluctuations", <a href="http://dx.doi.org/"><cite>Physical Review
E</cite> <strong>73</strong> (2006): 066114</a>
= <a href="http://arxiv.org/abs/cond-mat/0509357">cond-mat/0509357</a>
["problem of the estimation of extreme event occurrence probability for data
drawn from some multifractal process. We also study the heavy (power-law) tail
behavior of probability density function associated with such data. We show
that because of strong correlations, standard extreme value approach is not
valid and classical tail exponent estimators should be interpreted cautiously"]
<li>Fuyuo Nagayama, "Wealth Inequality Among the Forbes 400 and U.S. Households Overall" [<a href="http://www.philadelphiafed.org/research-and-data/publications/research-rap/2013/wealth-inequality-among-forbes-400-and-us-households-overall.pdf">PDF preprint via Federal Reserve Bank of Philadelphia</a>]
<li>Marko Obradović, Milan Jovanović, Bojana Milošević, "Goodness-of-Fit Tests for Pareto Distribution Based on a Characterization and their Asymptotics", <a href="http://arxiv.org/abs/1310.5510">arxiv:1310.5510</a>
<li>Richard Perline, "Strong, Weak and False Inverse Power Laws",
<a href="http://dx.doi.org/10.1214/088342304000000215"><cite>Statistical
Science</cite> <strong>20</strong> (2005): 68--88</a>
<li>Sergei Petrovskii, Alla Mashanova, and Vincent A. A. Jansen, "Variation in individual walking behavior creates the impression of a Lévy flight", <a href="http://dx.doi.org/10.1073/pnas.1015208108"><cite>Proceedings of
the National Academy of Sciences</cite> (USA) <strong>108</strong> (2011): 8704--8707</a>
<li>William Rea, Les Oxley, Marco Reale and Jennifer Brown,
"Estimators for Long Range Dependence: An Empirical Study", <a href="http://arxiv.org/abs/0901.0762">arxiv:0901.0762</a>
<li>S. Redner, "Random multiplicative processes: An elementary
tutorial", <cite>American Journal of Physics</cite> <strong>58</strong>
(1990): 267--273
<li>Sidney I. Resnick, <cite><a
href="http://www.springer.com/978-0-387-24272-9&uid=13349451&mid=414478&aid=">Heavy-Tail Phenomena: Probabilistic and
Statistical Modeling</a></cite>
<li>Sidney Resnick and Catalin Starica, "Tail Index Estimation for
Dependent Data", <a href="http://projecteuclid.org/euclid.aoap/1028903376"><cite>The Annals of Applied
Probability</citE> <strong>8</strong> (1998): 1156--1183</a>
<li>Massimo Riccaboni, Fabio Pammolli, Sergey V. Buldyrev, Linda Ponta,
H. Eugene Stanley , "The Size Variance Relationship of Business Firm Growth
Rates", <a href="http://arxiv.org/abs/0904.1404">arxiv:0904.1404</a>
= <citE>Proceedings of the National Academy of Sciences</cite>
(USA) <strong>105</strong> (2008): 19595--19600
<li>C. Y. Robert, "Automatic Declustering of Rare Events", <a href="http://dx.doi.org/10.1093/biomet/ast013"><cite>Biometrika</cite> <strong>100</strong> (2013): 587--606</a>
<li>Alexander Roitershtein, "One-dimensional linear recursions with
Markov-dependent
coefficients", <a href="http://arxiv.org/abs/math/0409335">math/0409335</a>
= <a href="http://dx.doi.org/10%2E1214/105051606000000844"><citE>Annals of
Applied Probability</cite> <strong>17</strong> (2007): 572--608</a> [To
summarize the abstract, suppose S(n) = A(n) + B(n)*S(n-1), where A(n) and B(n)
are Markov sequences. Then "the distribution tail of its stationary solution
has a power law decay." This sounds like Simon's argument made more general.]
<li>Holger Rootzen, M. Ross Leadbetter and Laurens de Haan, "On the
distribution of tail array sums for strongly mixing stationary
sequences", <a href="http://projecteuclid.org/euclid.aoap/1028903454"><cite>Annals of Applied Probability</cite> <strong>8</strong>
(1998): 868--885</a>
<li>Gordon J Ross, Tim Jones, "Understanding the Heavy Tailed Dynamics in Human Behavior", <a href="http://arxiv.org/abs/1505.01547">arxiv:1505.01547</a>
<li>Gennady Samorodnitsky and Murad S. Taqqu, <cite>Stable
Non-Gaussian Random Processes</cite>
<li>Francois G. Schmitt and Yongxiang Huang, <cite><a href="http://cambridge.org/9781107067615">Stochastic Analysis of Scaling Time Series:
From Turbulence Theory to Applications</a></cite>
<li>David J. Schwab, Ilya Nemenman, Pankaj Mehta, "Zipf's law and criticality in multivariate data without fine-tuning", <a href="http://arxiv.org/abs/1310.0448">arxiv:1310.0448</a>
<li>D. Sornette and V. F. Pisarenko, "Properties of a simple bilinear
stochastic model: estimation and predictability", <a
href="http://arxiv.org/abs/physics/0703217">physics/0703217</a>
<li>Attilio L. Stella, Fulvio Baldovin, "Anomalous scaling due to correlations: Limit theorems and self-similar processes", <a href="http://arxiv.org/abs/0909.0906">arxiv:0909.0906</a>
<li>Stilian A Stoev, George Michailidis, "On the Estimation of the Heavy-Tail Exponent in Time Series using the Max-Spectrum", <a href="http://arxiv.org/abs/1005.4329">arxiv:1005.4329</a>
<li>Stilian A. Stoev and Murad S. Taqqu, "Limit Theorems for Sums of Heavy-tailed Variables with Random Dependent Weights",
<a href="http://dx.doi.org/10.1007/s11009-006-9011-5"><cite>Methodology and
Computing in Applied Probability</cite> <strong>9</strong> (2007): 55--87</a>
<li>Sarah Touati, Mark Naylor, and Ian G. Main, "Origin and Nonuniversality of the Earthquake Interevent Time Distribution", <a href="http://dx.doi.org/10.1103/PhysRevLett.102.168501"><cite>Physical Review Letters</cite> <strong>102</strong> (2009): 168501</a>
<li>Ciprian Tudor and Frederi Viens, "Variations and estimators for the selfsimilarity order through Malliavin calculus", <a href="http://arxiv.org/abs/0709.3896">arxiv:0709.3896</a>
<li>Caglar Tuncay, "A universal model for languages and cities, and
their
lifetimes", <a href="http://arxiv.org/abs/physics/0703144">physics/0703144</a>
<li>Marta Tyran-Kaminska, "Convergence to Lévy stable processes under strong mixing conditions", <a href="http://arxiv.org/abs/0907.1185">arxiv:0907.1185</a>
<li>Sergio Venturini, Francesca Dominici, Giovanni Parmigiani, "Gamma
shape mixtures for heavy-tailed distributions", <cite>Annals of Applied
Statistics</cite> <strong>2</strong> (2008): 756--776
= <a href="http://arxiv.org/abs/0807.4663">arxiv:0807.4663</a>
<li>Yogesh Virkar, Aaron Clauset, "Power-law distributions in binned empirical data", <a href="http://arxiv.org/abs/1208.3524">arxiv:1208.3524</a>
<li>Nicholas Watkins, "Mandelbrot's 1/f fractional renewal models of 1963-67: The non-ergodic missing link between change points and long range dependence", <a href="http://dx.doi.org/10.1007/978-3-319-55789-2_14"><cite>Advances in Time Series Analysis and Forecasting</cite> ITISE 2016</a>, <a href="http://arxiv.org/abs/1603.00738">arxiv:1603.00738</a>
<li>Rafal Weron
<ul>
<li>"Estimating long range dependence: finite sample
properties and confidence intervals," <a
href="http://arxiv.org/abs/cond-mat/0103510">cond-mat/0103510</a>
<li>"Measuring long-range dependence in electricity prices,"
<a href="http://arxiv.org/abs/cond-mat/0103621">cond-mat/0103621</a>
</ul>
<li>T. S. T. Wong and W. K. Li, "A note on the estimation of extreme
value distributions using maximum product of spacings",
<a href="http://arxiv.org/abs/math.ST/0702830">math.ST/0702830</a>
<li><a href="http://galton.uchicago.edu/faculty/wu.html">Wei Biao
Wu</a>, Xiaofeng Shao, "Invariance principles for fractionally integrated
nonlinear
processes", <a href="http://arxiv.org/abs/math.PR/0608223">math.PR/0608223</a>
<li>Seokhoon Yun, "The Extremal Index of a Higher-Order
Stationary Markov Chain", <a href="http://projecteuclid.org/euclid.aoap/1028903534"><cite>The Annals of Applied Probability</cite>
<strong>8</strong> (1998): 408--437</a>
<li>Damian H. Zanette, "Zipf's law and city sizes: A short tutorial
review on multiplicative processes in urban
growth", <a href="http://arxiv.org/abs/0704.3170">arxiv:0704.3170</a>
<li>Ting Zhang, Hwai-Chung Ho, Martin Wendler, Wei Biao Wu, "Block Sampling under Strong Dependence", <a href="http://arxiv.org/abs/1312.5807">arxiv:1312.5807</a>
<li>Qiuye Zhao and Mitch Marcus, "Long-tail Distributions and Unsupervised Learning of Morphology" [<a href="http://www.seas.upenn.edu/~qiuye/papers/ZM2012_Coling.pdf">PDF</a>. Replaces Zipf distribution over words with a log-normal. Doesn't test whether that's a better fit, but claims to give nice results in other tasks.]
</ul>