Analysis of Network Data

15 Jun 2016 14:22

That is, of data in the form of networks --- I don't (as such) care about packet flow or other aspects of computer networks...

Things I wish I knew how to do: bootstrap a network, non-parametrically. (The model with a fixed degree sequence is a start, but what's the equivalent of the block bootstraps used for time series, which preserve dependence?) Cross-validation on networks. (You could say that link prediction is leave-one-out CV, but how about k-fold CV?) Estimate a distribution over networks by somehow smoothing an adjacency matrix. Compare networks to say if they came from the same distribution. — These may or may not be aspects of a single problem.

Community discovery is an important sub-topic, and I like exponential family random graph models, stochastic block models and graph limits enough to give them their own notebooks.

Although many of the relevant papers appear in the journal Social Networks, published by Elsevier, a company known to also publish advertising disguised as peer-reviewed scientific journals (e.g., The Australasian Journal of Bone and Joint Medicine), I know of no particular reason to believe that their findings are actually meretricious propaganda on behalf of a paying client. It would, however, be better if the community would shift to a journal whose publisher did not pollute the process of scientific communication whenever it was profitable to do so.

See also: Complex networks; Community discovery; Exponential families of random graph models; Graph Theory; Graph Sampling Algorithms; Graph Spectra; Homophily vs. influence; Inferring networks from non-network data; Joint modeling of texts and networks; Network comparison; Political networks; Power laws (for questions about "scale-free" networks); Relational learning; Social networks; Statistics in general; Statistics of structured data; Visualizing network data