Data Mining

11 Oct 2018 13:34

I've taught a course on this, so I ought to be able to describe it, oughtn't I? Data mining, more stuffily "knowledge discovery in databases", is the art of finding and extracting useful patterns in very large collections of data. It's not quite the same as machine learning, because, while it certainly uses ML techniques, the aim is to directly guide action (praxis!), rather than to develop a technology and theory of induction. In some ways, in fact, it's closer to what statistics calls "exploratory data analysis", though with certain advantages and limitations that come from having really big data to explore.

Kernel methods probably deserve their own notebook.

Ethical and political issues in data mining definitely deserve their own notebook.

See also: Clinical and Actuarial Compared; Clustering; Statistics for Structured Data; Text Mining