Data Mining

17 May 2021 10:41

I've taught a course on this, so I ought to be able to describe it, oughtn't I? Data mining, more stuffily "knowledge discovery in databases", is the art of finding useful patterns in very large collections of data and extracting them. It's not quite the same as machine learning, because, while it certainly uses ML techniques, the aim is to directly guide action (praxis!), rather than to develop a technology and theory of induction. In some ways, in fact, it's closer to what statistics calls "exploratory data analysis", though with certain advantages and limitations that come from having really big data to explore.

Kernel methods get their own notebook.

Ethical and political issues in data mining definitely deserve their own notebook.

See also: Clinical and Actuarial Compared; Clustering; Recommender Systems and Collaborative Filtering; Statistics for Structured Data; Text Mining; Variable and Feature Selection for Regression and Classification