Thomas Girke (thomas.girke@ucr.edu)

Last update: 18 May, 2016

Alternative formats of this tutorial: Single-page .Rmd HTML, .Rmd, .R, Slide Show

Introduction

  • What is Clustering?
    • Clustering is the classification of data objects into similarity groups (clusters) according to a defined distance measure.
    • It is used in many fields, such as machine learning, data mining, pattern recognition, image analysis, genomics, systems biology, etc.
    • Machine learning typically regards data clustering as a form of unsupervised learning.
  • Why Clustering and Data Mining in R?}
    • Efficient data structures and functions for clustering
    • Reproducible and programmable
    • Comprehensive set of clustering and machine learning libraries
    • Integration with many other data analysis tools
  • Useful Links
Jump to: next_page