The Curse of Dimensionality

  • Data in only one dimension is relatively densely packed
  • Adding a dimension “stretches” the points across that dimension, moving them farther apart
  • Each additional dimension spreads the points out further, so high-dimensional data is extremely sparse
  • Distance measures become nearly meaningless: in high dimensions, all points tend to be almost equidistant from one another
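
The equidistance effect in the bullets above can be checked numerically. The following is a minimal sketch, not taken from the original slides: it assumes points drawn uniformly from the unit hypercube and Euclidean distance, and the helper name relative_contrast, the sample size, and the seed are all illustrative choices. It measures how much farther the farthest point is than the nearest one, relative to the nearest distance.

```python
import math
import random


def relative_contrast(dim, n_points=200, seed=0):
    """Return (d_max - d_min) / d_min for distances from one random
    query point to n_points random points, all uniform in [0, 1]^dim.

    Assumption for illustration: uniform data and Euclidean distance.
    """
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))  # Euclidean distance
    nearest, farthest = min(dists), max(dists)
    return (farthest - nearest) / nearest


if __name__ == "__main__":
    # The contrast should shrink as the number of dimensions grows.
    for dim in (1, 2, 10, 100, 1000):
        print(f"dim={dim:5d}  relative contrast={relative_contrast(dim):.3f}")
```

A contrast close to zero means the nearest and farthest neighbours are at almost the same distance, which is the sense in which nearest-neighbour queries lose meaning. Under these assumptions the value should fall steadily as dim increases; other data distributions or distance functions may behave differently.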

graphs adapted from Parsons et al. KDD Explorations 2004