Current Slide

Small screen detected. You are viewing the mobile version of SlideWiki. If you wish to edit slides you will need to use a larger device.

Similarity Metrics

  • Nearest neighbor method depends on a similarity (or distance) metric.

  • Simplest for continuous m-dimensional instance space is Euclidean distance.

  • Simplest for m-dimensional binary instance space is Hamming distance (number of feature values that differ).

  • For text, cosine similarity of tf.idf weighted vectors is typically most effective.


Speaker notes:

Content Tools

Sources

There are currently no sources for this slide.