Current Slide

Small screen detected. You are viewing the mobile version of SlideWiki. If you wish to edit slides you will need to use a larger device.

Cosine Similarity

  • A document can be represented by thousands of attributes, each recording the frequency of a particular word (such as keywords) or phrase in the document.

  • Other vector objects: gene features in micro-arrays, …
  • Applications: information retrieval, biologic taxonomy, gene feature mapping, ...
  • Cosine measure: If d1 and d2 are two vectors (e.g., term-frequency vectors), then
                 cos(d1, d2) =  (d1 ⋅ d2) /||d1|| ||d2|| ,
       where ⋅ indicates vector dot product, ||d||: the length of vector d


Speaker notes:

Content Tools

Sources

There are currently no sources for this slide.