Current Slide

Small screen detected. You are viewing the mobile version of SlideWiki. If you wish to edit slides you will need to use a larger device.

High Dimensional Data

  • Pictures like the one at right are absolutely misleading!

  • Documents are zero along almost all axes

  • Most document pairs are very far apart (i.e., not strictly orthogonal, but only share very common words and a few scattered others)

  • In classification terms: often document sets are separable, for most any classification

  • This is part of why linear classifiers are quite successful in this domain






Speaker notes:

Content Tools

Sources

There are currently no sources for this slide.