Vector representation doesn’t consider the ordering of words in a document
“John is quicker than Mary” and “Mary is quicker than John” have the same vectors.
This is called the bag of words model.
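As a rough illustration, the sketch below builds term-count vectors for the two example sentences and checks that they come out identical. The helper name `bag_of_words` and the lowercase, whitespace-based tokenization are assumptions made for this example, not something the slides prescribe.

```python
from collections import Counter

def bag_of_words(text):
    """Map a text to its term counts; word order is discarded.

    Hypothetical helper: lowercasing and whitespace splitting are
    simplifying assumptions for this sketch.
    """
    return Counter(text.lower().split())

doc1 = "John is quicker than Mary"
doc2 = "Mary is quicker than John"

vec1 = bag_of_words(doc1)
vec2 = bag_of_words(doc2)

# Both documents contain the same terms with the same counts,
# so their bag-of-words representations are equal.
print(vec1 == vec2)
```

Since only the counts are kept, any reordering of the same words yields the same vector.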
In a sense, this is a step back: The positional index was able to distinguish these two documents.
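For contrast, here is a minimal sketch of why position-aware postings can tell the two sentences apart. It records, for each term, the positions at which it occurs in a single document. The function name `positional_postings` and the tokenization are assumptions for this example; a real positional index would also store document IDs.

```python
from collections import defaultdict

def positional_postings(text):
    """Map each term to the list of positions where it occurs.

    Hypothetical helper covering one document only; tokenization is
    plain lowercasing plus whitespace splitting.
    """
    positions = defaultdict(list)
    for pos, term in enumerate(text.lower().split()):
        positions[term].append(pos)
    return dict(positions)

idx1 = positional_postings("John is quicker than Mary")
idx2 = positional_postings("Mary is quicker than John")

# "john" is at position 0 in the first sentence and 4 in the second,
# so the two position maps differ even though the term counts match.
print(idx1["john"], idx2["john"])
print(idx1 == idx2)
```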
We will look at “recovering” positional information later in this course.
For now: bag of words model
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License