Current Slide

Small screen detected. You are viewing the mobile version of SlideWiki. If you wish to edit slides you will need to use a larger device.

Bag of words model

  • Vector representation doesn’t consider the ordering of words in a document

  • John is quicker than Mary and Mary is quicker than John have the same vectors

  • This is called the bag of words model.

  • In a sense, this is a step back: The positional index was able to distinguish these two documents.

  • We will look at “recovering” positional information later in this course.

  • For now: bag of words model


Speaker notes:

Content Tools

Sources

There are currently no sources for this slide.