Using Rocchio for text classification

  • Relevance feedback methods can be adapted for text categorization

    • As noted before, relevance feedback can be viewed as 2-class classification

      • Relevant vs. nonrelevant documents

  • Use standard tf-idf weighted vectors to represent text documents

  • For each category, compute a prototype vector by summing the tf-idf vectors of that category's training documents.

    • Prototype = centroid of the class members, up to a scale factor that cosine similarity ignores

  • Assign test documents to the category with the closest prototype vector based on cosine similarity.
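
The steps above can be sketched in a few lines of Python. This is a minimal illustration and not a reference implementation: the function names (tfidf_vectors, train_rocchio, classify), the raw-count tf with log(N/df) idf weighting, and the toy documents are all assumptions made for the example. Documents are assumed to be pre-tokenized lists of terms.

```python
import math
from collections import Counter, defaultdict


def tfidf_vectors(docs):
    """Return sparse tf-idf vectors (dicts) and the idf table for tokenized docs."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    # Assumed weighting: raw term count times log(N / df).
    idf = {term: math.log(n / count) for term, count in df.items()}
    vectors = [
        {term: tf * idf[term] for term, tf in Counter(doc).items()}
        for doc in docs
    ]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is all zeros."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def train_rocchio(docs, labels):
    """Build one prototype per category by summing its training vectors."""
    vectors, idf = tfidf_vectors(docs)
    prototypes = defaultdict(lambda: defaultdict(float))
    for vec, label in zip(vectors, labels):
        for term, weight in vec.items():
            prototypes[label][term] += weight
    return {label: dict(proto) for label, proto in prototypes.items()}, idf


def classify(tokens, prototypes, idf):
    """Assign a tokenized test document to the category with the closest prototype."""
    # Terms never seen in training have no idf and are dropped.
    vec = {term: tf * idf[term] for term, tf in Counter(tokens).items() if term in idf}
    return max(prototypes, key=lambda label: cosine(vec, prototypes[label]))


# Toy data, invented for illustration only.
train_docs = [
    ["ball", "goal", "team"],
    ["team", "score", "goal"],
    ["vote", "election", "party"],
    ["party", "vote", "law"],
]
train_labels = ["sports", "sports", "politics", "politics"]

prototypes, idf = train_rocchio(train_docs, train_labels)
print(classify(["goal", "team", "win"], prototypes, idf))
print(classify(["vote", "law"], prototypes, idf))
```

Because cosine similarity normalizes vector length, dividing each summed prototype by the number of documents in its class (giving the true centroid) would not change which category is chosen.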

