Current Slide

Small screen detected. You are viewing the mobile version of SlideWiki. If you wish to edit slides you will need to use a larger device.

Automatic Thesaurus Generation

  • Attempt to generate a thesaurus automatically by analyzing the collection of documents

  • Fundamental notion: similarity between two words

  • Definition 1: Two words are similar if they co-occur with similar words.

  • Definition 2: Two words are similar if they occur in a given grammatical relation with the same words.

  • You can harvest, peel, eat, prepare, etc. apples and pears, so apples and pears must be similar.

  • Co-occurrence based is more robust, grammatical relations are more accurate.  

        ↑

    Why?


Speaker notes:

Content Tools

Sources

There are currently no sources for this slide.