Each document is a vector, one component for each term (= word).
Vectors are normally normalized to unit length.
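A minimal sketch of the idea above, assuming whitespace tokenization, raw term counts as components, and Euclidean (L2) length for the normalization. The slide does not specify any of these choices, and the function names and sample documents are hypothetical.

```python
import math
from collections import Counter

def build_vocabulary(docs):
    """Assign each distinct term an axis index (sorted for determinism)."""
    terms = sorted({term for doc in docs for term in doc.lower().split()})
    return {term: i for i, term in enumerate(terms)}

def to_unit_vector(doc, vocab):
    """Turn a document into a term-count vector scaled to unit L2 length."""
    counts = Counter(doc.lower().split())
    vec = [0.0] * len(vocab)
    for term, count in counts.items():
        if term in vocab:  # terms outside the vocabulary are ignored
            vec[vocab[term]] = float(count)
    norm = math.sqrt(sum(x * x for x in vec))
    # An empty document has no direction; leave it as the zero vector.
    return [x / norm for x in vec] if norm > 0 else vec

# Hypothetical sample documents, for illustration only.
docs = ["the cat sat on the mat", "the dog chased the cat"]
vocab = build_vocabulary(docs)
vectors = [to_unit_vector(d, vocab) for d in docs]
```

After normalization, a long document and a short document with the same term proportions map to the same point, so document length no longer dominates comparisons.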
High-dimensional vector space:
Terms are axes
10,000+ dimensions, or even 100,000+
Docs are vectors in this space
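Because each document uses only a small fraction of the terms, most components of its vector are zero. One common way to handle this, sketched here as an assumption rather than something the slide prescribes, is to store only the non-zero components. The function name and sample documents are hypothetical.

```python
from collections import Counter

def to_sparse_vector(doc):
    """Keep only the non-zero components, as a {term: count} mapping."""
    return dict(Counter(doc.lower().split()))

# Hypothetical sample documents, for illustration only.
sample_docs = ["the cat sat on the mat", "the dog chased the cat"]
sparse_docs = [to_sparse_vector(d) for d in sample_docs]

# The dimensionality of the space is the number of distinct terms (axes).
dimensions = len({term for vec in sparse_docs for term in vec})

# Each document occupies only a few of those axes.
nonzero_per_doc = [len(vec) for vec in sparse_docs]
```

With a realistic vocabulary of 10,000+ terms, the gap between `dimensions` and the entries in `nonzero_per_doc` becomes very large, which is why sparse storage is usually preferred over dense arrays.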
How can we do classification in this space?
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License