Parsing a document

  • What format is it in?
    • pdf/word/excel/html?

  • What language is it in?

  • What character set is in use?


     Each of these is a classification problem, which we will study later in the course.

     But these tasks are often done heuristically …
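
As a rough illustration of what "heuristically" can mean here, the sketch below guesses format, character set, and language with simple hand-written rules rather than a trained classifier. The function names (`guess_format`, `guess_charset`, `guess_language`), the tiny stopword lists, and the Latin-1 fallback are assumptions made for this example, not something the slide prescribes. Real systems use far more signatures and larger word lists.

```python
import codecs

# Leading byte signatures ("magic numbers") for a few common formats.
MAGIC = [
    (b"%PDF-", "pdf"),
    (b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1", "ole2 (legacy word/excel)"),
    (b"PK\x03\x04", "zip (possibly docx/xlsx)"),
]

# Byte-order marks; UTF-32 must be checked before UTF-16 because the
# UTF-32 LE mark begins with the same two bytes as the UTF-16 LE mark.
BOMS = [
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]

# Deliberately tiny, illustrative stopword lists (an assumption of this sketch).
STOPWORDS = {
    "english": {"the", "and", "of", "is", "to", "in"},
    "german": {"der", "die", "und", "das", "ist", "nicht"},
    "french": {"le", "la", "et", "les", "des", "est"},
}


def guess_format(data: bytes) -> str:
    """Guess the file format from its first bytes."""
    for magic, name in MAGIC:
        if data.startswith(magic):
            return name
    head = data[:1024].lower()
    if b"<!doctype html" in head or b"<html" in head:
        return "html"
    return "unknown"


def guess_charset(data: bytes) -> str:
    """Guess the encoding: BOM first, then strict UTF-8, then a fallback."""
    for bom, name in BOMS:
        if data.startswith(bom):
            return name
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Latin-1 decodes any byte sequence, so this guess may be wrong.
        return "latin-1"


def guess_language(text: str) -> str:
    """Pick the language whose stopwords occur most often in the text."""
    words = text.lower().split()
    scores = {lang: sum(w in sw for w in words) for lang, sw in STOPWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "unknown"


if __name__ == "__main__":
    raw = b"<!DOCTYPE html><html><body>the cat is in the house</body></html>"
    print(guess_format(raw), guess_charset(raw))
    print(guess_language("der Hund ist nicht hier"))
```

Each rule is cheap and usually right on well-formed input, but fails on short, mixed-language, or mislabeled documents, which is why the same tasks can also be framed as classification problems.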

