Extracting relevant information

  • The actual extraction of information from web sites is specified using standards such as XSL Transformations (XSLT).
  • Extracted information can be stored as structured data in XML format or databases.
  • However, wrappers do not scale well, because the extraction itself still depends on the format and layout of each web site.
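
The points above can be illustrated with a minimal sketch of a wrapper. It is only an illustration under stated assumptions: the two page fragments, the class names, and the helper names `extract_products` and `to_xml` are invented here, and Python's standard `xml.etree.ElementTree` path expressions stand in for an XSLT stylesheet. The wrapper extracts records from one layout, stores them as XML, and returns nothing once the same data is presented in a different layout.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page fragment (invented for illustration).
PAGE_V1 = """
<html><body>
  <div class="product"><h2>Laptop</h2><span class="price">999</span></div>
  <div class="product"><h2>Phone</h2><span class="price">499</span></div>
</body></html>
"""

# The same data after a hypothetical redesign: other tags and class names.
PAGE_V2 = """
<html><body>
  <table>
    <tr><td class="name">Laptop</td><td class="cost">999</td></tr>
    <tr><td class="name">Phone</td><td class="cost">499</td></tr>
  </table>
</body></html>
"""


def extract_products(page):
    """Wrapper whose extraction rules are hard-wired to the V1 layout."""
    root = ET.fromstring(page.strip())
    records = []
    # The paths below encode assumptions about tags and class attributes.
    for div in root.findall(".//div[@class='product']"):
        records.append({
            "name": div.findtext("h2"),
            "price": div.findtext("span[@class='price']"),
        })
    return records


def to_xml(records):
    """Store the extracted records as structured data in XML."""
    root = ET.Element("products")
    for record in records:
        item = ET.SubElement(root, "product")
        ET.SubElement(item, "name").text = record["name"]
        ET.SubElement(item, "price").text = record["price"]
    return ET.tostring(root, encoding="unicode")


v1_records = extract_products(PAGE_V1)  # layout the wrapper was written for
v2_records = extract_products(PAGE_V2)  # redesigned layout: rules no longer match
v1_xml = to_xml(v1_records)

print(v1_records)
print(v1_xml)
print(v2_records)
```

In this sketch the wrapper finds both products in the first layout and none in the second, although the underlying data is identical. Every new site or redesign would need its own set of rules, which is the scaling problem noted above.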
