Limitations of the current Web - Extracting relevant information

  • The actual extraction of information from web sites is specified using standards such as XSL Transformations (XSLT) [1].
  • Extracted information can be stored as structured data in XML format or in databases.
  • However, using wrappers does not really scale, because the extraction itself still depends on each web site's format and layout.
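
The scaling problem can be illustrated with a minimal sketch. It uses path expressions from Python's standard xml.etree.ElementTree module rather than XSLT itself, and the two page snippets, the element names, and the function wrapper_for_layout_a are hypothetical, made up for this illustration. Both pages state the same facts, but the wrapper only understands the layout it was written for.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: two sites publishing the same facts in different layouts.
PAGE_A = """<html><body>
<table id="products">
<tr><td class="name">Laptop</td><td class="price">999</td></tr>
<tr><td class="name">Phone</td><td class="price">499</td></tr>
</table>
</body></html>"""

PAGE_B = """<html><body>
<ul>
<li><span class="title">Laptop</span> costs <b>999</b></li>
<li><span class="title">Phone</span> costs <b>499</b></li>
</ul>
</body></html>"""


def wrapper_for_layout_a(page):
    """Extract product records, assuming the table layout used by PAGE_A."""
    root = ET.fromstring(page)
    records = []
    # The path expression hard-codes the table id and the cell classes.
    for row in root.findall(".//table[@id='products']/tr"):
        name = row.find("td[@class='name']")
        price = row.find("td[@class='price']")
        records.append({"name": name.text, "price": int(price.text)})
    return records


records_a = wrapper_for_layout_a(PAGE_A)  # structured records from layout A
records_b = wrapper_for_layout_a(PAGE_B)  # layout B has no matching table

print(records_a)
print(records_b)
```

Applied to the second page, the same wrapper finds no matching rows and returns an empty list, so a separate wrapper would be needed for every layout, and again whenever a site changes its layout.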

 

    [1] http://www.w3.org/TR/xslt
