Data Reduction Strategies

  • Data reduction: obtain a reduced representation of the data set that is much smaller in volume yet produces the same (or almost the same) analytical results
  • Why data reduction? A database or data warehouse may store terabytes of data, so complex data analysis can take a very long time to run on the complete data set.
  • Data reduction strategies
    • Dimensionality reduction, e.g., remove unimportant attributes
      • Wavelet transforms
      • Principal Components Analysis (PCA)
      • Feature subset selection, feature creation
    • Numerosity reduction (sometimes simply called data reduction)
      • Regression and Log-Linear Models
      • Histograms, clustering, sampling
      • Data cube aggregation
    • Data compression
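
The three strategy families above can be illustrated with a minimal Python sketch that uses only the standard library. Everything here is an illustrative assumption rather than material from the slide: the function names (`drop_low_variance`, `sample_rows`, `equal_width_histogram`, `compress_text`), the synthetic data set, and the choice of techniques. In particular, the variance filter is only one very simple form of feature subset selection; it is not PCA or a wavelet transform.

```python
import random
import statistics
import zlib


def drop_low_variance(rows, threshold=0.0):
    """Dimensionality reduction: keep only attributes (columns) whose
    population variance exceeds `threshold`."""
    columns = list(zip(*rows))
    keep = [i for i, col in enumerate(columns)
            if statistics.pvariance(col) > threshold]
    return [[row[i] for i in keep] for row in rows], keep


def sample_rows(rows, n, seed=0):
    """Numerosity reduction: simple random sample without replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    return rng.sample(rows, n)


def equal_width_histogram(values, bins):
    """Numerosity reduction: replace raw values with per-bucket counts."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # avoid a zero width for constant data
    counts = [0] * bins
    for v in values:
        # Clamp so the maximum value falls into the last bucket.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


def compress_text(text):
    """Data compression: lossless encoding of the serialized data."""
    return zlib.compress(text.encode("utf-8"))


# Hypothetical data set: 1000 rows, 3 attributes; the middle one is constant.
rows = [[i, 5.0, (i * 7) % 11] for i in range(1000)]

reduced_rows, kept_columns = drop_low_variance(rows)   # constant column dropped
sample = sample_rows(rows, 100)                        # 10% of the rows
histogram = equal_width_histogram([r[0] for r in rows], bins=10)

serialized = "\n".join(",".join(map(str, r)) for r in rows)
compressed = compress_text(serialized)
```

Each function trades detail for volume in a different way: the variance filter removes attributes, sampling and the histogram reduce the number of stored values, and compression shrinks the encoding without changing the data itself.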
