Current Slide

Small screen detected. You are viewing the mobile version of SlideWiki. If you wish to edit slides you will need to use a larger device.

BUC: Partitioning

  • Usually, entire data set can’t fit in main memory
  • Sort distinct values
    • partition into blocks that fit
  • Continue processing
  • Optimizations
    • Partitioning
      • External Sorting, Hashing, Counting Sort
    • Ordering dimensions to encourage pruning
      • Cardinality, Skew, Correlation
    • Collapsing duplicates
      • Can’t do holistic aggregates anymore!

Speaker notes:

Content Tools

Sources

There are currently no sources for this slide.