Simple Discretization: Binning

  • Equal-width (distance) partitioning
    • Divides the range into N intervals of equal size: uniform grid
    • If A and B are the lowest and highest values of the attribute, the width of each interval is W = (B - A)/N.
    • The most straightforward approach, but outliers may dominate the presentation
    • Skewed data is not handled well
  • Equal-depth (frequency) partitioning
    • Divides the range into N intervals, each containing approximately the same number of samples
    • Good data scaling
    • Managing categorical attributes can be tricky
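
The two partitioning schemes above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names `equal_width_bins` and `equal_depth_bins` and the sample values are assumptions made for this example, and bins are numbered from 0.

```python
def equal_width_bins(values, n):
    """Assign each value to one of n equal-width intervals over [A, B]."""
    a, b = min(values), max(values)
    width = (b - a) / n  # W = (B - A) / N
    if width == 0:
        # All values are identical, so everything falls into bin 0.
        return [0] * len(values)
    # The maximum value would land in bin n, so clamp it into the last bin.
    return [min(int((v - a) / width), n - 1) for v in values]


def equal_depth_bins(values, n):
    """Assign each value to one of n bins holding roughly the same number of samples."""
    # Rank the samples by value, then cut the ranking into n equal slices.
    # Assumption: tied values are split by position, so equal values
    # may end up in neighbouring bins.
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * n // len(values)
    return bins


# Hypothetical sample with one outlier (100).
data = [1, 2, 3, 4, 100]
print(equal_width_bins(data, 2))  # [0, 0, 0, 0, 1]
print(equal_depth_bins(data, 2))  # [0, 0, 0, 1, 1]
```

With the outlier present, equal-width binning puts every ordinary value into the first interval and leaves the second almost empty, while equal-depth binning keeps the bin sizes balanced.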
