BOAT (Bootstrapped Optimistic Algorithm for Tree Construction)

  • Uses a statistical technique called bootstrapping to create several smaller samples (subsets) of the data, each of which fits in memory
  • Each subset is used to create a tree, resulting in several trees
  • These trees are examined and used to construct a new tree T’
    • It turns out that T’ is very close to the tree that would be generated using the whole data set together
  • Advantages: requires only two scans of the database, and is an incremental algorithm (the tree can be updated as data is inserted or deleted)
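
The idea above can be illustrated with a deliberately simplified sketch. It is not the full BOAT algorithm: it assumes a single numeric attribute, binary labels (0/1), and a one-level tree (a single split). Bootstrap samples drawn from an in-memory subset each yield a split point, the range of those split points forms a coarse interval, and one pass over the full data picks the exact split inside that interval. All function names (`best_split`, `refine_split`, `boat_like_split`) and default parameters are hypothetical and chosen for illustration only.

```python
import random

def gini_counts(c0, c1):
    """Gini impurity of a node holding c0 rows of class 0 and c1 of class 1."""
    n = c0 + c1
    if n == 0:
        return 0.0
    p = c1 / n
    return 2.0 * p * (1.0 - p)

def best_split(rows):
    """In-memory search: threshold t minimizing weighted Gini for x <= t."""
    rows = sorted(rows)
    best_t, best_score = None, float("inf")
    for i in range(1, len(rows)):
        if rows[i][0] == rows[i - 1][0]:
            continue  # cannot split between equal attribute values
        ones_left = sum(y for _, y in rows[:i])
        ones_right = sum(y for _, y in rows[i:])
        score = (i * gini_counts(i - ones_left, ones_left)
                 + (len(rows) - i) * gini_counts(len(rows) - i - ones_right, ones_right)) / len(rows)
        if score < best_score:
            best_t = (rows[i - 1][0] + rows[i][0]) / 2
            best_score = score
    return best_t

def refine_split(data, lo, hi):
    """One pass over the full data: keep only class counts outside [lo, hi],
    hold the rows inside the interval in memory, and pick the best threshold
    among lo and the attribute values that fall inside the interval."""
    below, above, inside = [0, 0], [0, 0], []
    for x, y in data:
        if x < lo:
            below[y] += 1
        elif x > hi:
            above[y] += 1
        else:
            inside.append((x, y))
    inside.sort()
    left = list(below)
    right = [above[0] + sum(1 for _, y in inside if y == 0),
             above[1] + sum(1 for _, y in inside if y == 1)]
    n = sum(left) + sum(right)

    def score():
        return (sum(left) * gini_counts(*left) + sum(right) * gini_counts(*right)) / n

    best_t, best_score = lo, score()
    for i, (x, y) in enumerate(inside):
        left[y] += 1
        right[y] -= 1
        if i + 1 < len(inside) and inside[i + 1][0] == x:
            continue  # evaluate only after all rows with this x have moved left
        s = score()
        if s < best_score:
            best_t, best_score = x, s
    return best_t

def boat_like_split(data, n_trees=10, sample_size=200, seed=0):
    """Sampling pass, bootstrap splits, then one refinement pass over data."""
    rng = random.Random(seed)
    sample = rng.sample(data, min(sample_size, len(data)))  # fits in memory
    thresholds = []
    for _ in range(n_trees):
        boot = [rng.choice(sample) for _ in sample]  # resample with replacement
        t = best_split(boot)
        if t is not None:
            thresholds.append(t)
    if not thresholds:
        raise ValueError("no bootstrap sample produced a split")
    lo, hi = min(thresholds), max(thresholds)
    return lo, hi, refine_split(data, lo, hi)
```

A call such as `lo, hi, t = boat_like_split(data)` on a list of `(x, label)` pairs returns the coarse interval and the refined threshold. This sketch simply trusts the interval; it has no check for the case where the best split on the full data lies outside `[lo, hi]`.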

