Lossless compression: All information is preserved.
What we mostly do in IR.
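As a small illustration of the lossless case, the sketch below stores a sorted postings list as gaps between consecutive docIDs and then reconstructs the original list exactly. Gap encoding is chosen here only as a familiar example; the function names `to_gaps` and `from_gaps` and the sample docIDs are assumptions made for this sketch, not something taken from the slides.

```python
def to_gaps(doc_ids):
    """Encode a sorted list of docIDs as the first ID followed by gaps."""
    gaps = []
    prev = 0
    for doc_id in doc_ids:
        gaps.append(doc_id - prev)
        prev = doc_id
    return gaps


def from_gaps(gaps):
    """Invert to_gaps by taking a running sum of the gaps."""
    doc_ids = []
    total = 0
    for gap in gaps:
        total += gap
        doc_ids.append(total)
    return doc_ids


postings = [824, 829, 215406]          # illustrative docIDs
gaps = to_gaps(postings)               # smaller numbers, cheaper to store
restored = from_gaps(gaps)             # identical to the original list
```

Because `restored` equals `postings`, no information was lost; only the representation changed.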
Lossy compression: Discard some information
Several of the preprocessing steps can be viewed as lossy compression: case folding, stop words, stemming, number elimination.
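The preprocessing steps listed above can be sketched as a single pipeline to show why they count as lossy: different input texts collapse to the same token list, so the original cannot be recovered. The stop word list and the `crude_stem` suffix stripper are deliberately tiny, hypothetical stand-ins (a real system would likely use a full stop list and a proper stemmer).

```python
import re

# Hypothetical, deliberately tiny stop word list for illustration only.
STOP_WORDS = {"the", "a", "of", "in", "and", "to", "is", "are"}


def crude_stem(token):
    """Strip a few common suffixes; a toy stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[:-len(suffix)]
    return token


def preprocess(text):
    tokens = re.findall(r"[A-Za-z0-9]+", text)
    tokens = [t.lower() for t in tokens]                  # case folding
    tokens = [t for t in tokens if not t.isdigit()]       # number elimination
    tokens = [t for t in tokens if t not in STOP_WORDS]   # stop word removal
    return [crude_stem(t) for t in tokens]                # stemming


a = preprocess("The 3 cats are chasing the dog in 2020")
b = preprocess("Cats chasing dogs")
```

Here `a` and `b` are the same token list, which is exactly the information loss the slide refers to.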
Chap/Lecture 7: Prune postings entries that are unlikely to turn up in the top k list for any query.
Almost no loss of quality for the top k list.
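A minimal sketch of the pruning idea, assuming each postings entry carries a precomputed weight and that "unlikely to turn up in the top k" is approximated by simply keeping the n highest-weighted entries per term. This heuristic, the `(doc_id, weight)` layout, and the names `prune_postings` and `top_k` are assumptions for illustration; the method covered in the lecture may differ.

```python
import heapq


def prune_postings(index, n):
    """Keep only the n highest-weighted (doc_id, weight) entries per term."""
    return {
        term: heapq.nlargest(n, postings, key=lambda p: p[1])
        for term, postings in index.items()
    }


def top_k(index, term, k):
    """Return the docIDs of the k highest-weighted postings for one term."""
    best = heapq.nlargest(k, index.get(term, []), key=lambda p: p[1])
    return [doc_id for doc_id, _ in best]


# Illustrative index: term -> list of (doc_id, weight)
index = {"ir": [(1, 0.9), (2, 0.1), (3, 0.5), (4, 0.05)]}
pruned = prune_postings(index, n=2)

full_result = top_k(index, "ir", k=2)
pruned_result = top_k(pruned, "ir", k=2)
```

For single-term queries with k no larger than n, the pruned index returns the same top k as the full one. For multi-term queries the results can differ, which is why the loss is only "almost" zero.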
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License