2. Document Filtering
Filtering == Classification Problem
Data Mining Problem
EstimationClassification
Predication
Clustering
Description
Affinity Grouping
Document?
A set of feature
-> text document, image, etc.
3. Spam Filtering
Binary Classification Problem
‘Spam’ or ‘Ham’
Techniques
Naïve Bayesian Classifier
Support Vector Machine
Decision Tree
Rule vs. Model
4. Spam Filtering in Practice
Referred at: Sahil Puri1 et al, “COMPARISON AND ANALYSIS OF SPAM DETECTION ALGORITHMS”, 2013, IJAIEM
5. Referred at: Rene, “New insights into Gmail’s spam filtering”, 2012, emailmarketingtipps.de