BigML is the first Machine Learning service offering Association Discovery on the cloud! With these slides you can learn how to use Association Discovery and other new features such as Partial Dependence Plots, Logistic Regression, Correlations, Statistical Tests and Flatline Editor.
9. BigML Inc Fall 2015 Release 9
Market Basket Analysis
• Dataset of 9,834 grocery cart transactions
• Each row is a list of all items in a cart at checkout
GOAL: Discover “interesting” rules about which store items
are typically purchased together.
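To make the goal above concrete, here is a minimal sketch of how candidate rules between pairs of items can be scored with support, confidence and lift. The toy carts, the `pair_rules` helper and the `min_support` threshold are all hypothetical, chosen for illustration; this is not BigML's Association Discovery algorithm, only a small stand-in for the general idea.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy carts; the dataset on the slide has 9,834 transactions.
carts = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"beer", "chips"},
    {"bread", "butter"},
    {"milk", "chips"},
]

def pair_rules(transactions, min_support=0.3):
    """Score every rule lhs -> rhs between two single items."""
    n = len(transactions)
    item_counts = Counter(item for t in transactions for item in t)
    pair_counts = Counter(
        pair for t in transactions for pair in combinations(sorted(t), 2)
    )
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # fraction of carts containing both items
        if support < min_support:
            continue
        for lhs, rhs in ((a, b), (b, a)):
            confidence = count / item_counts[lhs]  # estimate of P(rhs | lhs)
            lift = confidence / (item_counts[rhs] / n)  # > 1: positive association
            rules.append((lhs, rhs, support, confidence, lift))
    # Highest lift first.
    return sorted(rules, key=lambda r: r[4], reverse=True)

rules = pair_rules(carts)
for lhs, rhs, support, confidence, lift in rules:
    print(f"{lhs} -> {rhs}: support={support:.2f} "
          f"confidence={confidence:.2f} lift={lift:.2f}")
```

Lift compares how often the two items appear together with what would be expected if they were bought independently, which is one common way to decide whether a rule is "interesting".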
Logistic Regression
DATASET → LOGISTIC REGRESSION
• Classification algorithm
• Categorical: one-hot encoded
• Text: mapped to token freq
• Bindings support local model
• L1/L2 regularization
• Currently API only
https://bigml.com/developers/logisticregressions
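As a rough illustration of two of the points above (one-hot encoding of categorical fields and L2 regularization), the following sketch trains a tiny logistic regression with plain gradient descent. The toy rows, the field values and the helpers `encode`, `fit` and `predict_proba` are hypothetical; this is not the BigML implementation or its API, and the text-to-token-frequency mapping is not shown.

```python
import math

# Hypothetical toy rows: (numeric field, categorical field) with binary labels.
rows = [(1.0, "red"), (2.0, "blue"), (3.0, "red"),
        (4.0, "green"), (5.0, "blue"), (6.0, "green")]
labels = [0, 0, 0, 1, 1, 1]

categories = sorted({cat for _, cat in rows})  # ['blue', 'green', 'red']

def encode(row):
    """Numeric value followed by a one-hot vector for the category."""
    value, cat = row
    return [value] + [1.0 if cat == c else 0.0 for c in categories]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(weights, bias, features):
    """Probability of class 1 for one encoded row."""
    return sigmoid(bias + sum(w * x for w, x in zip(weights, features)))

def fit(X, y, l2=0.01, lr=0.1, epochs=2000):
    """Batch gradient descent on the L2-regularized log loss."""
    weights, bias = [0.0] * len(X[0]), 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(weights), 0.0
        for features, target in zip(X, y):
            error = predict_proba(weights, bias, features) - target
            grad_b += error / n
            for j, x in enumerate(features):
                grad_w[j] += error * x / n
        # The L2 penalty shrinks weights toward zero; the bias is not penalized.
        weights = [w - lr * (g + l2 * w) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b
    return weights, bias

X = [encode(r) for r in rows]
weights, bias = fit(X, labels)
p_low = predict_proba(weights, bias, encode((1.0, "red")))
p_high = predict_proba(weights, bias, encode((6.0, "green")))
print(f"P(class 1 | 1.0, red)   = {p_low:.3f}")
print(f"P(class 1 | 6.0, green) = {p_high:.3f}")
```

An L1 penalty would instead add a term proportional to the sign of each weight, which tends to push some weights to exactly zero.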
BigML Classifiers
Single Tree
• Advantages: easy to interpret, robust to missing data
• Disadvantages: overfitting
Ensemble
• Advantages: top performer, robust to missing data
• Disadvantages: hard to interpret
Logistic Regression
• Advantages: robust to noise, outputs probability
• Disadvantages: no missing data, hard to interpret