Presentation on building a machine learning pipeline for hunting criminals. The pipeline uses Spark Streaming to process input data and extract message- and transaction-level features, and Python-based open source libraries to extract user-level features from time series, graphs, and unstructured data. These features are then used to train a classifier against agent feedback.
6. User Analysis Iteration
[Diagram: email, user graph, and transaction time series feed NLP features, graph features, and time series features; the features and agent feedback feed a train/test classifier step.]
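The user-level flow on this slide could be sketched roughly as follows. This is a minimal, standard-library-only illustration, not the code from the talk's notebooks: the feature choices, the `SUSPICIOUS_TERMS` keyword list, the toy users, and the nearest-centroid classifier (a stand-in for whatever classifier the talk actually trained) are all assumptions made for the example.

```python
import math
from collections import Counter
from statistics import mean, pstdev

# Hypothetical keyword list, chosen only for this illustration.
SUSPICIOUS_TERMS = ("urgent", "wire", "gift")

def time_series_features(amounts):
    """Mean, population std dev and max of one user's transaction amounts."""
    if not amounts:
        return [0.0, 0.0, 0.0]
    return [mean(amounts), pstdev(amounts), max(amounts)]

def graph_features(user, edges):
    """Out-degree, in-degree and distinct neighbour count in a directed edge list."""
    out_deg = sum(1 for src, _ in edges if src == user)
    in_deg = sum(1 for _, dst in edges if dst == user)
    neighbours = {d for s, d in edges if s == user} | {s for s, d in edges if d == user}
    return [float(out_deg), float(in_deg), float(len(neighbours))]

def nlp_features(emails):
    """Share of tokens across a user's emails that match SUSPICIOUS_TERMS."""
    tokens = Counter(w.strip(".,!?:;").lower() for text in emails for w in text.split())
    total = sum(tokens.values())
    hits = sum(tokens[t] for t in SUSPICIOUS_TERMS)
    return [hits / total if total else 0.0]

def user_vector(user, amounts, edges, emails):
    """Concatenate the three feature groups into one user-level vector."""
    return time_series_features(amounts) + graph_features(user, edges) + nlp_features(emails)

def train_centroids(vectors, labels):
    """Nearest-centroid stand-in: average the feature vectors of each label."""
    grouped = {}
    for vec, label in zip(vectors, labels):
        grouped.setdefault(label, []).append(vec)
    return {label: [mean(col) for col in zip(*vecs)] for label, vecs in grouped.items()}

def predict(centroids, vec):
    """Return the label whose centroid is closest (features are left unscaled here)."""
    return min(centroids, key=lambda label: math.dist(centroids[label], vec))

# Toy data: a directed user graph, plus (transaction amounts, emails) per user.
edges = [("alice", "bob"), ("alice", "carol"), ("mallory", "alice"),
         ("mallory", "bob"), ("mallory", "carol"), ("mallory", "dave")]
users = {
    "alice": ([20.0, 35.0, 25.0], ["Lunch on Friday?"]),
    "bob": ([15.0, 10.0], ["See you at the meeting."]),
    "mallory": ([900.0, 950.0, 990.0], ["Urgent: wire the money today!"]),
    "dave": ([880.0, 1000.0], ["Urgent wire needed, send gift cards."]),
}
feedback = {"alice": 0, "bob": 0, "mallory": 1}  # agent labels, 1 = flagged

train_users = list(feedback)
vectors = [user_vector(u, users[u][0], edges, users[u][1]) for u in train_users]
centroids = train_centroids(vectors, [feedback[u] for u in train_users])

# Score a user the agents have not labelled yet.
dave_label = predict(centroids, user_vector("dave", users["dave"][0], edges, users["dave"][1]))
print("dave ->", dave_label)
```

In a real iteration, the agents' verdicts on newly scored users would be added to `feedback` and the classifier retrained. Scaling the features before computing distances would also matter, since raw transaction amounts dominate the distance in this sketch.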
7. Thank You!
Notebooks for this talk are freely available
David.Talby@atigeo.com
Claudiu.Branzan@atigeo.com
Try xPatterns Connect at: http://xpatterns.com/connect/