We introduce the R API for MLflow, which is an open source platform for managing the machine learning lifecycle. We demonstrate each component of the platform–Tracking, Projects, and Models–and describe how they can be leveraged in practical data science workflows.
17. Efforts in the R ecosystem (excerpt)
mleap: MLeap integration for sparklyr for serializing Spark ML pipelines
tfruns: Track and Visualize Training Runs (for TF and Keras)
packrat: Dependency management system for R.
RStudio Connect: Native TF model deployment, arbitrary R models via plumber
RStudio Connect: Reproducible report publishing and sharing
mlflow: interface to MLflow
17 / 20
18. Efforts in the R ecosystem (excerpt)
mleap: MLeap integration for sparklyr for serializing Spark ML pipelines
tfruns: Track and Visualize Training Runs (for TF and Keras)
packrat: Dependency management system for R.
RStudio Connect: Native TF model deployment, arbitrary R models via plumber
RStudio Connect: Reproducible report publishing and sharing
mlflow: interface to MLflow
We likely won't ever solve everyone's problems with one
framework, but we should be able to standardise on 90% of
the problems and have good/generally accepted guidance
on the rest.
18 / 20
19. MLflow
Tracking: keep track of your parameters, notes, and metrics for experiments.
Project: bundle your project and environment so others can reproduce your results.
Model: serialize and package your scoring function for serving locally and on the cloud.
19 / 20
20. MLflow
Tracking: keep track of your parameters, notes, and metrics for experiments.
Project: bundle your project and environment so others can reproduce your results.
Model: serialize and package your scoring function for serving locally and on the cloud.
DEMO!
20 / 20