DevOps for Data Scientists: Containers for Data Science

placeholder

In this 16-video course explore the use of containers in deploying data science solutions by using Docker with R Python Jupyter and Anaconda. Begin with an introduction to containers and their use for deployment and data science. Then examine approaches to infrastructure as code for data deployment and concepts behind Ansible and Vagrant approaches to data science deployment. Explore the main features of provisioning tools used in data science. You will learn how to use Docker to build data models then use it to perform model testing for deployment to manage R deployments and for a PostgreSQL deployment. Also discover how to use Docker for persistent volumes. Next learners look at using Jupyter Docker Stacks to get up and running with Jupyter and using the Anaconda Distribution to run a Jupyter Notebook. This leads into using Jupyter Notebooks with a Cookiecutter data science project. Then learn about using Docker Compose with PostgreSQL and Jupyter Notebook and using a container deployment for Jupyter Notebooks with R. The concluding exercise involves deploying Jupyter.