Scalable Machine Learning with the Microsoft Machine Learning Server
In this course, you will learn how to create big data machine learning experiments using the Microsoft Machine Learning Server. Detailed code examples in both R and Python demonstrate how to scale your code and work with Apache Spark and SQL Server.
What you'll learn
Working with big data often exceeds the capacity of in-memory dataframes. In this course, Scalable Machine Learning with the Machine Learning Server, you will learn how to build scalable, end-to-end machine learning experiments using both R and Python using the Microsoft Machine Learning Server. First, you will learn how to import, process, transform, and visualize big data. Next, we will cover how to write custom, scalable, distributed functions which can be executed in a number of compute contexts. In addition, you will learn how to use the state of the art machine learning algorithms included in the MicrosoftML package. Then, we will integrate machine learning experiments into SQL Server. Finally, we will cover how to using the machine learning server with Hadoop and Spark, including integration with popular frameworks such as PySpark, SparkR and Sparklyr. We will spin up an HDInsight cluster in Microsoft Azure, and also build a Spark development environment from scracth. When you’re finished with this course, you will have the skills and knowledge needed to build scalable machine learning experiments using R and Python using XDF files, the Hadoop Distributed File System, SQL Server and Apache Spark.
Table of contents
- Module Overview 1m
- A Brief Introduction to Hadoop and Spark 4m
- Integrating the Machine Learning Server with Hadoop and Spark 3m
- Hadoop and Spark Pipelines with HDInsight 4m
- Setting up a Hadoop / Spark Development Environment 4m
- Hadoop and Spark Pipelines with Python and PySpark 4m
- Hadoop and Spark Pipelines with R and Sparklyr 4m