lectio-uninapoli-2021

</link>

Azure Machine Learning Studio

Contents

What is Azure ML Studio (workspace)

Empower developers and data scientists with a wide range of productive experiences for building, training, and deploying machine learning models faster. Accelerate time to market and foster team collaboration with industry-leading MLOps—DevOps for machine learning. Innovate on a secure, trusted platform, designed for responsible AI.

image

Azure Machine Learning Studio portal view

Azure Machine Learning can be used for any kind of machine learning, from classical ml to deep learning, supervised, and unsupervised learning. Whether you prefer to write Python or R code or zero-code/low-code options such as the designer, you can build, train, and track highly accurate machine learning and deep-learning models in an Azure Machine Learning Workspace. Start training on your local machine and then scale out to the cloud. The service also interoperates with popular open-source tools, such as PyTorch, TensorFlow, and scikit-learn.

Azure Machine Learning provides all the tools developers and data scientists need for their machine learning workflows, including:

The Azure Machine Learning designer: drag-n-drop modules to build your experiments and then deploy pipelines.

image

Azure Machine Learning Designer view

Differences with Classic version of the platform

Quick comparison    
  ML Studio Classic Azure ML Studio (workspace)
Drag and drop interface Supported Supported
Code SDKs Unsupported Fully integrated
Experiment Scalable (10-GB limit) Scale with compute target
Training compute targets Proprietary compute target, CPU support only Wide range of customizable training compute targets. Both GPU and CPU
Deployment compute targets Proprietary web service format, not customizable Wide range of customizable deployment compute targets. Includes GPU and CPU support
ML Pipeline Not supported Build flexible, modular pipelines to automate workflows
MLOps Basic model management and deployment Entity versioning (model, data, workflows), workflow automation, integration with CI/CD tooling
Model format Proprietary format, Studio (classic) only Multiple supported formats depending on training job type
Automated model training and hyperparameter tuning Not supported Supported. Code-first and no-code options
Data drift detection Not supported Supported
Data labeling projects Not supported Supported

Why do we need it

Nowadays Machine Learning is achieved by using a set of different and unrelated technologies, tools, languages, storage and compute devices that are often manually combined together.

Azure Machine Learning Studio fills this gap by integrating and orchestrating everything together with a functional web interface.

It allows the data scientists to use all the open source technologies that are today standard in Machine Learning.

It helps reduce the cost by paying only the effective compute time used.

How does it work

The standard flow used to implement your machine learning task follows these steps:

Compute

One ore more compute virtual hardware are created according to your needs. There four different type

Compute Instances

This is the base compute unit

image

Compute istance creation

Customization options are

image

Virtual Machine size

Compute Clusters

For heavy workloads, cluster of the same size of virtual machine can be created. You can choose minimum and maximum number of nodes. In order to minimize costs, an “Idle seconds before scale down” option can be sets

image

Virtual Machine size

Inference Clusters

Docker, Container and Kubernetes are spreading everywhere, especially in Machine Learning, where high workload may be needed for short periods.

image

Kubernetes cluster options

Attached Compute

If you already have any compute instance inside Azure, but created outside Machine Learning Studio already configured, they can be attached with few seconds

Storage

Machine Learning Studio, form an Azure organization point of view, is a Resource Group and, as the name states, it’s a group where different type of cloud resource can be created. Since there can’t be any Machine Learning without any data, a default storage account is created behind where the data scientist can update his data in may forms.

Storage is also used to store the output of experiment, with both data and images.

Notebooks

Notebooks is the section of Azure Machine Learning Studio where you’ll spent most of your time. User file, divided in per-user folder, allows browsing the previously mentioned storage where all your project’s file are stored. There’s a folder, named “tutorial” where a collection of sample projects are present that will help you familiarize with the platform.
Open a notebook file with .ipynb extension and choose a compute unit previously created

image

Notebook inside Machine Learning Studio

It’s possibile to edit the notebooks directly in the Machine Learning studio, but there’re better ways to work with them

Work with Computes

JupyterLab

The compute istance is always a Linux virtual machine with embedded all the tools and framework described in the introduction. One of these tools is JupyterLab that enables you to work with documents and activities such as Jupyter notebooks, text editors, terminals, and custom components in a flexible, integrated, and extensible manner.

image

Jupyter Lab interface

You can arrange multiple documents and activities side by side in the work area using tabs and splitters. Documents and activities integrate with each other, enabling new workflows for interactive computing, for example:

JupyterLab also offers a unified model for viewing and handling data formats. JupyterLab understands many file formats (images, CSV, JSON, Markdown, PDF, Vega, Vega-Lite, etc.) and can also display rich kernel output in these formats. See File and Output Formats for more information.

To navigate the user interface, JupyterLab offers customizable keyboard shortcuts and the ability to use key maps from vim, emacs, and Sublime Text in the text editor.

JupyterLab extensions can customize or enhance any part of JupyterLab, including new themes, file editors, and custom components.

JupyterLab is served from the same server and uses the same notebook document format as the classic Jupyter Notebook.

Command Line

image

Jupyter Lab Command Line

References

Agenda

  1. Presentation :clock1030: (10:30)
  2. Introduction (10:45)
  3. Azure Machine Learning Studio :clock1130: (11:30)
  4. Demo :clock12: (12:00)
  5. Where to go next (12:20)
  6. Q&A (12:25)