Skip to content

Latest commit

 

History

History
34 lines (18 loc) · 1.29 KB

README.md

File metadata and controls

34 lines (18 loc) · 1.29 KB

doAzureParallel Guide

This section will provide information about how Azure works, how best to take advantage of Azure, and best practices when using the doAzureParallel package.

  1. Azure Introduction (link)

    Using the Data Science Virtual Machine (DSVM) & Azure Batch

  2. Virtual Machine Sizes (link)

    How do you choose the best VM type/size for your workload?

  3. Autoscale (link)

    Automatically scale up/down your cluster to save time and/or money.

  4. Azure Limitations (link)

    Learn about the limitations around the size of your cluster and the number of foreach jobs you can run in Azure.

  5. Package Management (link)

    Best practices for managing your R packages across your Azure pool

  6. Distributing your Data (link)

    Best practices and limitations for working with distributed data.

  7. Parallelizing on each VM Core (link)

    Best practices and limitations for parallelizing your R code to each core in each VM in your pool

  8. Persistent Storage (link)

    Taking advantage of persistent storage for long-running jobs