This sample shows how to deploy a Ray cluster on an Azure Kubernetes Service (AKS) cluster using the KubeRay operator. The sample also shows how to run a simple Ray application on the cluster.
Clone the repository to your local machine. Then make sure you have all the prerequisites installed.
- An Azure subscription. If you don't have an Azure subscription, you can create a free account here.
- The Azure CLI installed on your local machine. You can install the Azure CLI by following the instructions here.
- The Azure Kubernetes Service Preview extension must be installed.
- Helm must be installed.
- Terraform client tools or OpenTofu must be installed. This guide makes use of Terrafrom, however the modules used should be compatible with OpenTofu.
- Clone the repository https://github.com/Azure-Samples/aks-ray-sample on to your local machine.
- Navigate to the directory you cloned the repository to.
- Enable execute permissions on the
deploy.sh
script by runningchmod +x deploy.sh
. - Run the
deploy.sh
script by running./deploy.sh
. This script will deploy the AKS cluster, install the KubeRay operator then submit a training job to run on the AKS cluster.