Virtual Cluster in Red Cloud

From CAC Documentation wiki
Jump to navigation Jump to search

This documentation describes how to deploy and use a virtual HPC cluster in Red Cloud. The virtual cluster uses slurm scheduler to dynamically launch Red Cloud instances to run batch or interactive jobs submitted to the batch queue, and delete the instances when the jobs complete.

Deploy Virtual Cluster

DYI

Red Cloud subscribers can roll their own virtual clusters using the Ansible playbooks and instriuctions in the Slurm Cluster in Openstack GitHub repo. The playbooks will deploy a single user virtual cluster in Red Cloud.

Need Help or Additional Features

If you need help deploying the cluster or need additional features (e.g. multi-user login, custom software installation, file system export via SMB or Globus etc.), email CAC Help with your requirements. Consulting rates will apply in addition to Red Cloud subscriptions for custom work on your virtual cluster.

Log into the Head Node

ssh to the IP address or hostname of the virtual cluster head node. If you rolled your own cluster using the Slurm Cluster in Openstack repo, ssh as image_init_user with the ssh private key ssh_private_keyfile as defined in vars/main.yml.

If CAC deployed the virtual cluster with multi-user capability, ssh to the IP address or hostname of the virtual cluster head node using your CAC user name and password.

Launch Compute Nodes via Slurm

Batch Job

Interactive Session

Compute Node Status