It is really a 2-part question, so let me answer in parts.
Part 1: which DevCloud. This thread is posted on the forum for Intel DevCloud for the Edge, which is not designed for DNN training. It has compute nodes to edge inference. The better choice for training would be Intel DevCloud for Data-centric Workloads — it has compute nodes more suitable for training workloads.
Part 2: how to train in parallel. On Intel DevCloud for Data-centric workloads, you can train on multiple nodes in two ways: