Intel® Optimized AI Frameworks
Receive community support for questions related to PyTorch* and TensorFlow* frameworks.
73 Discussions

Distribute XGBoost on multiple nodes

Misra
Novice
1,734 Views

Hi,

 

Does Intel's distribution of XGBoost (in the oneAPI AI Analytics Toolkit) support multi-node training? If so, can this experiment be executed on Devcloud? Is there any reference documentation available?

 

Regards,

Manjari

0 Kudos
8 Replies
JananiC_Intel
Moderator
1,709 Views

Hi,


Thanks for posting in Intel forum.


We will check on this and get back to you soon. 


For your information, currently we don't have any reference documentation for Intel XGBoost multi node training, we will let you know once we get an update regarding this.


Regards,

Janani Chandran


0 Kudos
JananiC_Intel
Moderator
1,670 Views

Hi,

 

For multi-node training with Intel XGBoost in DevCloud follow the steps mentioned in the below article.

 

https://medium.com/intel-analytics-software/distributed-xgboost-with-modin-on-ray-fc17edef7720

 

In DevCloud, multi-node computation is only available through the job queue.

 

Syntax for multi-node:

 

qsub -l nodes=<count>:ppn=2

 

You can combine a request for multiple nodes with a request for their specific features.

 

Try this and let us know the updates.

 

Regards,

Janani Chandran

 

0 Kudos
mmisra18
Beginner
1,627 Views
0 Kudos
JananiC_Intel
Moderator
1,646 Views

Hi,


Is your issue resolved? Do you have any update?


Regards,

Janani Chandran


0 Kudos
Misra
Novice
1,625 Views

Hi,

 

Thank you for your reply. I did try this method before. However, when I looked into OneAPI and DevCloud documentation, it mentioned that in order to distribute XGBoost training between multiple nodes I would have to use MPI communication for nodes to process in parallel.

I am unable to find documentation or any code sample that would explain it since I have no experience in MPI application programming.

Could you please let me know more about it? If this is the correct way or the way you mentioned is same as the above.

Thank you!

 

 

Regards,

Manjari Misra

0 Kudos
AthiraM_Intel
Moderator
1,591 Views

Hi,


Please find the below documentation for running basic mpi application in DevCloud.

You can find the sample code under "Distributed-Memory Architecture" topic.


link: https://devcloud.intel.com/oneapi/documentation/advanced-queue/


Hope this helps.



Thanks




0 Kudos
AthiraM_Intel
Moderator
1,549 Views

Hi,


We have not heard back from you. Could you please give us an update?



Thanks


0 Kudos
AthiraM_Intel
Moderator
1,509 Views

Hi,


We have not heard back from you. This thread will no longer be monitored by Intel. If you need further assistance, please post a new question.



Thanks.


0 Kudos
Reply