Intel® Edge Software Hub
Get help on building Edge Software Hub solutions for edge compute node
Announcements
Welcome to the Intel Community! If you like the answer to your question, please mark it as 'Accepted Solution' to help others.

The Edge Software Vision Package for Red Hat Enterprise Linux is now available here.

Learn more about developing on Intel® Hardware and Software here.
247 Discussions

tranining stops automatically in tls web ui

vkherde
Beginner
321 Views

Hi Team,

I am trying to train a new model using tls docker deployment, however the training stop unexpectedly without throwing any errors. Sharing the screenshots for the same.

I am using EIS on a Virtual Machine which has specs of

RAM: 4GB, ROM: 64GB

Labels (1)
0 Kudos
1 Solution
WengWai_C_Intel
Moderator
297 Views

Hi @vkherde ,

For deep learning model training, it is recommended to use 32GB RAM system or above. With 4GB of RAM, most likely will hit into memory resources issue. While the deep learning model training process is running, you can use steps below to look at the log.

1) Use command 'docker ps' to list the active containers are running.

2) You should see 'dlscore' container is running. Use command 'docker logs -f dlscore' to look at the log while the model training is running.

 

In addition, it is also recommended to use a native system instead of VM for deep learning model training to reduce overheads from VM. Feel free to try the above suggestion and feedback to us. Thanks! 

View solution in original post

1 Reply
WengWai_C_Intel
Moderator
298 Views

Hi @vkherde ,

For deep learning model training, it is recommended to use 32GB RAM system or above. With 4GB of RAM, most likely will hit into memory resources issue. While the deep learning model training process is running, you can use steps below to look at the log.

1) Use command 'docker ps' to list the active containers are running.

2) You should see 'dlscore' container is running. Use command 'docker logs -f dlscore' to look at the log while the model training is running.

 

In addition, it is also recommended to use a native system instead of VM for deep learning model training to reduce overheads from VM. Feel free to try the above suggestion and feedback to us. Thanks! 

Reply