Community
cancel
Showing results for 
Search instead for 
Did you mean: 
Highlighted
New Contributor I
32 Views

Read the output and error stream of the executing job (working qpeek)

Unfortunately, system qpeek command does not work. I made some changes and it work for me now (file is attached). Unpack and place the file to $HOME/bin directory and 'chmod +x qpeek' it. To call it instead of system qpeek, place the follow code at  the beginning of $HOME/.bash_profile file:

# =========================================================================================
var_add_path(){
# Use var_add_path "VAR" "/path/to/smth" "s|e"
# s - at the beginning; e - at the end
# echo "Adding $2 to $1 at the <start|end> $3"
  tmp_var=$1
  if [ -d "$2" ]; then
    if [[ ":${!tmp_var}:" != *":$2:"* ]]; then
      # For non-empty, only add if not already there
      if [[ "$3" == "s" ]]; then
    [[ -z "${!tmp_var}" ]] && export $1=$2 || export $1=$2:${!tmp_var}
      else
    [[ -z "${!tmp_var}" ]] && export $1=$2 || export $1=${!tmp_var}:$2
      fi
    fi
  fi
}
# =========================================================================================

var_add_path "PATH" "$HOME/.local/bin" "s"
var_add_path "PATH" "$HOME/bin" "s"

After that you can use qpeek as described:

32955@login-1:~/scripts$ qstat -n

v-qsvr-1.aidevcloud: 
                                                                                  Req'd       Req'd       Elap
Job ID                  Username    Queue    Jobname          SessID  NDS   TSK   Memory      Time    S   Time
----------------------- ----------- -------- ---------------- ------ ----- ------ --------- --------- - ---------
445872.v-qsvr-1.aidevc  u32955      batch    tf_build_on_fpga   3105     1      2       --   24:00:00 R  00:39:41
   s001-n147/0-1
u32955@login-1:~/scripts$ ./qpeek -o 445872
Server: v-qsvr-1.aidevcloud
Full Job ID: 445872.v-qsvr-1.aidevcloud
Node:  s001-n147


########################################################################
#      Date:           Sat Dec 14 23:16:44 PST 2019
#    Job ID:           445872.v-qsvr-1.aidevcloud
#      User:           u32955
# Resources:           neednodes=1:fpga_compile:ppn=2,nodes=1:fpga_compile:ppn=2,walltime=24:00:00
########################################################################

Start compilation: Sun Dec 15 10:16:48 MSK 2019
Python version: 3.6.8
Python LIB path: /home/u32955/.local/lib/python3.6/site-packages
Tensorflow release: master
Bazel version: 1.1.0
Number of CPUs: 64
You have bazel 1.1.0 installed.
Preconfigured Bazel build configs. You can use any of the below by adding "--config=<>" to your build command. See .bazelrc for more details.
        --config=mkl            # Build with MKL support.
        --config=monolithic     # Config for mostly static monolithic build.
        --config=ngraph         # Build with Intel nGraph support.
        --config=numa           # Build with NUMA support.
        --config=dynamic_kernels        # (Experimental) Build kernels into separate shared objects.
        --config=v2             # Build TensorFlow 2.x instead of 1.x.
Preconfigured Bazel build configs to DISABLE default on features:
        --config=noaws          # Disable AWS S3 filesystem support.
        --config=nogcp          # Disable GCP support.
        --config=nohdfs         # Disable HDFS support.
        --config=nonccl         # Disable NVIDIA NCCL support.
Configuration finished

 

Tags (1)
0 Kudos
1 Reply
Highlighted
Employee
32 Views

Thank you so much for sharing the information.

We appreciate the efforts you undertook voluntarily.

0 Kudos