Oversubscription of OpenMP threads for processing small data sets

SergeyKostrov
Valued Contributor II
*** Oversubscription of OpenMP threads for processing small data sets ***
[ Test Case 36 - 64-bit / Cores: 1 / CPUs: 2 - Number of OpenMP threads - 1 ]
[ Matrix Size: 2048 x 2048 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 2048 x 2048
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.92050 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 2048x2048 elements
Completed: 0.85800 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 3.32250 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 2048x2048 elements
Completed: 3.33100 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 1.03700 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.02950 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 1.86450 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.89550 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 10.25700 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 2048x2048 elements
Completed: 2.43400 secs
> Test1099 End <
Tests: Completed
[ Test Case 37 - 64-bit / Cores: 1 / CPUs: 2 - Number of OpenMP threads - 2 ]
[ Matrix Size: 2048 x 2048 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 2048 x 2048
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.89700 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 2048x2048 elements
Completed: 0.85000 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 3.20600 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 2048x2048 elements
Completed: 3.22950 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 1.01400 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.02150 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 1.90350 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.90300 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 9.29750 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 2048x2048 elements
Completed: 1.78600 secs
> Test1099 End <
Tests: Completed
[ Test Case 38 - 64-bit / Cores: 1 / CPUs: 2 - Number of OpenMP threads - 4 ]
[ Matrix Size: 2048 x 2048 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 2048 x 2048
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.87350 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 2048x2048 elements
Completed: 0.85800 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 3.20600 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 2048x2048 elements
Completed: 3.19800 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 1.02150 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.02200 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 1.86400 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.88750 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 9.31300 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 2048x2048 elements
Completed: 1.90350 secs
> Test1099 End <
Tests: Completed
[ Test Case 39 - 64-bit / Cores: 1 / CPUs: 2 - Number of OpenMP threads - 8 ]
[ Matrix Size: 2048 x 2048 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 2048 x 2048
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.83500 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 2048x2048 elements
Completed: 0.78000 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 3.21400 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 2048x2048 elements
Completed: 3.22900 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 1.05300 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.04500 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 1.89550 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.90300 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 9.32100 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 2048x2048 elements
Completed: 1.89550 secs
> Test1099 End <
Tests: Completed
[ Test Case 40 - 64-bit / Cores: 1 / CPUs: 2 - Number of OpenMP threads - 16 ]
[ Matrix Size: 2048 x 2048 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 2048 x 2048
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.84200 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 2048x2048 elements
Completed: 0.86600 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 3.41600 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 2048x2048 elements
Completed: 3.31500 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 1.05300 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.06050 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 1.95800 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.95750 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 9.32850 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 2048x2048 elements
Completed: 1.88800 secs
> Test1099 End <
Tests: Completed
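For readers who want to reproduce something similar to Test Cases 36 to 40, here is a minimal sketch of an IKJ-ordered multiply with a caller-chosen OpenMP thread count. It is not the MxMultA1 code from IccTestApp, which is not shown in this thread. The name MatMulIKJ, the float element type and the flat row-major layout are all assumptions made for illustration. Asking for more threads than the machine has CPUs (for example 16 on a 2-CPU system) is what oversubscribes it.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not from IccTestApp): C = A * B for n x n matrices
// stored row-major in flat vectors, using the IKJ loop order.
// numThreads is passed to the OpenMP num_threads clause; a value larger
// than the number of logical CPUs requests an oversubscribed team.
std::vector<float> MatMulIKJ(const std::vector<float>& a,
                             const std::vector<float>& b,
                             int n, int numThreads)
{
    std::vector<float> c(static_cast<std::size_t>(n) * n, 0.0f);

    // Each iteration of i writes only row i of C, so the outer loop can be
    // divided among threads without synchronisation.
    #pragma omp parallel for num_threads(numThreads) schedule(static)
    for (int i = 0; i < n; ++i) {
        const std::size_t rowC = static_cast<std::size_t>(i) * n;
        for (int k = 0; k < n; ++k) {
            const float aik = a[rowC + k];            // A[i][k], reused across j
            const std::size_t rowB = static_cast<std::size_t>(k) * n;
            for (int j = 0; j < n; ++j) {
                c[rowC + j] += aik * b[rowB + j];     // unit-stride access to B and C
            }
        }
    }
    return c;
}
```

A call such as MatMulIKJ(a, b, 2048, 16) would request the thread count used in Test Case 40. The pragma only takes effect when OpenMP is enabled at compile time (for example /Qopenmp with the Intel compiler on Windows, or -fopenmp with GCC and Clang); otherwise the loop simply runs on one thread. The runtime may also hand out fewer threads than requested if dynamic adjustment is enabled.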
[ Test Case 41 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 1 ]
[ Matrix Size: 256 x 256 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 256 x 256
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00187 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 256x256 elements
Completed: 0.00000 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.00387 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 256x256 elements
Completed: 0.00587 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00187 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00400 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.00387 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00387 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.01950 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 256x256 elements
Completed: 0.00387 secs
> Test1099 End <
Tests: Completed
[ Test Case 42 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 2 ]
[ Matrix Size: 256 x 256 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 256 x 256
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00097 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 256x256 elements
Completed: 0.00097 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.00344 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 256x256 elements
Completed: 0.00341 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00147 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00144 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.00197 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00194 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.01025 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 256x256 elements
Completed: 0.00194 secs
> Test1099 End <
Tests: Completed
[ Test Case 43 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 4 ]
[ Matrix Size: 256 x 256 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 256 x 256
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00047 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 256x256 elements
Completed: 0.00147 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.00441 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 256x256 elements
Completed: 0.00341 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00147 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00147 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.00244 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00194 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.00975 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 256x256 elements
Completed: 0.00244 secs
> Test1099 End <
Tests: Completed
[ Test Case 44 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 8 ]
[ Matrix Size: 256 x 256 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 256 x 256
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00097 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 256x256 elements
Completed: 0.00147 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.00488 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 256x256 elements
Completed: 0.00438 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00197 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00147 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.00294 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00291 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.01319 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 256x256 elements
Completed: 0.00244 secs
> Test1099 End <
Tests: Completed
[ Test Case 45 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 16 ]
[ Matrix Size: 256 x 256 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 256 x 256
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00100 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 256x256 elements
Completed: 0.00144 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.00538 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 256x256 elements
Completed: 0.00538 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00194 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00147 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.00244 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 256x256 elements
Completed: 0.00294 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.01169 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 256x256 elements
Completed: 0.00294 secs
> Test1099 End <
Tests: Completed
[ Test Case 46 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 1 ]
[ Matrix Size: 512 x 512 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 512 x 512
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00975 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 512x512 elements
Completed: 0.00778 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.05022 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 512x512 elements
Completed: 0.04728 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.01609 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.01656 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.02681 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.02684 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.14672 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 512x512 elements
Completed: 0.02731 secs
> Test1099 End <
Tests: Completed
[ Test Case 47 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 2 ]
[ Matrix Size: 512 x 512 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 512 x 512
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00534 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 512x512 elements
Completed: 0.00584 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.02391 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 512x512 elements
Completed: 0.02387 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00828 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.00828 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.01366 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.01416 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.07409 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 512x512 elements
Completed: 0.01366 secs
> Test1099 End <
Tests: Completed
[ Test Case 48 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 4 ]
[ Matrix Size: 512 x 512 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 512 x 512
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00684 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 512x512 elements
Completed: 0.00631 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.02437 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 512x512 elements
Completed: 0.02437 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00831 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.00875 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.01563 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.01559 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.07262 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 512x512 elements
Completed: 0.01172 secs
> Test1099 End <
Tests: Completed
[ Test Case 49 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 8 ]
[ Matrix Size: 512 x 512 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 512 x 512
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00778 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 512x512 elements
Completed: 0.00731 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.03316 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 512x512 elements
Completed: 0.03266 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00975 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.01025 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.01900 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.02000 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.10334 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 512x512 elements
Completed: 0.01609 secs
> Test1099 End <
Tests: Completed
[ Test Case 50 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 16 ]
[ Matrix Size: 512 x 512 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 512 x 512
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.00828 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 512x512 elements
Completed: 0.00781 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.03119 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 512x512 elements
Completed: 0.03025 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.00928 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.00975 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.01903 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 512x512 elements
Completed: 0.02047 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.10384 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 512x512 elements
Completed: 0.01606 secs
> Test1099 End <
Tests: Completed
[ Test Case 51 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 1 ]
[ Matrix Size: 1024 x 1024 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 1024 x 1024
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.05944 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 1024x1024 elements
Completed: 0.05556 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.38806 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 1024x1024 elements
Completed: 0.37925 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.12287 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.12287 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.21156 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.21162 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 1.16313 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 1024x1024 elements
Completed: 0.20863 secs
> Test1099 End <
Tests: Completed
[ Test Case 52 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 2 ]
[ Matrix Size: 1024 x 1024 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 1024 x 1024
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.05263 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 1024x1024 elements
Completed: 0.05263 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.18913 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 1024x1024 elements
Completed: 0.18913 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.06238 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.06338 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.10625 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.10631 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.58213 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 1024x1024 elements
Completed: 0.10531 secs
> Test1099 End <
Tests: Completed
[ Test Case 53 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 4 ]
[ Matrix Size: 1024 x 1024 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 1024 x 1024
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.05169 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 1024x1024 elements
Completed: 0.04781 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.19987 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 1024x1024 elements
Completed: 0.20087 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.06631 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.06631 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.12675 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.12481 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.57819 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 1024x1024 elements
Completed: 0.09456 secs
> Test1099 End <
Tests: Completed
[ Test Case 54 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 8 ]
[ Matrix Size: 1024 x 1024 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 1024 x 1024
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.06631 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 1024x1024 elements
Completed: 0.05950 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.26612 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 1024x1024 elements
Completed: 0.25256 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.07113 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.07219 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.15206 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.15406 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.71956 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 1024x1024 elements
Completed: 0.12381 secs
> Test1099 End <
Tests: Completed
[ Test Case 55 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 16 ]
[ Matrix Size: 1024 x 1024 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 1024 x 1024
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.05756 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 1024x1024 elements
Completed: 0.05556 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 0.24469 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 1024x1024 elements
Completed: 0.24862 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.07312 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.07706 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 0.13938 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 1024x1024 elements
Completed: 0.14531 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 0.69813 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 1024x1024 elements
Completed: 0.11600 secs
> Test1099 End <
Tests: Completed
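To compare the 1024 x 1024 runs in Test Cases 51 to 55 more directly, the timings can be turned into speedup and per-thread efficiency relative to the 1-thread run. The sketch below is only an illustration: ScalingRow and ComputeScaling are hypothetical names, not part of IccTestApp, and efficiency is taken here simply as speedup divided by the requested thread count. Fed with the Sub-Test 5.1 timings (1.16313 secs at 1 thread; 0.58213, 0.57819, 0.71956 and 0.69813 secs at 2, 4, 8 and 16 threads), it gives a speedup of roughly 2.0 at 2 and 4 threads and roughly 1.6 to 1.7 at 8 and 16 threads.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical record for one row of a scaling table.
struct ScalingRow {
    int threads;        // requested OpenMP thread count
    double speedup;     // baseline time / time at this thread count
    double efficiency;  // speedup / threads
};

// Hypothetical helper: converts completion times into speedup and efficiency
// relative to a single-thread baseline. threads[i] and secs[i] describe the
// same run; any unpaired trailing entries are ignored.
std::vector<ScalingRow> ComputeScaling(double baselineSecs,
                                       const std::vector<int>& threads,
                                       const std::vector<double>& secs)
{
    std::vector<ScalingRow> rows;
    for (std::size_t i = 0; i < threads.size() && i < secs.size(); ++i) {
        const double speedup = baselineSecs / secs[i];
        rows.push_back({threads[i], speedup, speedup / threads[i]});
    }
    return rows;
}
```

For example, ComputeScaling(1.16313, {2, 4, 8, 16}, {0.58213, 0.57819, 0.71956, 0.69813}) returns one row per thread count for Sub-Test 5.1. The same call works for any other sub-test by swapping in its five timings.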
[ Test Case 56 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 1 ]
[ Matrix Size: 2048 x 2048 ]
Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release
Tests: Start
> Test1099 Start <
Matrix A, B and C Sizes : 2048 x 2048
Loop Processing Schema ( LPS ): IKJ
Loop Blocking Divider : 1
Sub-Test 1.1 - MxMultA1 - Classic 2D
LBOT size: N/A
Completed: 0.43875 secs
Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT
LBOT size: 2048x2048 elements
Completed: 0.43288 secs
Sub-Test 1.3 - MxMultA3 - Classic 2D Fused
LBOT size: N/A
Completed: 3.07125 secs
Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT
LBOT size: 2048x2048 elements
Completed: 3.05763 secs
Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed
LBOT size: N/A
Completed: 0.97300 secs
Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 0.97500 secs
Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed
LBOT size: N/A
Completed: 1.73937 secs
Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT
LBOT size: 2048x2048 elements
Completed: 1.73738 secs
Sub-Test 5.1 - MxMultD1 - Classic 1D
LBOT size: N/A
Completed: 9.44588 secs
Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT
LBOT size: 2048x2048 elements
Completed: 2.32250 secs
> Test1099 End <
Tests: Completed