Software Archive
Read-only legacy content
17061 Discussions

Oversubscription of OpenMP threads for processing small data sets

SergeyKostrov
Valued Contributor II
9,425 Views
*** Oversubscription of OpenMP threads for processing small data sets ***
0 Kudos
87 Replies
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 57 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 2 ] [ Matrix Size: 2048 x 2048 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 2048 x 2048 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.41337 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 2048x2048 elements Completed: 0.40950 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 1.52287 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 2048x2048 elements Completed: 1.53075 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.49137 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.49137 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.87163 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.87362 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 4.71900 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 2048x2048 elements Completed: 1.63612 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 58 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 4 ] [ Matrix Size: 2048 x 2048 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 2048 x 2048 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.44862 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 2048x2048 elements Completed: 0.43288 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 1.62050 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 2048x2048 elements Completed: 1.62050 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.51100 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.51087 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.96912 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.96137 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 4.64500 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 2048x2048 elements Completed: 1.13488 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 59 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 8 ] [ Matrix Size: 2048 x 2048 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 2048 x 2048 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.45825 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 2048x2048 elements Completed: 0.46600 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 1.78425 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 2048x2048 elements Completed: 1.79988 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.58700 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.58887 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 1.11150 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 2048x2048 elements Completed: 1.07825 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 5.12663 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 2048x2048 elements Completed: 1.30262 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 60 - 64-bit / Cores: 2 / CPUs: 4 - Number of OpenMP threads - 16 ] [ Matrix Size: 2048 x 2048 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 2048 x 2048 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.48562 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 2048x2048 elements Completed: 0.48550 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 1.83300 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 2048x2048 elements Completed: 1.81938 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.58500 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.56937 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 1.06275 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 2048x2048 elements Completed: 1.07650 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 5.10512 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 2048x2048 elements Completed: 1.29475 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 61 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 1 ] [ Matrix Size: 256 x 256 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 256 x 256 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00147 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 256x256 elements Completed: 0.00147 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.00634 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 256x256 elements Completed: 0.00584 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00244 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 256x256 elements Completed: 0.00244 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.00341 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 256x256 elements Completed: 0.00391 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.01853 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 256x256 elements Completed: 0.00387 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 62 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 2 ] [ Matrix Size: 256 x 256 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 256 x 256 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00097 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 256x256 elements Completed: 0.00097 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.00294 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 256x256 elements Completed: 0.00294 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00147 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 256x256 elements Completed: 0.00147 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.00194 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 256x256 elements Completed: 0.00194 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.00975 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 256x256 elements Completed: 0.00244 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 63 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 4 ] [ Matrix Size: 256 x 256 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 256 x 256 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00048 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 256x256 elements Completed: 0.00098 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.00195 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 256x256 elements Completed: 0.00194 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00098 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 256x256 elements Completed: 0.00097 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.00122 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 256x256 elements Completed: 0.00122 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.00513 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 256x256 elements Completed: 0.00145 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 64 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 8 ] [ Matrix Size: 256 x 256 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 256 x 256 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00048 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 256x256 elements Completed: 0.00097 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.00292 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 256x256 elements Completed: 0.00317 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00122 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 256x256 elements Completed: 0.00122 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.00195 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 256x256 elements Completed: 0.00170 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.00853 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 256x256 elements Completed: 0.00195 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 65 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 16 ] [ Matrix Size: 256 x 256 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 256 x 256 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00073 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 256x256 elements Completed: 0.00098 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.00244 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 256x256 elements Completed: 0.00220 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00145 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 256x256 elements Completed: 0.00122 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.00147 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 256x256 elements Completed: 0.00147 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.00781 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 256x256 elements Completed: 0.00198 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 66 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 1 ] [ Matrix Size: 512 x 512 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 512 x 512 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00538 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 512x512 elements Completed: 0.00681 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.04925 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 512x512 elements Completed: 0.04678 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.01609 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 512x512 elements Completed: 0.01609 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.02631 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 512x512 elements Completed: 0.02681 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.14675 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 512x512 elements Completed: 0.02728 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 67 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 2 ] [ Matrix Size: 512 x 512 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 512 x 512 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00534 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 512x512 elements Completed: 0.00584 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.02391 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 512x512 elements Completed: 0.02387 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00878 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 512x512 elements Completed: 0.00878 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.01366 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 512x512 elements Completed: 0.01412 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.07409 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 512x512 elements Completed: 0.01366 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 68 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 4 ] [ Matrix Size: 512 x 512 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 512 x 512 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00294 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 512x512 elements Completed: 0.00291 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.01269 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 512x512 elements Completed: 0.01269 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00438 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 512x512 elements Completed: 0.00488 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.00731 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 512x512 elements Completed: 0.00681 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.03756 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 512x512 elements Completed: 0.00731 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 69 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 8 ] [ Matrix Size: 512 x 512 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 512 x 512 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00391 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 512x512 elements Completed: 0.00488 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.02144 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 512x512 elements Completed: 0.02097 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00681 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 512x512 elements Completed: 0.00684 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.01316 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 512x512 elements Completed: 0.01316 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.06531 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 512x512 elements Completed: 0.01072 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 70 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 16 ] [ Matrix Size: 512 x 512 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 512 x 512 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.00441 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 512x512 elements Completed: 0.00488 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.01853 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 512x512 elements Completed: 0.01850 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.00587 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 512x512 elements Completed: 0.00584 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.01075 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 512x512 elements Completed: 0.01119 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.06484 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 512x512 elements Completed: 0.01025 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 71 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 1 ] [ Matrix Size: 1024 x 1024 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 1024 x 1024 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.04481 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 1024x1024 elements Completed: 0.04481 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.38225 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 1024x1024 elements Completed: 0.37931 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.12188 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.12281 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.21162 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.21156 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 1.16319 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 1024x1024 elements Completed: 0.20963 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 72 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 2 ] [ Matrix Size: 1024 x 1024 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 1024 x 1024 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.05269 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 1024x1024 elements Completed: 0.05269 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.18812 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 1024x1024 elements Completed: 0.18819 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.06244 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.06244 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.10631 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.10625 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.58213 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 1024x1024 elements Completed: 0.10431 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,754 Views
[ Test Case 73 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 4 ] [ Matrix Size: 1024 x 1024 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 1024 x 1024 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.02291 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 1024x1024 elements Completed: 0.02294 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.09553 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 1024x1024 elements Completed: 0.09553 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.03316 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.03266 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.05509 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.05509 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.29203 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 1024x1024 elements Completed: 0.05263 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,713 Views
[ Test Case 74 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 8 ] [ Matrix Size: 1024 x 1024 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 1024 x 1024 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.03997 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 1024x1024 elements Completed: 0.04194 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.17891 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 1024x1024 elements Completed: 0.17891 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.05119 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.05022 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.10338 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.10384 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.51091 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 1024x1024 elements Completed: 0.08141 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,713 Views
[ Test Case 75 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 16 ] [ Matrix Size: 1024 x 1024 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 1024 x 1024 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.03413 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 1024x1024 elements Completed: 0.03803 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 0.15163 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 1024x1024 elements Completed: 0.15503 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.04875 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.04631 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.09847 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 1024x1024 elements Completed: 0.09019 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 0.42169 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 1024x1024 elements Completed: 0.06825 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,713 Views
[ Test Case 76 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 1 ] [ Matrix Size: 2048 x 2048 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 2048 x 2048 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.28863 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 2048x2048 elements Completed: 0.28662 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 3.05763 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 2048x2048 elements Completed: 3.04600 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.97113 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.97312 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 1.73738 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 2048x2048 elements Completed: 1.73162 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 9.43025 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 2048x2048 elements Completed: 2.32050 secs > Test1099 End < Tests: Completed
0 Kudos
SergeyKostrov
Valued Contributor II
1,713 Views
[ Test Case 77 - 64-bit / Cores: 4 / CPUs: 8 - Number of OpenMP threads - 2 ] [ Matrix Size: 2048 x 2048 ] Application - IccTestApp - WIN64_ICC ( 64-bit ) - Release Tests: Start > Test1099 Start < Matrix A, B and C Sizes : 2048 x 2048 Loop Processing Schema ( LPS ): IKJ Loop Blocking Divider : 1 Sub-Test 1.1 - MxMultA1 - Classic 2D LBOT size: N/A Completed: 0.41050 secs Sub-Test 1.2 - MxMultA2 - Classic 2D LBOT LBOT size: 2048x2048 elements Completed: 0.40950 secs Sub-Test 1.3 - MxMultA3 - Classic 2D Fused LBOT size: N/A Completed: 1.51619 secs Sub-Test 1.4 - MxMultA4 - Classic 2D Fused LBOT LBOT size: 2048x2048 elements Completed: 1.51706 secs Sub-Test 2.1 - MxMultB1 - Classic 2D Transposed LBOT size: N/A Completed: 0.49237 secs Sub-Test 2.2 - MxMultB2 - Classic 2D Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.49338 secs Sub-Test 2.3 - MxMultB3 - Classic 2D Fused Transposed LBOT size: N/A Completed: 0.87069 secs Sub-Test 2.4 - MxMultB4 - Classic 2D Fused Transposed LBOT LBOT size: 2048x2048 elements Completed: 0.87169 secs Sub-Test 5.1 - MxMultD1 - Classic 1D LBOT size: N/A Completed: 4.71613 secs Sub-Test 5.2 - MxMultD2 - Classic 1D LBOT LBOT size: 2048x2048 elements Completed: 1.35038 secs > Test1099 End < Tests: Completed
0 Kudos
Reply