<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Memory Limitation in Cilk? in Software Archive</title>
    <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962604#M22137</link>
    <description>&lt;P&gt;I want to use Cilk Plus with implicit shared memory, but I am getting this error:&lt;/P&gt;

&lt;P&gt;HOST--ERROR:myoiExPLExtendVSM: VSM size exceeds the limitation (4294967296) now!&lt;BR /&gt;
	HOST--ERROR:myoiExMalloc:662 Fail to get a new memory chunk!&lt;BR /&gt;
	HOST--ERROR:myoArenaMalloc1: Fail to get free memory space!&lt;BR /&gt;
	HOST--ERROR:myoArenaAlignedMalloc1: No enough memory space!&lt;/P&gt;

&lt;P&gt;There is still enough memory available, and the same code works with OpenMP and TBB, but with Cilk Plus the program crashes right after start with this message. Are there any parameters that allow a larger shared memory space? I didn't find any documentation on this.&lt;/P&gt;

&lt;P&gt;best regards&lt;/P&gt;</description>
    <pubDate>Sat, 18 Jan 2014 09:28:44 GMT</pubDate>
    <dc:creator>Markus__G_</dc:creator>
    <dc:date>2014-01-18T09:28:44Z</dc:date>
    <item>
      <title>Memory Limitation in Cilk?</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962604#M22137</link>
      <description>&lt;P&gt;I want to use Cilk Plus with implicit shared memory, but I am getting this error:&lt;/P&gt;

&lt;P&gt;HOST--ERROR:myoiExPLExtendVSM: VSM size exceeds the limitation (4294967296) now!&lt;BR /&gt;
	HOST--ERROR:myoiExMalloc:662 Fail to get a new memory chunk!&lt;BR /&gt;
	HOST--ERROR:myoArenaMalloc1: Fail to get free memory space!&lt;BR /&gt;
	HOST--ERROR:myoArenaAlignedMalloc1: No enough memory space!&lt;/P&gt;

&lt;P&gt;There is still enough memory available, and the same code works with OpenMP and TBB, but with Cilk Plus the program crashes right after start with this message. Are there any parameters that allow a larger shared memory space? I didn't find any documentation on this.&lt;/P&gt;

&lt;P&gt;best regards&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 09:28:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962604#M22137</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-18T09:28:44Z</dc:date>
    </item>
    <item>
      <title>As with most programs or</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962605#M22138</link>
      <description>&lt;P&gt;As with most programs or libraries, the Cilk Plus runtime does use some amount of shared memory for its scheduler. But without more details about the platform you are running on, or about how your application uses Cilk Plus, it is hard to tell what is going wrong.&lt;/P&gt;

&lt;P&gt;Is this running on an offload device of some kind, or in a more traditional desktop/server environment? Is the OS 32-bit or 64-bit? How close to the limit of available physical memory does the application get in its normal (serial) execution? What happens if you vary the number of workers used (e.g., CILK_NWORKERS=1, CILK_NWORKERS=2, etc.)? As the number of workers increases, the memory usage also increases.&lt;BR /&gt;
	&lt;BR /&gt;
	Cheers,&lt;/P&gt;

&lt;P&gt;Jim&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 15:33:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962605#M22138</guid>
      <dc:creator>Jim_S_Intel</dc:creator>
      <dc:date>2014-01-18T15:33:06Z</dc:date>
    </item>
    <item>
      <title>Well i just used the same</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962606#M22139</link>
      <description>&lt;P&gt;Well, I used the same code with other APIs (OpenMP and TBB) and it worked fine with the same problem size. I think the memory usage was about 4 GB, maybe a lot more. With Cilk, however, I get this error when offloading to the MIC. The only thing I changed was making the variables global instead of local, declaring them as _Cilk_shared float * _Cilk_shared var, and putting _Cilk_offload before the _Cilk_for.&lt;/P&gt;

&lt;P&gt;I just want to use Cilk in offload mode on the Intel MIC. It works with OpenMP and TBB, but not with Cilk at the same problem size.&lt;/P&gt;

&lt;P&gt;I also ran the same problem, at the same size, with Cilk on the Xeon processor, and it worked, but without the use of&lt;/P&gt;

&lt;P&gt;#define malloc(x) _Offload_shared_aligned_malloc(x,ALIGN) /* ALIGN=64 */&lt;BR /&gt;
	#define free(x) _Offload_shared_aligned_free(x)&lt;/P&gt;

&lt;P&gt;and without transferring the data to the Intel MIC.&lt;/P&gt;

&lt;P&gt;Are there differences in the libraries that Cilk uses?&lt;/P&gt;

&lt;P&gt;Should I change the number of workers? With OpenMP and TBB I used 240 threads.&lt;/P&gt;

&lt;P&gt;Cheers&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 17:15:29 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962606#M22139</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-18T17:15:29Z</dc:date>
    </item>
    <item>
      <title>It's certainly worth while to</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962607#M22140</link>
      <description>&lt;P&gt;It's certainly worthwhile to try a smaller value of MIC_CILK_NWORKERS, since the optimum value of NWORKERS on MIC frequently comes out at half or less of the optimum number of OpenMP threads for the same task.&lt;/P&gt;

&lt;P&gt;I wouldn't be surprised if you encountered a limit of 4GB for offload of such a data region, or if it depended on your coprocessor model or even the stack setting. How much free memory is visible? Mine will not reach 8GB virtual available, with less than 4GB physical.&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 17:27:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962607#M22140</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2014-01-18T17:27:00Z</dc:date>
    </item>
    <item>
      <title>Ok thx i will try i am in</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962608#M22141</link>
      <description>&lt;P&gt;OK, thanks, I will try it; I am in a hurry at the moment. I have a question: is there any difference when using _Offload_shared_aligned_malloc even without offload mode? With OpenMP it is possible to use _mm_malloc, which aligns to the memory banks, so can I also use _mm_malloc?&lt;/P&gt;

&lt;P&gt;Does the stack size of the workers influence the speedup?&lt;/P&gt;

&lt;P&gt;Nevertheless, Cilk Plus and OpenMP achieve nearly the same speedup.&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 18:03:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962608#M22141</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-18T18:03:39Z</dc:date>
    </item>
    <item>
      <title>Ok i did this</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962609#M22142</link>
      <description>&lt;P&gt;OK, I did this:&lt;/P&gt;

&lt;P&gt;export MIC_PREFIX=MIC ; MIC_CILK_NWORKERS=60&lt;/P&gt;

&lt;P&gt;and&lt;/P&gt;

&lt;P&gt;MIC_CILK_NWORKERS=1, and it didn't work.&lt;/P&gt;

&lt;P&gt;Cheers&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 18:41:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962609#M22142</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-18T18:41:34Z</dc:date>
    </item>
    <item>
      <title>Just like computing on your</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962610#M22143</link>
      <description>&lt;P&gt;Just like computing on your XEON host, if your stack size is too small, you're likely to get a segfault.&amp;nbsp; Reproducing errors like this will be hard because there's no way to predict the scheduling - it's deliberately randomized.&lt;/P&gt;

&lt;P&gt;The stack size probably doesn't affect speedup. It's simply the size of the stacks that the Cilk runtime will allocate to steal work on. However, there is one way in which it can affect parallelism (and therefore speedup). If the Cilk runtime attempts to allocate a stack and fails, the worker that tried to allocate the stack will stall for a bit in the hope that a stack has been deallocated and the next allocation attempt will succeed. You'd need to be at the limits of the address space for that to occur. But in that sense, if you crank down the stack size, you may be able to allocate more (smaller) stacks. If your application is being stalled for lack of a stack, that might help.&lt;/P&gt;

&lt;P&gt;You can also control the number of stacks that will be allocated.&amp;nbsp; That will also cause stalling, if all of the stacks have been used.&lt;/P&gt;

&lt;P&gt;But both of these are fringe cases.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp; - Barry&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 18:48:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962610#M22143</guid>
      <dc:creator>Barry_T_Intel</dc:creator>
      <dc:date>2014-01-18T18:48:03Z</dc:date>
    </item>
    <item>
      <title>If you have set MIC_CILK</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962611#M22144</link>
      <description>&lt;P&gt;If you have set MIC_CILK_NWORKERS=1, and the program still doesn't work, then something more fundamental, apart from the scheduling done by the Cilk Plus runtime, may be going on.&amp;nbsp;&amp;nbsp; With 1 worker, I don't think the runtime does anything interesting.&lt;/P&gt;

&lt;P&gt;For the TBB or OpenMP versions that work, are you offloading the computation onto the device,&amp;nbsp;or running on the host machine?&amp;nbsp;&amp;nbsp; I am a bit confused by the statement "Cilk Plus and OpenMP achieves nearly the same speedup," because that implies you got both versions to run correctly.&lt;/P&gt;

&lt;P&gt;Perhaps it might help if you could post a sample program&amp;nbsp;or stripped-down version of your program which still triggers the error you are seeing, and the similar one that&amp;nbsp;does work?&amp;nbsp;&lt;BR /&gt;
	Cheers,&lt;/P&gt;

&lt;P&gt;Jim&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 20:32:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962611#M22144</guid>
      <dc:creator>Jim_S_Intel</dc:creator>
      <dc:date>2014-01-18T20:32:13Z</dc:date>
    </item>
    <item>
      <title>Yeah well , i just computed</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962612#M22145</link>
      <description>&lt;P&gt;Yeah, well, I just computed the same problem on the Xeon processor and compared the performance there, not on the MIC.&lt;/P&gt;

&lt;P&gt;The attempt was with just one worker.&lt;/P&gt;

&lt;P&gt;I think the most important parts are:&lt;/P&gt;

&lt;P&gt;#define real float&lt;BR /&gt;
	#define SQRT sqrtf&lt;BR /&gt;
	#define ALIGN 64&lt;/P&gt;

&lt;P&gt;#include &amp;lt;cilk/cilk.h&amp;gt;&lt;BR /&gt;
	#include &amp;lt;cilk/reducer_opadd.h&amp;gt;&lt;BR /&gt;
	#include &amp;lt;cilk/cilk_api.h&amp;gt;&lt;/P&gt;

&lt;P&gt;.....&lt;/P&gt;

&lt;P&gt;#define ALINGED 1&lt;BR /&gt;
	#define malloc(x) _Offload_shared_aligned_malloc(x,ALIGN)&lt;BR /&gt;
	#define free(x) _Offload_shared_aligned_free(x)&lt;/P&gt;

&lt;P&gt;....&lt;/P&gt;

&lt;P&gt;void init_Atoms(dim3 grid,int nAtoms);&lt;BR /&gt;
	void save_Atoms(int nAtoms,real gridspace,char* out_path);&lt;BR /&gt;
	void load_Atoms(int* nAtoms,real *gridspace,char* in_path);&lt;BR /&gt;
	void calc_energy(dim3 grid,int z_s,real gridspace,int nAtoms);&lt;/P&gt;

&lt;P&gt;void write_to_CSV(real *f,dim3 grid,char *out_path,real gridspace);&lt;/P&gt;

&lt;P&gt;static double dtime(void);&lt;BR /&gt;
	static double cur_second(void);&lt;/P&gt;

&lt;P&gt;_Cilk_shared real *_Cilk_shared energygrid;&lt;BR /&gt;
	_Cilk_shared real *_Cilk_shared atoms;&lt;/P&gt;

&lt;P&gt;int main(int argc, char* argv[]){&lt;/P&gt;

&lt;P&gt;.....&lt;/P&gt;

&lt;P&gt;energygrid=(_Cilk_shared real*) malloc(grid.x*grid.y*grid.z*sizeof(real));&lt;/P&gt;

&lt;P&gt;}&lt;/P&gt;

&lt;P&gt;void init_Atoms(dim3 grid,int nAtoms){&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; atoms=(_Cilk_shared real *)malloc(4*nAtoms*sizeof(real));&lt;/P&gt;

&lt;P&gt;.....&lt;/P&gt;

&lt;P&gt;}&lt;/P&gt;

&lt;P&gt;The error occurs at the allocation of energygrid, where grid.x = grid.y = grid.z = 1000 and real is float.&lt;/P&gt;
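
&lt;P&gt;For reference, here is a quick size check, written as a worked calculation, that compares the energygrid request with the limit printed in the error message. It assumes that sizeof(float) is 4 bytes, which this thread does not state, and the symbols N and L are only labels made up for this note. How much of the shared space the atoms array and the runtime itself occupy is not known from this post.&lt;/P&gt;

```latex
\[ N_{\text{energygrid}} = 1000^{3} \times 4\ \text{B} = 4\,000\,000\,000\ \text{B} \]
\[ L_{\text{VSM}} = 2^{32}\ \text{B} = 4\,294\,967\,296\ \text{B} \]
\[ L_{\text{VSM}} - N_{\text{energygrid}} = 294\,967\,296\ \text{B} \approx 281\ \text{MiB} \]
```

&lt;P&gt;So this single allocation alone takes about 93% of the reported limit. That may be why the request fails once anything else is counted against the same shared space, although this is only a guess from the numbers.&lt;/P&gt;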

&lt;P&gt;Cheers.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 23:25:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962612#M22145</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-18T23:25:32Z</dc:date>
    </item>
    <item>
      <title>Sorry ,...</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962613#M22146</link>
      <description>&lt;P&gt;Sorry...&lt;/P&gt;

&lt;P&gt;The attempt with just one worker was only meant to find out whether something more general is wrong.&lt;/P&gt;

&lt;P&gt;I computed the same problem on the Xeon processor with OpenMP, TBB, and Cilk and compared the performance there, not on the MIC.&lt;/P&gt;

&lt;P&gt;Cheers&lt;/P&gt;</description>
      <pubDate>Sat, 18 Jan 2014 23:28:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962613#M22146</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-18T23:28:02Z</dc:date>
    </item>
    <item>
      <title>The full code can be</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962614#M22147</link>
      <description>&lt;P&gt;The full code can be downloaded from here. There is a problem with the alignment of the text, but it's still readable.&lt;/P&gt;

&lt;P&gt;&lt;A href="https://www.dropbox.com/home/3_vdo_scaled_simd_opt_xeon_phi?select=vdo_scaled_fd_cilk.c" target="_blank"&gt;https://www.dropbox.com/home/3_vdo_scaled_simd_opt_xeon_phi?select=vdo_scaled_fd_cilk.c&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;best regards&lt;/P&gt;</description>
      <pubDate>Sun, 19 Jan 2014 00:24:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962614#M22147</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-19T00:24:08Z</dc:date>
    </item>
    <item>
      <title>Dropbox tells me that the</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962615#M22148</link>
      <description>&lt;P&gt;Dropbox tells me that the folder doesn't exist.&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; - Barry&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jan 2014 15:02:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962615#M22148</guid>
      <dc:creator>Barry_T_Intel</dc:creator>
      <dc:date>2014-01-20T15:02:15Z</dc:date>
    </item>
    <item>
      <title>ah grap,try again</title>
      <link>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962616#M22149</link>
      <description>&lt;P&gt;Ah, crap, try again:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://www.dropbox.com/s/s6uk32h3u2dxc9q/vdo_scaled_fd_cilk.c" target="_blank"&gt;https://www.dropbox.com/s/s6uk32h3u2dxc9q/vdo_scaled_fd_cilk.c&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;best regards&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jan 2014 15:58:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Memory-Limitiation-into-Cilk/m-p/962616#M22149</guid>
      <dc:creator>Markus__G_</dc:creator>
      <dc:date>2014-01-20T15:58:36Z</dc:date>
    </item>
  </channel>
</rss>

