- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
After increasing array sizes I got a test case where _intel_fast_memset consumes 80% of total compute time. The OProfile call graph is not very helpful in finding the caller. The only approach right now is browsing through the OProfile assembly output and check the calls of this routine.
Is anybody aware of a cheap trick to find this easier ? Or would it be possible to intercept and replacethis routine with a clone through LD_PRELOAD ? In order to do that one has to have knowledge of the API. Is it identical to memset() or is it somewhat optimized (I bet it is) ? Any hints welcome.
thanks
michael
Is anybody aware of a cheap trick to find this easier ? Or would it be possible to intercept and replacethis routine with a clone through LD_PRELOAD ? In order to do that one has to have knowledge of the API. Is it identical to memset() or is it somewhat optimized (I bet it is) ? Any hints welcome.
thanks
michael
1 Solution
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
It does appear that linking in a renamed memset() might work. As your task appears to be to find out where you spend the most time copying arrays, cutting back on optimization of the possible culprits so that the fast_memset optimization doesn't occur would be another possibility. Not to mention profiling under another compiler such as gfortran.
Link Copied
2 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
It does appear that linking in a renamed memset() might work. As your task appears to be to find out where you spend the most time copying arrays, cutting back on optimization of the possible culprits so that the fast_memset optimization doesn't occur would be another possibility. Not to mention profiling under another compiler such as gfortran.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Indeed, that's it. That way thememset time will be accountedwith the calling routine. Thanks a lot.

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page