- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi all
Our app uses multiple solve with the same factor, when I looked at the call graph results' it seems that about 30% of the time is in the LSAME function.
70% of DTRSM ( most of the SOLVE work) is the blas_errchk_dtrsm, that calls LSAME.
Is that reasonable? Just some profiling artifact? Or a configuration problem ?
The problem size in about 10000.
Thanks
Link Copied
2 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi!
It shouldn't spend much time a LSAME - it's just a few instructions. What is theprofiler?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I know It shouldn't ... I have a felling that the problem is calling the BLAS function with very small size, and that the reson for the large overhead.
the profiler is intel vtune

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page