Hi!
Currently, auto-vectorization is enabled only if WorkSize is a multiple of 4. Would it be possible to implement an optimization that vectorizes the first (WorkSize/4 * 4) work-items (integer division) and processes only the remainder as scalar? (And/or processes only the beginning as scalar, which would also remove the 256-byte alignment constraint.)
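To make the request concrete, here is a minimal sketch in plain C of the loop split I have in mind. It is not OpenCL and not a claim about how the compiler works internally; `scale_split`, `scale_vec4`, `scale_scalar` and `VEC_WIDTH` are hypothetical names made up for illustration, and the "vector" step is just a stand-in loop over 4 lanes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum { VEC_WIDTH = 4 };  /* assumed vector width, matching the "multiple of 4" rule */

/* Hypothetical stand-in for one vectorized step: handles VEC_WIDTH items at once. */
static void scale_vec4(float *dst, const float *src, float k)
{
    for (int lane = 0; lane < VEC_WIDTH; ++lane)
        dst[lane] = src[lane] * k;
}

/* Scalar fallback for a single work-item. */
static void scale_scalar(float *dst, const float *src, float k)
{
    *dst = *src * k;
}

/*
 * Processes work_size items in three phases:
 *   1. scalar prologue until dst reaches the requested byte alignment,
 *   2. vector body over as many full groups of VEC_WIDTH as remain,
 *   3. scalar tail for the leftover items.
 * Returns the number of items handled by the vector body.
 */
size_t scale_split(float *dst, const float *src, float k,
                   size_t work_size, size_t alignment)
{
    assert(alignment >= sizeof(float) && alignment % sizeof(float) == 0);
    size_t i = 0;

    /* Phase 1: peel scalar iterations until dst + i is aligned. */
    while (i < work_size && ((uintptr_t)(dst + i) % alignment) != 0) {
        scale_scalar(dst + i, src + i, k);
        ++i;
    }

    /* Phase 2: vector body over ((remaining / 4) * 4) items. */
    size_t vec_items = ((work_size - i) / VEC_WIDTH) * VEC_WIDTH;
    size_t vec_end = i + vec_items;
    for (; i < vec_end; i += VEC_WIDTH)
        scale_vec4(dst + i, src + i, k);

    /* Phase 3: scalar tail for the remainder. */
    for (; i < work_size; ++i)
        scale_scalar(dst + i, src + i, k);

    return vec_items;
}
```

With a WorkSize of 10 and no extra alignment requirement, 8 items would go through the vector path and only 2 through the scalar tail, instead of all 10 running as scalar.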
Thanks!
Atmapuri