Which pipe does VBROADCASTSD go down? I looked in the latency/throughput section of the Intel optimization guide and it doesn't list this information. Thanks
perfwise
4 Replies
I haven't heard which pipe it goes down but from my assembly performance it surely isn't pipe 0 or 1, where the 256-bit add and mul units are. Thought I'd let people know.
perfwise
>>>I haven't heard which pipe it goes down but from my assembly performance it surely isn't pipe 0 or 1>>>
Do you mean the execution unit's port?
Yes, but it's a moot point now. I was tuning my dgemm for Sandy Bridge (SB) and Ivy Bridge (IB) and noticed that some "replication" instructions use the same pipe as the add or the multiply; I can't remember which off the top of my head. VBROADCASTSD doesn't, so it is preferable for this purpose.
perfwise
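For readers who have not seen the pattern, here is a minimal sketch of how a broadcast typically feeds the multiply and add in a dgemm-style inner loop. It is not the kernel discussed above. The function name `dgemm_4x1_sketch`, the 4 x kc panel layout, and the GCC/Clang `target("avx")` attribute are all assumptions made for illustration. `_mm256_broadcast_sd` usually compiles to `vbroadcastsd ymm, m64`, though the compiler is free to choose otherwise.

```c
#include <assert.h>
#include <immintrin.h>
#include <stddef.h>

/* Hypothetical micro-kernel (illustration only):
 *   c[0..3] += sum over k of a[4*k .. 4*k+3] * b[k]
 * a is a 4 x kc panel stored column by column, b holds kc scalars.
 * The target attribute is GCC/Clang specific and lets this compile
 * without -mavx; the CPU must still support AVX at run time. */
__attribute__((target("avx")))
void dgemm_4x1_sketch(size_t kc, const double *a, const double *b, double *c)
{
    assert(a != NULL && b != NULL && c != NULL);

    __m256d acc = _mm256_loadu_pd(c);             /* running 4-wide accumulator */
    for (size_t k = 0; k < kc; ++k) {
        __m256d av = _mm256_loadu_pd(a + 4 * k);  /* one column of the A panel */
        __m256d bv = _mm256_broadcast_sd(b + k);  /* replicate b[k] into all 4 lanes */
        acc = _mm256_add_pd(acc, _mm256_mul_pd(av, bv)); /* separate mul and add (no FMA) */
    }
    _mm256_storeu_pd(c, acc);
}
```

The replication here is done from memory by the broadcast, so the only arithmetic instructions in the loop body are the multiply and the add.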
I suppose that the load/store ports 2 and 3 execute the VBROADCASTSD instruction.