- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
We forwarded your question to our Application Engineers, who responded as follows:
The Pentium 4 processor is fairly efficient at hiding instruction latency through pipelining the instructions with its out of order execution engine. The shift and add instructions use different execution units, so there is no conflict for resources between these types of instructions. The por instruction, however, uses the same execution unit as the padd instruction, and this could be causing delays while the execution unit?s pipeline clears. This is especially true if you interleave the padd and por instructions. If there is some way to stack the padd instructions so that they can execute sequentially, then you can take advantage of the pipelining in the processor. If each padd instruction is dependent on the result of a por instruction, then you will have to look at other strategies to streamline the execution.
We hope this is helpful.
Regards,
Lexi S.
IntelSoftware NetworkSupport
Message Edited by intel.software.network.support on 12-07-2005 04:39 PM

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page