- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Slight changes in my source have caused a significant change at runtime. While previously the profile was dominated by pow.L this has now been replaced with __ieee754_pow along with __exp1.
Normally I wouldn't care but now the lattertwo take significantly more time. I have not changed compiler flags so I suspect it has to do with the data.
Can anybody explain the logic behind this ?
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
You must take care that the expression is pure single precision (and avoid the arch-consistency option) if you want the fast single precision power function.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
thanks a lot, Tim, good hint to glibc.
It turns out that my LIBRARY_PATH environment was messed up. Back to normal now.
(Background: Overloading of the pow() symbolappeared to bethe only way to get a gprof call graph in order to see from where this function is most frequently called. That did require tocopy and hackthe Intel libm and rename functionpow() to wop() in order to avoid duplicate symbols.)
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page