- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi all,
I tried to replace
Are there some issues that one needs to be aware of before using
Also, I am coding from VC8.0. Which
Thanks in advance.
Alex
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Dale
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Ok, so a small self-contained example is attached. It is a very simple image processing routine. With
6X slower!!!!!!! I would really appreciate if anybody could tell me what is going on!
I've compiled from VC++ 8.0 with the fastest options. Here is the whole command line for the compiler:
/GL /c /O3 /Og /Ob2 /Oi /Ot /Oy /GA /D "WIN32" /D "NDEBUG" /D "_CONSOLE" /D "_UNICODE" /D "UNICODE" /FD /MD /GS /GR /Fo"Release/" /W3 /nologo /Wp64 /Zi /Gd /Qansi-alias /QaxP /QxP
and for the linker
/LTCG kernel32.lib user32.lib gdi32.lib winspool.lib comdlg32.lib advapi32.lib shell32.lib ole32.lib oleaut32.lib uuid.lib odbc32.lib odbccp32.lib /OUT:"D:CodeOptimisationRelease/Optimisation.exe" /INCREMENTAL:NO /nologo /TLBID:1 /DEBUG /PDB:"D:CodeOptimisationReleaseOptimisation.pdb" /SUBSYSTEM:CONSOLE /OPT:REF /OPT:ICF /IMPLIB:"D:CodeOptimisationReleaseOptimisation.lib" /MACHINE:X86 /MANIFEST /MANIFESTFILE:"ReleaseOptimisation.exe.interme
Thanks in advance,
Alex
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello,
Yes, it seems that including "mathimf.h" in this case prevents calling of the excellent SVML (Short Vector Math Library) routines that are called with "math.h",
00401222 movups xmm0,xmmword ptr [ebx+edi*4]
00401226 call ___svml_logf4 (40CD84h)
0040122B call ___svml_tanhf4 (40CDBCh)
00401230 call ___svml_expf4 (40CDE4h)
00401235 mov eax,dword ptr [esp+2Ch]
00401239 movaps xmmword ptr [esi+edi*4],xmm0
0040123D add edi,4
00401240 cmp edi,eax
00401242 jb main+222h (401222h)
Best Regards,
Lars Petter Endresen
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page