ok, here we go, I have a dream, make a 3D engine 100% assembler intel only with CPU, I use rotation matrix only for now.
it works of course, but it's slow when I put a lot of pixels.
Recently I decided to include voxels in my engine, and it's slow when I put> = 8000 voxels (20 * 20 * 20 cube) and when I saw that nvidia display 32M voxels (fire) I wonder how they can do it !
And I have a little idea of the reason: MMU, paging, segmentation. memory.
Am I right?
Another question, is the FPU is the slowest to compute floating point than SSE or depending of data manipulate ?
PS: I work without OS like Windows or Linux, I run on my own kernel + bootloader in assembly too with NASM.
Sorry if i don't wirte a good english, i'm french and use google translate ^-^