Hello there,
ok, here we go, I have a dream, make a 3D engine 100% assembler intel only with CPU, I use rotation matrix only for now.
					it works of course, but it's slow when I put a lot of pixels.
Recently I decided to include voxels in my engine, and it's slow when I put> = 8000 voxels (20 * 20 * 20 cube) and when I saw that nvidia display 32M voxels (fire) I wonder how they can do it !
					
					And I have a little idea of  the reason: MMU, paging, segmentation. memory.
					
					Am I right?
					
					Another question, is the FPU is the slowest to compute floating point  than SSE or depending of data manipulate ?
					PS: I work without OS like Windows or Linux, I run on my own kernel + bootloader in assembly too with NASM.
Sorry if i don't wirte a good english, i'm french and use google translate ^-^
