Created attachment 2698[details]
Patch to eliminate unaligned accesses
Render commands are stored in a 256kB byte array, which is often dereferenced as a void or some other type of pointer. On strict architectures, unaligned accesses such as these are either trapped and handled slowly in the kernel or disallowed completely.
The attached patch aligns commands to sizeof(void *) boundaries in the command buffer. Performance should be improved on all platforms and especially so on platforms where unaligned accesses very slow.
In the time it took to load quake3, and using the keyboard move down to select Exit from the main screen, it incurred 100904 unaligned traps on my Alpha system. With the patch, I have not seen a single unaligned trap.
Tested on Alpha/Linux and AMD64/Linux. Please commit.
Created attachment 2698 [details] Patch to eliminate unaligned accesses Render commands are stored in a 256kB byte array, which is often dereferenced as a void or some other type of pointer. On strict architectures, unaligned accesses such as these are either trapped and handled slowly in the kernel or disallowed completely. The attached patch aligns commands to sizeof(void *) boundaries in the command buffer. Performance should be improved on all platforms and especially so on platforms where unaligned accesses very slow. In the time it took to load quake3, and using the keyboard move down to select Exit from the main screen, it incurred 100904 unaligned traps on my Alpha system. With the patch, I have not seen a single unaligned trap. Tested on Alpha/Linux and AMD64/Linux. Please commit.