* Use int for int8x4 due to performance overhead of char4 * Add a comment about using int * Remove invalid test