operations in some cases have higher overheads than the existing 128-bit SSE operations. We also compare Intel's SIMD extensions against the ARM \NEON{}. Note that Parabix allowed us to perform these studies without having to change the application source. Finally, we parallelized the Parabix XML parser to take advantage of the fine-grain parallelism that we exploit; the parallelized Parabix achieves a further 2$\times$ improvement in performance.
