According to a Prof. of mine[1] loading/storing from/to SIMD registers is in fact slow and should be avoided. Just my two cents, panzi [1] http://www.complang.tuwien.ac.at/anton/ Here his work was used: http://webkit.org/blog/189/announcing-squirrelfish/