Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The current PR for ARM SIMD[1] uses a different instruction mix to achieve the same goals as movemask. I tested the PR and it has a significant speedup over the non-vectorized version.

[1]https://github.com/BurntSushi/memchr/pull/114



Yup, thanks for the reminder. That's my starting point once my M2 arrives.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: