@nevcairiel You're fast
That bench included !604 (prep_8tap), so this is most likely what performance will look like at the 0.2.0 release. Not all prep_8tap functions have been converted to ssse3 yet, so another 5-12% speedup is potentially possible for ssse3, depending heavily on content.