AVX with 128-bit registers has small minor benefits, usually because there happen to be some instrustrictons that may do the job better, and because AVX allows more efficient use of some other instructions (ie. distinct destination registers for some instructions, instead of an implied one). The differences to SSE1/2/3/4.1 versions of the same code (which also are 128-bit) is relatively small, maybe 5-10% (for a single given function), while doubling the register size to 256 can in an ideal case of course be up to twice as fast.
__________________
LAV Filters - open source ffmpeg based media splitter and decoders
|