Quote:
Originally Posted by egur
Thanks!
I looked at the VLC code and found out they use an SSE4.1 instruction to copy from the GPU memory. I had to rewrite using SSE4 intrinsics so 64 bit compilations would work. Results are nice, Now I'm always faster then libavcodec on 720p (and north) videos.
|
Yeah that SSE 4.1 instruction is great for this task. Intel really knows what they're doing.