Page 15:
The diagram on the left is not the conventional non-pyramid B-frame structure (nor is it even an allowed structure: you're claiming to predict P-frames from B-frames that are later in coded order). Conventional is to predict B-frames only from P-frames, not from previous B-frames.
Page 20:
The DPB always acts as a FIFO (except for MMCOs). This is not caused by the use or ordering of L0 and L1.
(And if you're focusing on x264 rather than on the standard, you could skip the part about how P and B have different default reference orders, because x264 ignores the standard's default and makes them the same.)
Page 21:
Motion search area is not usually rectangular, nor any other data-independent shape. The sane methods are hill-climbing searches.
SSD and MSE are the same thing (if you ignore the normalization constant, which you would if you're only comparing to other values of the same metric).
Page 26:
The standard describes CABAC inefficiently. CABAC states actually fit in 7 bits per context, not 16.
Page 28:
Storing the bitstream from RDO is not helpful. Not even for speed and ignoring the negligible memory costs. It's faster to not generate the bitstream in the first place, since RDO only cares about the number of bits, not which bits they are.
|