In order to create a single video from a varying number of dissimilar audio/video components, they must be of the same frame rate (format, size, channels, etc. too). So when the .jpg and .mp3 are combined, they need to match all the other components. I'll give FFmpegSource and LSMASHSource a go and see if there is any difference there. In the mean time, I've used audioTrim to force the .avs output to the duration I want and that seems to work.
__________________
Life is more interesting viewed upside down
|