Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#1762 | Link | |
Registered User
Join Date: Feb 2003
Location: New York, NY (USA)
Posts: 82
|
Quote:
|
|
![]() |
![]() |
![]() |
#1763 | Link | |
Registered User
Join Date: Feb 2003
Location: New York, NY (USA)
Posts: 82
|
Quote:
So basically - ignoring x264 for a second (which is pretty mature/stable) - some from late April and some from early June, none from January or February. |
|
![]() |
![]() |
![]() |
#1764 | Link |
Registered User
Join Date: May 2005
Location: Swansea, Wales, UK
Posts: 184
|
Ah, must have misread the presentation then, easier for me to read from slides than video - still the position of rav1e seems odd, it shows on graphs to be still hovering around x264 - I could have sworn it passed x264 months ago, and then nothing has been said since despite all the work commits being merged into it.
Is it still suffering that regression from awhile ago? "I believe that because the conference was organized by Vimeo/Mozilla, who just announced a partnership around rav1e, they wanted to prevent a potential conflict of interest in talk selection and decided to not give a talk on it." Yes that makes sense, just seemed a little odd, like going to WWDC and getting nothing from Apple - still they got a lot of mentions from the presenters in any case. Last edited by soresu; 1st July 2019 at 19:40. |
![]() |
![]() |
![]() |
#1765 | Link |
Registered User
Join Date: Aug 2015
Posts: 34
|
The rav1e stats are correct. We still fall behind on high bitrate VMAF - most likely due to that being more sensitive to activity masking (aq) which is still in progress by s_p.
The multithreading performance is limited by a serialization point of the loop filters between frames (by far the slowest part of rav1e right now). There's some outstanding PRs to make it better, e.g. https://github.com/xiph/rav1e/pull/1396 |
![]() |
![]() |
![]() |
#1766 | Link |
Registered User
Join Date: Aug 2009
Posts: 198
|
The Visionular talk from Zoe Liu used builds from Jan and Feb.
I think the x265 release used was the last stable release so that doesn't seem too crazy. You could easily cry foul if someone used a non stable git commit and it performed worse than expected due to hitting a bug. It's also worth bearing in mind that that was basically the same talk as given at the Agora.io thing, so some people tour these things around for a while, it's not ridiculous for them to reuse slides and not have something fresh for every talk they give. Fairly certain I'd seen the NGCodec talk slides before too. |
![]() |
![]() |
![]() |
#1767 | Link | |
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
|
Quote:
|
|
![]() |
![]() |
![]() |
#1768 | Link | |
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
|
Quote:
Also, VMAF isn't static. Netflix comes out with new ML models that will give different (and more accurate) scores compared to older models. It can be estimated for mobile, 1080p, or UHD devices. The scores vary based on the resolution of comparison (720p tested at 720p will deliver higher scores than 720p tested at 1080p, compared to 1080p encoding). And it is a per-frame metric, and how to aggregate per-frame scores into an overall clip quality is an unanswered question. Using a harmonic mean helps, but even that is probably only useful for <20 second durations. A single VMAF score for a whole movie or episode could indicate highly variable quality or highly consistent quality. None of this is a diss on VMAF. Netflix did what they set out to do well, put a huge amount of effort into it, and made reasonable design decisions. But like all metrics, it measures what it is designed to measure, not what we wish it measured ![]() I've seen VMAF do a poor job of detecting:
And it doesn't do HDR at all. It used to not do UHD, but does now. Another problem with a popular metric is that developers start tuning for that metric instead of what the metric is supposed to measure (subjective quality in this case). When developers start tuning for metrics over eyes, the correlation of that metric to subjective quality actually gets WORSE. So, for an encoder like libaom that got tuning based on VMAF ratings, we'd expect that its VMAF scores would be higher relative to actual subjective quality than for encoders that weren't tuned that were. But it'll be better than ones tuned for PSNR, like the vp? series. Not that tuning for VMAF is a bad strategy. But it does result in less meaningful VMAF scores. |
|
![]() |
![]() |
![]() |
#1769 | Link | |||
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
|
Quote:
That said, I've not seen any study demonstrating better subjective quality from a well-tuned libpvx encode versus a well-tuned x265 encode, using the same bitrate @ time. Quote:
Quote:
There are similar debates about TV's default "vivid" mode. Some people claim to like it, even though what's displayed in manifestly wrong on many axes. |
|||
![]() |
![]() |
![]() |
#1771 | Link | |
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
|
Quote:
How the encoders got tuned is what matters. And if quality is being compared at fixed encoding time, performance improvements become quality improvements. |
|
![]() |
![]() |
![]() |
#1772 | Link | |
Registered User
Join Date: Aug 2009
Posts: 198
|
Quote:
Imagine, for example, people who believe that tube amps or vinyl is better than digital audio. In a double-blind test they would probably still vote for the tube amp or the vinyl because it has distinctive audio characteristics that can't be removed without invalidating the test. They can hear things that they prefer and associate (conciously or not) with quality. On the other hand, they would potentially be fooled by audio that had been processed to sound like tube amps or vinyl or passed through a digital chain before output. I considered this possibility because two recent tests that were presented as being negative for AV1 specifically mentioned that some of their test participants were video engineers. They mentioned this as evidence that it was all done properly, but it seemed like an obvious test methodology failure to me. I think it was Monty from Xiph that said his party trick used to be identifying the encoder used just by listening to mp3s, and I bet certain bitrates and content would let people here do the same with video codecs and there's a possibility their opinion scores would differ from Joe Public as a result. |
|
![]() |
![]() |
![]() |
#1773 | Link | |
Registered User
Join Date: Aug 2009
Posts: 198
|
Quote:
http://www.compression.ru/video/code...jective_report On the other hand, similar to how complaints about electric cars are now "I don't like the minimalism of their touchscreen interfaces" when not too long ago you'd hear how they were physical impossibilities, I think the fact that we're now at this level of complaint for the previous generation of royalty-free codecs is a testament to how far we've come. |
|
![]() |
![]() |
![]() |
#1776 | Link | ||||
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
|
Quote:
Quote:
Quote:
Generally I'll have video experts to an initial pass on something to see "is there something that can be seen here?" and then using double-blind testing with a more general population to confirm details. The second is a LOT slower and more expensive than the first, of course. Quote:
|
||||
![]() |
![]() |
![]() |
#1778 | Link | |
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
|
Quote:
Anyway, I requested an optimal Eve encoding for My encoding challenge, but they declined to participate. It is common for encoder vendors who think they are doing some magic things in the bitstream to want to have the bitstream output under NDA and such. I get the impulse, but it just isn't practical for doing actual comparisons or due diligence evaluation. Last edited by benwaggoner; 3rd July 2019 at 22:59. Reason: More details about Eve offer |
|
![]() |
![]() |
![]() |
#1779 | Link |
Registered User
Join Date: Aug 2009
Posts: 198
|
When you say they "declined to participate" did they respond and say they didn't want to take part or did you just not hear from them after making a broad request in a forum post?
I believe the comment above yours saying ("Have you asked?") Is written by a developer of EVE, which suggests they didn't know they'd been asked, so possibly an email has got lost in a spam trap. |
![]() |
![]() |
![]() |
#1780 | Link |
Registered User
Join Date: Feb 2019
Posts: 3
|
Hi guys, I've desided to make a comparison of x264, rav1e and x265 encoders with 500, 600, 700, 800, 900, 1000 kbit/s with the following settings:
rav1e -b $g --tiles 6 -s 5 --matrix BT470BG /D/sintel/sintel720.y4m --output sintel720_rav1e_s5_$g.ivf rav1e -b $g --tiles 6 -s 3 --matrix BT470BG /D/sintel/sintel720.y4m --output sintel720_rav1e_s5_$g.ivf x264 -t 2 -m 11 --me umh --weightp 2 --direct spatial --aq-mode 2 --b-adapt 2 -B $g -b 4 -r 6 -I 240 --b-pyramid normal --no-dct-decimate --no-fast-pskip -A all -o sintel720_x264_$g.264 x265 /D/sintel/sintel720.y4m --y4m -o "Sintel720_x265_$g.h265" --rd 3 -b 4 --b-adapt 2 --b-pyramid --ref 6 -I 240 --bitrate $g --aq-mode 2 --weightp --weightb -m 2 --no-early-skip --psy-rd 1 --me star where $g stands for bitrate value My OS is Arch Linux x86_64 and CPU Core i5 8600K, rav1e was build recently and for testing 1191 frames of sintel 1k 16bit (from 12987 to 14177) were taken and converted to 720x306 yuv420p. x265 and x264 are from the distro's repository. To measure MS-SSIM and PSNR-HVS-M daala's tools were used. To measure VMAF score I used ffmpeg's VMAF filter. Results: ![]() ![]() ![]() x265 is a clear winner with 50.59-61.01 fps (lowest to highest bitrate settings) x264 80.08-102.73 fps rav1e s3 1.603-2.187 fps rav1e s5 3.736-4.469 fps Average CPU utilization of rav1e was 66%-70% (and I couldn't increase it). Last edited by Ilya87; 4th July 2019 at 23:42. |
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|