Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
#7441 | Link |
Registered User
Join Date: Oct 2003
Posts: 158
|
Hi all,
I'm trying to optimize encoding speed on my Ryzen 3990X (64 cores, SMT disabled) on Windows 10 64-bit. When encoding a UHD file at 23.976 fps, I only reach 55-58% CPU load. I tried adding the "--numa-pools=64" setting, but there's no change. I currently use the 3.2+38-fdbd4e4 build (VS 2019 / AVX2). Do you have any idea how to speed up encoding? Thank you. |
#7442 | Link | |
Registered User
Join Date: Dec 2002
Location: Region 0
Posts: 1,436
|
Quote:
2) It depends where the bottleneck is. Your source could be unable to feed frames to the encoder fast enough.
3) Split the encode into pieces on a scene change and run them simultaneously. |
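A minimal sketch of the "split on a scene change" idea, assuming the scene-change frame numbers are already known from some other tool. The function names, the file names `input.y4m` and `part_N.265`, and the sample frame numbers are all hypothetical; `--seek` and `--frames` are existing x265 CLI options, but whether seeking works depends on the input type, so treat this as a planning aid rather than a tested workflow.

```python
from bisect import bisect_left

def pick_split_points(scene_changes, total_frames, parts):
    """Pick up to parts-1 scene-change frames, each nearest to an even division."""
    cuts = sorted({f for f in scene_changes if 0 < f < total_frames})
    chosen = []
    for i in range(1, parts):
        target = total_frames * i // parts
        pos = bisect_left(cuts, target)
        # Only the neighbours on either side of the target can be nearest.
        candidates = cuts[max(pos - 1, 0):pos + 1]
        if not candidates:
            continue
        best = min(candidates, key=lambda f: abs(f - target))
        if best not in chosen:
            chosen.append(best)
    return sorted(chosen)

def chunk_ranges(split_points, total_frames):
    """Turn split points into (first_frame, frame_count) pairs covering the clip."""
    bounds = [0] + list(split_points) + [total_frames]
    return [(a, b - a) for a, b in zip(bounds, bounds[1:])]

def x265_args(first, count, index):
    """Build one command line per chunk; input and output names are placeholders."""
    return ["x265", "--seek", str(first), "--frames", str(count),
            "-o", f"part_{index}.265", "input.y4m"]

if __name__ == "__main__":
    scenes = [100, 480, 520, 990, 1510]   # hypothetical scene-change frames
    splits = pick_split_points(scenes, total_frames=2000, parts=4)
    for n, (first, count) in enumerate(chunk_ranges(splits, 2000)):
        print(" ".join(x265_args(first, count, n)))
```

Each printed command could then be launched as its own process; joining the resulting streams afterwards is a separate step not covered here.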
|
#7443 | Link | |
RipBot264 author
Join Date: May 2006
Location: Poland
Posts: 7,915
|
Quote:
__________________
Windows 7 Image Updater - SkyLake\KabyLake\CoffeLake\Ryzen Threadripper |
|
#7444 | Link |
Registered User
Join Date: Dec 2002
Location: Region 0
Posts: 1,436
|
Windows 10 generally doesn't support more than 64 logical cores per NUMA node, so disabling SMT is the best way to get the most performance out of a 3990 under Windows 10 unless he uses Windows 10 Enterprise which can use all 128. Using 64 real cores is better than 64 logical cores across 32 real cores.
|
#7446 | Link |
Registered User
Join Date: Oct 2003
Posts: 158
|
Interesting details:
I get slightly better performance on Linux (CentOS 8.1), but still far from 100% CPU load. However, I ran this x265 benchmark: http://www.xin.at/x265/index-en.php I reached 00:40:09.644 on Linux and 01:14:24.446 on Windows.
Last edited by Lucius Snow; 14th March 2020 at 01:02. |
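To put the two benchmark times side by side, here is a small sketch that converts them to seconds and takes the ratio. It assumes the last field of each time is milliseconds; the helper name `to_seconds` is just for illustration.

```python
def to_seconds(stamp):
    """Convert an 'hh:mm:ss.mmm' string to seconds."""
    hours, minutes, seconds = stamp.split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)

linux = to_seconds("00:40:09.644")
windows = to_seconds("01:14:24.446")
ratio = windows / linux

print(f"Linux: {linux:.3f} s, Windows: {windows:.3f} s, ratio: {ratio:.2f}x")
```

On these numbers the Windows run took roughly 1.85 times as long as the Linux run.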
#7447 | Link | |
RipBot264 author
Join Date: May 2006
Location: Poland
Posts: 7,915
|
Quote:
__________________
Windows 7 Image Updater - SkyLake\KabyLake\CoffeLake\Ryzen Threadripper |
|
#7448 | Link |
結城有紀
Join Date: Dec 2003
Location: Oregon
Posts: 895
|
Do you have benchmarks to support that? I thought we'd lose at most 20% performance by not using SMT but that's just my wild guess. Would love to see actual numbers showing the difference.
__________________
My Projects x265 - Yuuki-Asuna-mod TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median |
#7449 | Link |
Registered User
Join Date: Dec 2002
Location: Region 0
Posts: 1,436
|
What is an exception code of 0xc0000005 in x265?
Code:
Faulting application name: x265.exe, version: 3.3.0.1, time stamp: 0x5e4bfd9a
Faulting module name: x265.exe, version: 3.3.0.1, time stamp: 0x5e4bfd9a
Exception code: 0xc0000005
Fault offset: 0x00000000004fb4d2
Faulting process id: 0x23a4
Faulting application start time: 0x01d5f9aa83090eef
Faulting application path: C:\HDTV Tools\x265\x265.exe
Faulting module path: C:\HDTV Tools\x265\x265.exe
Report Id: 921cdb5a-b329-4dac-b876-d60ff8a06c73
I changed from this:
Code:
START "Enc #6" /NORMAL /NODE 0 /AFFINITY 00000F00 "x265.exe" --pools 4 -F 1 --crf 16.0 -p veryslow --no-sao --aq-mode 1 --aq-strength 1.15 --vbv-maxrate 25000 --vbv-bufsize 25000 --level 5.0 --keyint 120 --open-gop -D 10 --colorprim "bt709" --transfer "bt709" --colormatrix "bt709" --sar 1:1 --qpfile 6.chp -o "out_6.265" "in_6.avs"
Code:
START "Enc #6" /NORMAL /NODE 0 /AFFINITY 00000F00 "x265.exe" --pools 4 -F 1 --crf 16.0 -p veryslow --aq-strength 1.15 --vbv-maxrate 25000 --vbv-bufsize 25000 --level 5.0 --keyint 120 --open-gop -D 10 --colorprim "bt709" --transfer "bt709" --colormatrix "bt709" --sar 1:1 --qpfile 6.chp -o "out_6.265" "in_6.avs"
Code:
AVSMeter 2.9.8 (x64), 2012-2020, (c) Groucho2004
AviSynth+ 3.4 (r2925, master, x86_64) (3.4.0.0)
Number of frames: 30565
Length (hh:mm:ss.ms): 00:21:14.815
Frame width: 1920
Frame height: 1080
Framerate: 23.976 (24000/1001)
Colorspace: YUV420P10
I'm using HolyWu's build if that matters.
Last edited by Stereodude; 14th March 2020 at 14:42. |
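For readers unfamiliar with the hexadecimal value passed to /AFFINITY in the commands above, this sketch decodes such a mask into the bit positions it selects. The helper name `cores_in_mask` is hypothetical, and how START maps those bits to physical cores when /NODE is also given is not modelled here.

```python
def cores_in_mask(mask_hex):
    """Return the zero-based bit positions set in a hexadecimal affinity mask."""
    mask = int(mask_hex, 16)
    return [bit for bit in range(mask.bit_length()) if (mask >> bit) & 1]

print(cores_in_mask("00000F00"))  # prints [8, 9, 10, 11]
```

So the mask 00000F00 selects four logical processors, which lines up with the `--pools 4` setting in the same command.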
#7450 | Link | |
Registered User
Join Date: Dec 2002
Location: Region 0
Posts: 1,436
|
Quote:
Disabling SMT is how you optimize the performance of an application that will only run on a single NUMA node with a processor like a 3990. The FPS of his single x265 instance with SMT disabled is significantly higher than it would be with SMT enabled; I'd estimate 70-80%. Is there an alternative workaround for maximizing x265 performance, namely running multiple simultaneous encodes? Yes.
Last edited by Stereodude; 14th March 2020 at 14:22. |
|
#7452 | Link | |
Registered User
Join Date: Dec 2002
Location: Region 0
Posts: 1,436
|
Quote:
By the way, you could just encode two different video source files at the same time.
Last edited by Stereodude; 14th March 2020 at 22:32. |
|
#7453 | Link | |
Registered User
Join Date: Feb 2015
Posts: 33
|
Hey guys, is there any good, free tool to extract hdr10plus metadata as JSON and reuse it with x265?
I used quietvoid's hdr10plus_parser tool, but it gives me an error: Quote:
|
|
#7457 | Link |
German doom9/Gleitz SuMo
Join Date: Oct 2001
Location: Germany, rural Altmark
Posts: 7,175
|
@Stereodude: I used to believe that the more threads are working on the same video, the smaller the scope of each thread gets, the less efficient the search for redundant areas will be, which will limit quality. Is that no concern for you?
|
#7458 | Link | |
Registered User
Join Date: Dec 2002
Location: Region 0
Posts: 1,436
|
Quote:
I've been using --pools 4 -F 1 for my encodes, mostly because in prior testing I saw a noticeable quality improvement from limiting the simultaneous frames to 1. I limit the pools to 4 because with frames set to 1 it only uses about 4 threads' worth of CPU. I'm going to retest whether -F 1 is still necessary with the latest builds, since they seem to have considerable image quality improvements that are leading me to rather different conclusions on my preferred settings compared with the last time I tested, over a year ago. I find AQ2 w/ SAO left enabled has a very pleasing look now, whereas I previously thought it was terrible. Maybe it's just me...
Last edited by Stereodude; 16th March 2020 at 13:32. |
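If each instance only keeps about 4 threads busy, a machine can host several instances side by side, each pinned to its own block of cores. The sketch below only does the arithmetic: it lists non-overlapping hexadecimal masks for a given core count and block size. The function name `instance_masks` is hypothetical, the 64-core figure is taken from the machine discussed earlier in the thread, and how Windows interprets such masks across NUMA nodes or processor groups is outside its scope.

```python
def instance_masks(total_cores, cores_per_instance):
    """Return one hex affinity mask per instance, using non-overlapping core blocks."""
    block = (1 << cores_per_instance) - 1      # e.g. 0b1111 for 4 cores
    width = (total_cores + 3) // 4             # hex digits needed for all cores
    masks = []
    last_start = total_cores - cores_per_instance
    for start in range(0, last_start + 1, cores_per_instance):
        masks.append(format(block << start, f"0{width}X"))
    return masks

if __name__ == "__main__":
    masks = instance_masks(64, 4)
    print(len(masks), "instances")
    for index, mask in enumerate(masks):
        print(index, mask)
```

With 64 cores and 4 per instance this yields 16 masks; the third one ends in F00, the same bit pattern as the 00000F00 mask used in the START commands earlier in the thread.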
|
#7460 | Link |
Registered User
Join Date: Dec 2002
Location: Region 0
Posts: 1,436
|
Where can I get another 64-bit Windows build of 3.3+1-f94b0d32737d? I already have a HolyWu build made with Clang 9.0.0 and want to compare a behavior I see to a different build of the same x265 version.
|