Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#9761 | Link |
Registered User
Join Date: May 2005
Posts: 1,556
|
I am about to buy an AMD 99503D, which supports AVX-512 very well. I wanted to prepare a command line with --asm avx512, but fail to see avx512 as a valid string in the documentation any more. Are there still versions of x265 that support this?
__________________
Gorgeous, delicious, deculture! |
![]() |
![]() |
![]() |
#9762 | Link |
German doom9/Gleitz SuMo
Join Date: Oct 2001
Location: Germany, rural Altmark
Posts: 7,223
|
Support for AVX512 certainly still exists, I just read it in the sources. But it will not be auto-detected. So add the parameter explicitly; if it is not supported either by hardware or by your OS, or being removed from the CLI without my knowledge, x265 will tell you when you execute it.
|
![]() |
![]() |
![]() |
#9764 | Link |
Registered User
Join Date: Jan 2025
Posts: 65
|
I also have a 9950X3D, and I have this:- --asm avx512 in my x265 command line.
The documentation needs to be updated.
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 |
![]() |
![]() |
![]() |
#9765 | Link | |
Registered User
Join Date: May 2005
Posts: 1,556
|
Quote:
![]() Can you tell me something about the performance boost using AVX512 gives? There are many charts on the 99050X3D, and even on them using x265, but never with AVX512 being used. I feel it will make quite a difference.
__________________
Gorgeous, delicious, deculture! |
|
![]() |
![]() |
![]() |
#9766 | Link |
German doom9/Gleitz SuMo
Join Date: Oct 2001
Location: Germany, rural Altmark
Posts: 7,223
|
In general: In the past, the difference between AVX2 and AVX512 used to be marginal, and AVX512 has a higher risk of thermal throttling.
I can't tell you current results though, sorry. Try to search this thread for AVX512, I believe there have been reports a few months ago. Like here |
![]() |
![]() |
![]() |
#9767 | Link | |
Registered User
Join Date: Jan 2025
Posts: 65
|
Quote:
I never noticed any thermal throttling with the 7950X, and I would suggest that the 9950X3D & the 7970X won't either, but I do have pretty good cooling. But one thing I have noticed is 2 major typos in your post's, when referencing the 9950X3D !!!! ![]()
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 |
|
![]() |
![]() |
![]() |
#9768 | Link |
Big Bit Savings Now !
Join Date: Feb 2007
Location: close to the wall
Posts: 1,911
|
On a i9-11900K I found a few percent speed gain using -- asm avx512, limited by some of Intels safety features (limiting to base clock)
and the available cooling of a 125W CPU+115W GPU notebook system, although with plenty air intake, exhaust and 2x 5000rpm fan). On a real desktop AMD 7950X/9950X I would expect considerable speed gains in the range of >25%, so well worth it.
__________________
"To bypass shortcuts and find suffering...is called QUALity" (Die toten Augen von Friedrichshain) "Data reduction ? Yep, Sir. We're that issue working on. Synce invntoin uf lingöage..." |
![]() |
![]() |
![]() |
#9770 | Link |
Registered User
Join Date: May 2005
Posts: 1,556
|
That's the beauty if the 9950X3D! Allegedly it includes an extremely efficient implementation of AVX512, where the CPU doesn't reduce clockspeed for them, and only consumes a few Watt extra on AVX512. So, this should go endlessly better on the 9950X3D.
__________________
Gorgeous, delicious, deculture! |
![]() |
![]() |
![]() |
#9772 | Link | |
Big Bit Savings Now !
Join Date: Feb 2007
Location: close to the wall
Posts: 1,911
|
Quote:
How many % speed advantage AVX2 vs AVX512 in the described scenario do 7950X/9950X owners note ?
__________________
"To bypass shortcuts and find suffering...is called QUALity" (Die toten Augen von Friedrichshain) "Data reduction ? Yep, Sir. We're that issue working on. Synce invntoin uf lingöage..." |
|
![]() |
![]() |
![]() |
#9774 | Link | |
Lost my old account :(
Join Date: Jul 2017
Posts: 370
|
Quote:
|
|
![]() |
![]() |
![]() |
#9775 | Link | |
Registered User
Join Date: May 2021
Posts: 1
|
Quote:
on 2160p: CTU 32, ref 4, subme 4, rd 4, rect, no-amp, aq-mode 2, tu-intra-depth 3, tu-inter-depth 3, max-merge 5, crf 20 => +15.5% CTU 64, ref 5, subme 5, rd 4, rect, no-amp, aq-mode 2, tu-intra-depth 4, tu-inter-depth 4, max-merge 5, crf 18 => +36% Temps are manageable on air cooling. |
|
![]() |
![]() |
![]() |
#9776 | Link | |
Registered User
Join Date: Aug 2024
Posts: 521
|
Quote:
|
|
![]() |
![]() |
![]() |
#9777 | Link |
Registered User
Join Date: Aug 2024
Posts: 521
|
x265 will produce nonconformity bitstream when a very sudden change in chroma channel happens and triggered weighted prediction.
https://bitbucket.org/multicoreware/...roma_offset_lx (Finally remembered my bitbucket account) Related issue (4 years ago, when I didn't know the root cause): https://bitbucket.org/multicoreware/x265_git/issues/582 Many hardware decoders will fail to decode such frame, and result in corrupted output. HM will abort due to assertion. Code:
int pred = (128 - ((128 * wp[plane].inputWeight) >> (wp[plane].log2WeightDenom))); int deltaChroma = (wp[plane].inputOffset - pred); WRITE_SVLC(deltaChroma, "delta_chroma_offset_lX"); Last edited by Z2697; Yesterday at 23:11. |
![]() |
![]() |
![]() |
#9778 | Link | |
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 5,014
|
Quote:
The per clock throughput improvements were solid, but the throttling reductions in instructions per second nearly cancelled those out. An implementation that maintains IPS with the IPC gains would be quite impressive. |
|
![]() |
![]() |
![]() |
#9780 | Link | |
Registered User
Join Date: May 2005
Posts: 1,556
|
Quote:
![]() I heard +37% elsewhere (but didn't mention that yet, as I was looking for objective feedback). So, it now seems that number was quite accurate!
__________________
Gorgeous, delicious, deculture! |
|
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|