Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
|
|
#9661 | Link |
|
Registered User
Join Date: Oct 2002
Location: France
Posts: 2,635
|
About mbtree and x264.
Disabling mbtree is recommended when you target Blu-Ray, according mp3dom results and tests. But i think it's the only case (or of course, cases using very very similar encode parameters). The issue seems to be the small (1s) keyint value, not working well with mbtree. With standard [250] keyint value, mbtree is doing a proper job (still according mp3dom).
__________________
My github. Last edited by jpsdr; 16th December 2024 at 23:51. |
|
|
|
|
|
#9662 | Link | |
|
Registered User
Join Date: Jun 2024
Location: South Africa
Posts: 615
|
Quote:
|
|
|
|
|
|
|
#9663 | Link |
|
Registered User
Join Date: Aug 2024
Location: Between my two ears
Posts: 864
|
Hmm, something doesn't feel right.
Code:
source/encoder/slicetype.cpp
void Lookahead::cuTree(Lowres **frames, int numframes, bool bIntra)
{
...
double totalDuration = 0.0;
for (int j = 0; j <= numframes; j++)
totalDuration += (double)m_param->fpsDenom / m_param->fpsNum;
double averageDuration = totalDuration / (numframes + 1);
Me, being a incompetent wannabe programmer, also kind of narrowed things down (or, rather, "confirmed") to cutree after, what, like 4 days? And is now moderately sure the quoted codes (in combination with some compiler / ISA optimization) caused the inconsistency. (only averageDuration is used in the following codes) As x265 itself does not have variable framerate awareness, this block code is unnecessary, I think, all these code will just result us back to "(double)m_param->fpsDenom / m_param->fpsNum", theoretically, if not the weirdness of FP math is happening, you know, the .1+.2 != .3 and non-associative stuff. Code:
double averageDuration = (double) m_param->fpsDenom / m_param->fpsNum; Alternatively you can put "#pragma GCC novec" or other compiler's equivalent right before the for loop. (This retains the consistency with "pre-this-modification" "nocona" (default -march of GCC) / "SSE3" version) But this is compiler specific. I presonally think the former "solution" is more elegant. The "inconsistency between bitstreams encoded by different compilers produced binaries" still exists after this modification. (e.g. GCC vs Clang) Some suspicious executables (10bit only) https://files.catbox.moe/nhla9t.7z Am I missing something? Please help me. plzzzzzzzz Last edited by Z2697; 25th December 2024 at 10:33. |
|
|
|
|
|
#9665 | Link | |
|
Registered User
Join Date: Jun 2024
Location: South Africa
Posts: 615
|
Quote:
|
|
|
|
|
|
|
#9666 | Link | |
|
Registered User
Join Date: Aug 2024
Location: Between my two ears
Posts: 864
|
Quote:
|
|
|
|
|
|
|
#9668 | Link | |
|
Registered User
Join Date: Apr 2017
Location: Hungary
Posts: 9
|
Quote:
The 1 minute sample was cut from an UHD Blu-Ray (36th Precinct) and was encoded in 4K using the same version of Handbrake with identical settings (preset slow) on both platforms. M4 Pro: 4m 20s 5900X: 5m 25s Quite impresssive. |
|
|
|
|
|
|
#9669 | Link | |
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,546
|
Quote:
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 Last edited by Sagittaire; 30th December 2024 at 06:04. |
|
|
|
|
|
|
#9670 | Link |
|
Registered User
Join Date: Mar 2007
Posts: 113
|
Of course, but the 9950X has a 170W TDP, and the M4 Pro is what, 40 W at max, with 6 less performance cores than the 9950X.
And it seems all the latest arm64 optimizations are again stuck on the x265-devel mailing list. |
|
|
|
|
|
#9671 | Link | |
|
Registered User
Join Date: Apr 2017
Location: Hungary
Posts: 9
|
Quote:
Screenshot 2024-12-30 165716.png |
|
|
|
|
|
|
#9672 | Link | |
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,546
|
Quote:
I create codec benchmark for make that and 9950X at stock has 74% more performance than 5950X for x265. I don't test 5900X but 5950X have theoricaly 20% more perfomance than 5900X. In correct H265 benchmark (all CPU thread saturation) the 9950X will produce 110% more performance than 5900X. When Techpowerup use correct CPU saturation benchmark like cinebench, blender or stockfish, you evaluate the correct CPU power Stockfisk: 5900X: 14.52 Mips 9950X: 30.78 Mips (+111%) Blender: 5900X: 114.9 s 9950X: 56 s (+105%) V-Ray: 5900X: 21538 9950X: 48899 (+127%)
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 Last edited by Sagittaire; 30th December 2024 at 23:36. |
|
|
|
|
|
|
#9673 | Link | |
|
Registered User
Join Date: Aug 2018
Location: Germany
Posts: 153
|
Quote:
|
|
|
|
|
|
|
#9674 | Link |
|
Registered User
Join Date: Feb 2007
Location: Sweden
Posts: 505
|
x265 v4.1+62-441e1e4
Built on December 31 2024, GCC 14.2.0 Win32/64 / 8bit+10bit+12bit https://bitbucket.org/multicoreware/.../branch/master DL : https://www.mediafire.com/file/6nh9e7dfb72b3pi
__________________
Do NOT re-post any of my Mediafire links. Download & re-host the content(s) if you want to share it somewhere else. Last edited by Barough; 31st December 2024 at 23:29. |
|
|
|
|
|
#9675 | Link | |
|
Lost my old account :(
Join Date: Jul 2017
Posts: 394
|
Quote:
Most software does not have perfect parallelization scaling, and in most cases were that is the case, i.e. 3d-rendering and simulations etc, those loads usually gain more to be calculated on GPUs anyway. And although I think it makes perfect sense to test both cases here, cause you can just run two encodes at the same time even though you dont wanna start doing chunk-encoding to get "more" out of your CPU. Its not like we have a history of starting to run multiple parallel benchmark of a software cause we dont find the thread-scaling good enough when that the results dont see the full "potential" of the CPU, cause this argument can be made for most of them (audio encoding, compression, compiling etc). Last edited by excellentswordfight; 2nd January 2025 at 14:43. |
|
|
|
|
|
|
#9676 | Link | |
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,546
|
Quote:
Yes but time to encode wav to mp3 with lame is not really a problem. Encoding video source can take several hours. And multipart or ABR Ladder to saturate CPU are simply well-known techniques in the professional world. For exemple ABR Ladder is full option include directly in x265 codec. Make multiscession encoding is option too and directly in handbrake. Why buy a $600 CPU to do the fastest possible encoding in AOM AV1, when you can do it 4 times faster with a $200 CPU using the right encoding technique.
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 Last edited by Sagittaire; 2nd January 2025 at 20:52. |
|
|
|
|
|
|
#9677 | Link | |
|
Registered User
Join Date: Aug 2024
Location: Between my two ears
Posts: 864
|
Quote:
Since m4 pro only comes with severely overpriced memory, if you are planning on only use the CPU to do "work" (but why) that's just not worth it. Or maybe it's the other way around, the basic model of mac mini m4 is underpriced? You know, like the razor and blades model? I don't know, I don't own a mac. (wait a minute, m4 pro has 2 variants? and they are very different errrr it's a great cpu but apple is just confusing) It's a great chip, but it doesn't come as just a chip, I just don't want to buy it this way. (and the "large scale" customers are likely don't want as well) It seems like even a m4 in basic model mac mini has more transistors than 9950X (although with integrated memory and GPU), and with the best process node at the time, I'm not surprised that it's performant and efficient, and the best model (m4 max 12p+4e) can come close to 9950X with less power draw. Physics works, how surprising. I have to say this is very out of topic now. Last edited by Z2697; 3rd January 2025 at 10:25. |
|
|
|
|
|
|
#9679 | Link |
|
Registered User
Join Date: Feb 2007
Location: Sweden
Posts: 505
|
x265 v4.1+78-5223ea7
Built on January 03 2025, GCC 14.2.0 Win32/64 / 8bit+10bit+12bit https://bitbucket.org/multicoreware/.../branch/master DL : https://www.mediafire.com/file/86pd5zd03csrk67
__________________
Do NOT re-post any of my Mediafire links. Download & re-host the content(s) if you want to share it somewhere else. |
|
|
|
|
|
#9680 | Link |
|
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 3,065
|
Finally I have a working build with --frame-dup working and I'd like to play with it a bit, as I mostly encode animes.
What value of --dup-threshold should be ok? The default is 70 but it doesn't tell too much to me. Is there a way to calculate the "difference" between two frames in a way similar to what x265 does?
__________________
@turment on Telegram |
|
|
|
![]() |
| Thread Tools | Search this Thread |
| Display Modes | |
|
|