Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#9661 | Link | ||
Lunarian
Join Date: Dec 2024
Posts: 15
|
Quote:
![]() Whenever I get it working I'll update the .diff and maybe build some bins for anyone that wants to try it. Just trying to leave things nicer for the next person. I do appreciate having a diff to work from it's really helpful. Quote:
Last edited by LunaRabbit; 16th December 2024 at 14:24. Reason: quote full post with links to images since pagnitation kicked in. |
||
![]() |
![]() |
![]() |
#9663 | Link | |
Registered User
Join Date: Aug 2024
Posts: 370
|
Quote:
As for the metric I think it works OK to evaluate the similarity of the "corresponding qcomp and cutree-strength pairs", it's just not valid when comparing different pairs. I used my eyes as well, of course. Just the summary report is easier to post. Last edited by Z2697; 16th December 2024 at 14:32. |
|
![]() |
![]() |
![]() |
#9664 | Link | ||
Lunarian
Join Date: Dec 2024
Posts: 15
|
Quote:
Quote:
I do appreciate the testing and providing the helpful chart. Didn't mean to imply otherwise. ![]() |
||
![]() |
![]() |
![]() |
#9665 | Link | |
Registered User
Join Date: Aug 2024
Posts: 370
|
Quote:
|
|
![]() |
![]() |
![]() |
#9666 | Link |
Registered User
Join Date: Oct 2002
Location: France
Posts: 2,388
|
About mbtree and x264.
Disabling mbtree is recommended when you target Blu-Ray, according mp3dom results and tests. But i think it's the only case (or of course, cases using very very similar encode parameters). The issue seems to be the small (1s) keyint value, not working well with mbtree. With standard [250] keyint value, mbtree is doing a proper job (still according mp3dom).
__________________
My github. Last edited by jpsdr; 16th December 2024 at 23:51. |
![]() |
![]() |
![]() |
#9667 | Link | |
Registered User
Join Date: Jun 2024
Location: South Africa
Posts: 260
|
Quote:
|
|
![]() |
![]() |
![]() |
#9668 | Link |
Registered User
Join Date: Aug 2024
Posts: 370
|
Hmm, something doesn't feel right.
Code:
source/encoder/slicetype.cpp void Lookahead::cuTree(Lowres **frames, int numframes, bool bIntra) { ... double totalDuration = 0.0; for (int j = 0; j <= numframes; j++) totalDuration += (double)m_param->fpsDenom / m_param->fpsNum; double averageDuration = totalDuration / (numframes + 1); Me, being a incompetent wannabe programmer, also kind of narrowed things down (or, rather, "confirmed") to cutree after, what, like 4 days? And is now moderately sure the quoted codes (in combination with some compiler / ISA optimization) caused the inconsistency. (only averageDuration is used in the following codes) As x265 itself does not have variable framerate awareness, this block code is unnecessary, I think, all these code will just result us back to "(double)m_param->fpsDenom / m_param->fpsNum", theoretically, if not the weirdness of FP math is happening, you know, the .1+.2 != .3 and non-associative stuff. Code:
double averageDuration = (double) m_param->fpsDenom / m_param->fpsNum; Alternatively you can put "#pragma GCC novec" or other compiler's equivalent right before the for loop. (This retains the consistency with "pre-this-modification" "nocona" (default -march of GCC) / "SSE3" version) But this is compiler specific. I presonally think the former "solution" is more elegant. The "inconsistency between bitstreams encoded by different compilers produced binaries" still exists after this modification. (e.g. GCC vs Clang) Some suspicious executables (10bit only) https://files.catbox.moe/nhla9t.7z Am I missing something? Please help me. plzzzzzzzz Last edited by Z2697; 25th December 2024 at 10:33. |
![]() |
![]() |
![]() |
#9670 | Link | |
Registered User
Join Date: Jun 2024
Location: South Africa
Posts: 260
|
Quote:
![]() |
|
![]() |
![]() |
![]() |
#9671 | Link | |
Registered User
Join Date: Aug 2024
Posts: 370
|
Quote:
|
|
![]() |
![]() |
![]() |
#9673 | Link | |
Registered User
Join Date: Apr 2017
Location: Hungary
Posts: 9
|
Quote:
The 1 minute sample was cut from an UHD Blu-Ray (36th Precinct) and was encoded in 4K using the same version of Handbrake with identical settings (preset slow) on both platforms. M4 Pro: 4m 20s 5900X: 5m 25s Quite impresssive. |
|
![]() |
![]() |
![]() |
#9674 | Link | |
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,530
|
Quote:
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 Last edited by Sagittaire; 30th December 2024 at 06:04. |
|
![]() |
![]() |
![]() |
#9675 | Link |
Registered User
Join Date: Mar 2007
Posts: 103
|
Of course, but the 9950X has a 170W TDP, and the M4 Pro is what, 40 W at max, with 6 less performance cores than the 9950X.
And it seems all the latest arm64 optimizations are again stuck on the x265-devel mailing list. |
![]() |
![]() |
![]() |
#9676 | Link | |
Registered User
Join Date: Apr 2017
Location: Hungary
Posts: 9
|
Quote:
|
|
![]() |
![]() |
![]() |
#9677 | Link | |
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,530
|
Quote:
I create codec benchmark for make that and 9950X at stock has 74% more performance than 5950X for x265. I don't test 5900X but 5950X have theoricaly 20% more perfomance than 5900X. In correct H265 benchmark (all CPU thread saturation) the 9950X will produce 110% more performance than 5900X. When Techpowerup use correct CPU saturation benchmark like cinebench, blender or stockfish, you evaluate the correct CPU power Stockfisk: 5900X: 14.52 Mips 9950X: 30.78 Mips (+111%) Blender: 5900X: 114.9 s 9950X: 56 s (+105%) V-Ray: 5900X: 21538 9950X: 48899 (+127%) ![]()
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 Last edited by Sagittaire; 30th December 2024 at 23:36. |
|
![]() |
![]() |
![]() |
#9678 | Link | |
Registered User
Join Date: Aug 2018
Location: Germany
Posts: 139
|
Quote:
|
|
![]() |
![]() |
![]() |
#9679 | Link |
Registered User
Join Date: Feb 2007
Location: Sweden
Posts: 492
|
x265 v4.1+62-441e1e4
Built on December 31 2024, GCC 14.2.0 Win32/64 / 8bit+10bit+12bit https://bitbucket.org/multicoreware/.../branch/master DL : https://www.mediafire.com/file/6nh9e7dfb72b3pi
__________________
Do NOT re-post any of my Mediafire links. Download & re-host the content(s) if you want to share it somewhere else. Last edited by Barough; 31st December 2024 at 23:29. |
![]() |
![]() |
![]() |
#9680 | Link | |
Lost my old account :(
Join Date: Jul 2017
Posts: 352
|
Quote:
Most software does not have perfect parallelization scaling, and in most cases were that is the case, i.e. 3d-rendering and simulations etc, those loads usually gain more to be calculated on GPUs anyway. And although I think it makes perfect sense to test both cases here, cause you can just run two encodes at the same time even though you dont wanna start doing chunk-encoding to get "more" out of your CPU. Its not like we have a history of starting to run multiple parallel benchmark of a software cause we dont find the thread-scaling good enough when that the results dont see the full "potential" of the CPU, cause this argument can be made for most of them (audio encoding, compression, compiling etc). Last edited by excellentswordfight; 2nd January 2025 at 14:43. |
|
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|