Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Capturing and Editing Video > Avisynth Development

Reply
 
Thread Tools Search this Thread Display Modes
Old 25th September 2019, 14:33   #981  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
Second test:

DGSource() 2080 Ti + x264 7700K: 4:28
LWLibavVideoSource() + x264 7700K: 4:57
Preset medium

That's about an 11% improvement on GPU versus doing everything on my fairly strong CPU.

I have always pointed out that the gain you can get will depend on how strong your processor is. And I have also pointed out that DG tools bring other things to the table, such as robust random access, GPU resizing, some CUDA filters, DGIndexNV as a useful analysis tool in its own right, etc. With over 15000 paid users, I'm not worried about justifying the existence of DG tools.

I'll do another test with UHD. It will be interesting to see how things scale with frame size.

Last edited by videoh; 25th September 2019 at 14:55.
videoh is offline   Reply With Quote
Old 25th September 2019, 14:49   #982  |  Link
Atak_Snajpera
RipBot264 author
 
Atak_Snajpera's Avatar
 
Join Date: May 2006
Location: Poland
Posts: 7,806
Quote:
I have always pointed out that the gain you can get will depend on how strong your processor is.
Not really. On my weak PC delta is ~9% in default medium x264 preset.


On your much more powerful CPU (7700k@4.2Ghz) and GPU (RTX 2080TI) delta is ~11%. Your ultra expensive ~$1200 GPU with 4xfaster hardware decoding than ancient GT 710 kepler for $40 gives you only +2 percent points extra in video encoding. That's just pathetic.

Last edited by Atak_Snajpera; 25th September 2019 at 14:54.
Atak_Snajpera is offline   Reply With Quote
Old 25th September 2019, 15:03   #983  |  Link
Atak_Snajpera
RipBot264 author
 
Atak_Snajpera's Avatar
 
Join Date: May 2006
Location: Poland
Posts: 7,806
Quote:
Originally Posted by videoh View Post
710 is garbage so your results are pointless. So sorry you're too poor to buy decent hardware.
Oh finally you said it! That's what I wanted to hear from you! Old good Neuron2 (aka banned doom9 moderator) is back!

PS. HA! got ya! I was faster this time and I managed to quote your deleted comment! Not this time amigo!

Last edited by Atak_Snajpera; 25th September 2019 at 15:05.
Atak_Snajpera is offline   Reply With Quote
Old 25th September 2019, 15:07   #984  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
I don't care what you quote, and your childish behavior doesn't move me at all. Grow up.

To be fair, I would encode on my snazzy 2080 Ti also. And the 2080 Ti is bringing me other things that are quite useful for me, such as tensor cores, massive number of CUDA cores for my physics simulations, etc. No matter how you blow smoke, I get an 11% improvement for your use case. It would be brain-dead for me not to choose DGSource() over LWLibavVideoSource().

You sound bitter because you can't afford decent hardware, although your post above suggests simple DG derangement syndrome.

Last edited by videoh; 25th September 2019 at 15:28.
videoh is offline   Reply With Quote
Old 25th September 2019, 15:16   #985  |  Link
DJATOM
Registered User
 
DJATOM's Avatar
 
Join Date: Sep 2010
Location: Ukraine, Bohuslav
Posts: 377
Actually GT 710 is almost deprecated card. You will lose driver updates soon (from April 2020). Not to mention it's only ok for decoding 8 bit AVC, while RTX cards (or GTX 1660/1660Ti cards) has bleeding edge decoder, which significantly faster and supports more formats (HEVC 8/10/12 bit, VP9 8 and 10 bit).
__________________
Me on GitHub
PC Specs: Ryzen 5950X, 64 GB RAM, RTX 2070
DJATOM is offline   Reply With Quote
Old 25th September 2019, 15:20   #986  |  Link
Atak_Snajpera
RipBot264 author
 
Atak_Snajpera's Avatar
 
Join Date: May 2006
Location: Poland
Posts: 7,806
Quote:
Originally Posted by DJATOM View Post
Actually GT 710 is almost deprecated card. You will lose driver updates soon (from April 2020). Not to mention it's only ok for decoding 8 bit AVC, while RTX cards (or GTX 1660/1660Ti cards) has bleeding edge decoder, which significantly faster and supports more formats (HEVC 8/10/12 bit, VP9 8 and 10 bit).
You do realize that ancient GT 710 is just for showing desktop right? It was not bought for games or 4k HEVC movies. Instead of changing topic we should focus on facts. 4xfaster video engine gives you in real life encoding scenarios only +2 extra percent points versus something what costs $40.

Ps. And yes If you want you can also call me poor like videoh aka Neuron2 did in his post. I'm too old for this childish talking.

Last edited by Atak_Snajpera; 25th September 2019 at 15:25.
Atak_Snajpera is offline   Reply With Quote
Old 25th September 2019, 15:41   #987  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
That's "neuron2", pal. Get it right!
videoh is offline   Reply With Quote
Old 25th September 2019, 15:42   #988  |  Link
poisondeathray
Registered User
 
Join Date: Sep 2007
Posts: 5,345
Thanks for the testing results,

@videoh -
How about LWLibavVideoSource(decoder="h264_cuvid") on the same hardware? I'd expect similar speeds as DGSource

Last edited by poisondeathray; 25th September 2019 at 15:50.
poisondeathray is offline   Reply With Quote
Old 25th September 2019, 15:51   #989  |  Link
DJATOM
Registered User
 
DJATOM's Avatar
 
Join Date: Sep 2010
Location: Ukraine, Bohuslav
Posts: 377
I'm trying to be on topic

Quote:
Originally Posted by Atak_Snajpera View Post
You do realize that ancient GT 710 is just for showing desktop right? It was not bought for games or 4k HEVC movies. Instead of changing topic we should focus on facts. 4xfaster video engine gives you in real life encoding scenarios only +2 extra percent points versus something what costs $40.
While your ancient card only good for showing desktop, RTX card (for example, mine RTX 2070) can do OpenCL/CUDA computations, offloading a lot of work (for example, NNEDI3CL or EEDI3CL) from CPU. I've tried GTX750, 760, 1050, 2060 and 2070 cards with those filters: with decent CPU GTX 750, 760 and 1050 was a bottleneck for my filtering chain (thus slowing down encoder).
If you don't have to use heavy filtering, there is not much benefit from better card or GPU decoding at all (with slower presets), but when you have to do a lot of encodes, it definitely wins some time for you. Even your tests shows a benefit from using GPU decoder.
__________________
Me on GitHub
PC Specs: Ryzen 5950X, 64 GB RAM, RTX 2070
DJATOM is offline   Reply With Quote
Old 25th September 2019, 15:53   #990  |  Link
MeteorRain
結城有紀
 
Join Date: Dec 2003
Location: NJ; OR; Shanghai
Posts: 894
Quote:
Originally Posted by Atak_Snajpera View Post
On your much more powerful CPU (7700k@4.2Ghz) and GPU (RTX 2080TI) delta is ~11%. Your ultra expensive ~$1200 GPU with 4xfaster hardware decoding than ancient GT 710 kepler for $40 gives you only +2 percent points extra in video encoding. That's just pathetic.
People buy a house not because they want to have a slightly bigger closet in their room than your condo. People buy a RTX 2080Ti not because they want to show how 1% of its processing power improves 2% of the portion.

Your $1200 GPU argument makes no sense because its just a card he has. I have a $60 worth of GTX 950 and I can get exactly the same decoding speed that he got, and thus the same result. Saying people are pathetic buying a 1 million dollar house by comparing how big the closet is, is IMHO so funny that I literally laughed out load when I saw your words.

If you really want to see how big the difference that graphics card can do, decode and scale down some UHD to 1080P or 720P and check how big of the portion those card can take over. Even worse, I've got a 8K 60fps HEVC 10bit TV broadcasting stream that my freshly bought Ryzen 3600 can barely playback at 5 fps full CPU load. You want to see how $1200 GPU works compared to your $40 card? Try those. It can decode, downscale, process the image and then output to AviSynth at > 60 fps, comparing to, let's say, 3 fps on a 6 core CPU sold this year.
__________________
Projects
x265 - Yuuki-Asuna-mod Download / GitHub
TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS
Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median
MeteorRain is offline   Reply With Quote
Old 25th September 2019, 15:58   #991  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
Quote:
Originally Posted by poisondeathray View Post
How about LWLibavVideoSource(decoder="h264_cuvid") on the same hardware? I'd expect similar speeds as DGSource
Yes, indeed:

4:30

So maybe need to retract my point about being brain-dead not to choose DGSource() here, but the main point still applies, I'd be brain-dead not to choose GPU decoding.

To answer your earlier question...I haven't run into any files that DGSource() cannot open and that I needed to do anything with. Surely, if such files became important for me or my users I would first want to add support in DGDecNV but failing that, of course would revert to an appropriate SW decoder.

Last edited by videoh; 25th September 2019 at 16:20.
videoh is offline   Reply With Quote
Old 25th September 2019, 15:59   #992  |  Link
DJATOM
Registered User
 
DJATOM's Avatar
 
Join Date: Sep 2010
Location: Ukraine, Bohuslav
Posts: 377
Quote:
Originally Posted by poisondeathray View Post
Thanks for the testing results,

@videoh -
How about LWLibavVideoSource(decoder="h264_cuvid") on the same hardware? I'd expect similar speeds as DGSource
I've measured some fps for certain sources (with vsedit's benchmark).

8bit AVC (.m2ts from BD)
Code:
Time elapsed: 1:05.696 - 547.85574001840120672568 FPS # DGSource
Time elapsed: 1:04.261 - 560.09428731059801975789 FPS # LWLibAvSource
10bit HEVC (.hevc ES)
Code:
Time elapsed: 1:17.304 - 465.58912373251257577067 FPS # DGNV
Time elapsed: 1:24.014 - 428.40612414457933709855 FPS # LWLibAvSource
__________________
Me on GitHub
PC Specs: Ryzen 5950X, 64 GB RAM, RTX 2070
DJATOM is offline   Reply With Quote
Old 25th September 2019, 16:02   #993  |  Link
Atak_Snajpera
RipBot264 author
 
Atak_Snajpera's Avatar
 
Join Date: May 2006
Location: Poland
Posts: 7,806
Quote:
Saying people are pathetic buying a 1 million dollar house by comparing how big the closet is, is IMHO so funny that I literally laughed out load when I saw your words.
Where did I write that?! Show me! You are now just spreading fake news. Not cool bro!
Atak_Snajpera is offline   Reply With Quote
Old 25th September 2019, 16:04   #994  |  Link
MeteorRain
結城有紀
 
Join Date: Dec 2003
Location: NJ; OR; Shanghai
Posts: 894
Quote:
Originally Posted by Atak_Snajpera View Post
Where did I write that?! Show me! You are now just spreading fake news. Not cool bro!
You are so funny.
__________________
Projects
x265 - Yuuki-Asuna-mod Download / GitHub
TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS
Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median
MeteorRain is offline   Reply With Quote
Old 25th September 2019, 16:04   #995  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
Quote:
Originally Posted by MeteorRain View Post
I've got a 8K 60fps HEVC 10bit TV broadcasting stream that my freshly bought Ryzen 3600 can barely playback at 5 fps full CPU load. You want to see how $1200 GPU works compared to your $40 card? Try those. It can decode, downscale, process the image and then output to AviSynth at > 60 fps, comparing to, let's say, 3 fps on a 6 core CPU sold this year.
Amen, bro!
videoh is offline   Reply With Quote
Old 25th September 2019, 16:08   #996  |  Link
Atak_Snajpera
RipBot264 author
 
Atak_Snajpera's Avatar
 
Join Date: May 2006
Location: Poland
Posts: 7,806
Quote:
Originally Posted by MeteorRain View Post
You are so funny.
Unfortunately I can't say the same about your lies. They are not funny at all.
Atak_Snajpera is offline   Reply With Quote
Old 25th September 2019, 16:13   #997  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
Cool down Atak. You did use the epithet "pathetic". MeteorRain was merely analogizing: obsessing over a video decoder when the card brings so much more to the table is like obsessing over a closet in a house. I suppose you did understand that but calling someone a liar is so much cooler, right? From the guy that is too old to be childish!

To be honest, I'm having trouble understanding what your basic point is here. Is it that using GPU power is useless? Is it that 2080 Ti etc. are poor value? DG tools suck? DG sucks? What is your overall point?

Last edited by videoh; 25th September 2019 at 16:19.
videoh is offline   Reply With Quote
Old 25th September 2019, 16:17   #998  |  Link
DJATOM
Registered User
 
DJATOM's Avatar
 
Join Date: Sep 2010
Location: Ukraine, Bohuslav
Posts: 377
Quote:
Originally Posted by MeteorRain View Post
People buy a house not because they want to have a slightly bigger closet in their room than your condo. People buy a RTX 2080Ti not because they want to show how 1% of its processing power improves 2% of the portion.

Your $1200 GPU argument makes no sense because its just a card he has. I have a $60 worth of GTX 950 and I can get exactly the same decoding speed that he got, and thus the same result. Saying people are pathetic buying a 1 million dollar house by comparing how big the closet is, is IMHO so funny that I literally laughed out load when I saw your words.

If you really want to see how big the difference that graphics card can do, decode and scale down some UHD to 1080P or 720P and check how big of the portion those card can take over. Even worse, I've got a 8K 60fps HEVC 10bit TV broadcasting stream that my freshly bought Ryzen 3600 can barely playback at 5 fps full CPU load. You want to see how $1200 GPU works compared to your $40 card? Try those. It can decode, downscale, process the image and then output to AviSynth at > 60 fps, comparing to, let's say, 3 fps on a 6 core CPU sold this year.
Yeah, good real word scenario. I've tried to decode such stream on my card and it's playing smoothly without any stutters while stuttering a lot with SW decoder and 3900X (40-60% utilization, so a believe it hits PCIe Gen3 x16 max throughput capacity).

MI from that video: https://pastebin.com/xB8CHFiS.
__________________
Me on GitHub
PC Specs: Ryzen 5950X, 64 GB RAM, RTX 2070

Last edited by DJATOM; 25th September 2019 at 16:21.
DJATOM is offline   Reply With Quote
Old 25th September 2019, 16:25   #999  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
Quote:
Originally Posted by DJATOM View Post
so a believe it hits PCIe Gen3 x16 max throughput capacity
That's a great point. If GPU processing has an achilles heel, that is it. We can look forward to greater bandwidth with future generations. Also, to mitigate this I have been experimenting with my CUDASynth framework, which allows many full frame transfers over PCIe to be eliminated for a script with multiple filters. You can read about it at the DG forum.
videoh is offline   Reply With Quote
Old 25th September 2019, 16:52   #1000  |  Link
MeteorRain
結城有紀
 
Join Date: Dec 2003
Location: NJ; OR; Shanghai
Posts: 894
Quote:
Originally Posted by DJATOM View Post
SW decoder and 3900X (40-60% utilization, so a believe it hits PCIe Gen3 x16 max throughput capacity).
That means the video renderer can't keep up because of the limited PCIe bandwidth? Very interesting.

Maybe try reducing the frame rate down to 30 on MKV, and pin the video player process to half of the cores, and see how that works? That should tell us whether it's due to the bandwidth or due to, like, CPU scheduler.
__________________
Projects
x265 - Yuuki-Asuna-mod Download / GitHub
TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS
Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median
MeteorRain is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 11:10.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.