Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Hardware & Software > Software players
Register FAQ Calendar Today's Posts Search

Reply
 
Thread Tools Search this Thread Display Modes
Old 21st August 2015, 03:12   #32621  |  Link
Zachs
Suptitle, MediaPlayer.NET
 
Join Date: Nov 2001
Posts: 1,721
Quote:
Originally Posted by x7007 View Post
Hey, how can I use the OpenCL NNEDI3 ? with CUVID or DXVA2 write-back ?
Can you explain .
Thanks
My reply was specifically for madshi only. I have no idea how to answer your question for madVR as I don't use it. If you mean to ask how you can use OpenCL NNEDI3 with MPDN, kindly post in the MPDN thread instead.

Cheers.
Zachs is offline   Reply With Quote
Old 21st August 2015, 07:58   #32622  |  Link
madshi
Registered Developer
 
Join Date: Sep 2006
Posts: 9,140
Quote:
Originally Posted by Zachs View Post
Well FWIW on NVidia at least, anything less than 128 neurons (i.e. 64 and below), the SM5.0 NNEDI3 trumps the OpenCL version using avoid branches optimization. On Intel with P4600 (which is still very modern), the SM5.0 version is ~2x faster than the OpenCL version across the board with the "Alt weight access + scalar small code" optimization option. For example, upscaling 720x576 to 1080p the P4600 renders in 31ms (16/16 neurons) making it perfectly acceptable to watch anything 30fps and below. With the OpenCL version, it needs 58ms. At the end of the day it really comes down to how well the driver/compiler maps the code to its underlying hardware.
From reading the MPDN thread my impression was that older NVidia GPUs were roughly the same with SM5.0 and OpenCL and that newer NVidia GPUs (Maxwell) still performed faster with OpenCL. Has this changed with recent drivers? I'm aware of that Intel likes SM5.0 much more. Thanks...
madshi is offline   Reply With Quote
Old 21st August 2015, 08:38   #32623  |  Link
Sunset1982
Registered User
 
Join Date: Sep 2014
Posts: 280
@xAURUS:

Quote:
Originally Posted by Xaurus View Post
Ok so I tried to figure out the mysterious dropped frames in Win 10.

Tried to replace madvr with EVR, no difference. Tried moving back to 32-bit mpc-hc/lav/madvr. No difference.
Tried using ReClock. No difference. Went back to madvr 88.16. No difference.

The only thing that actually works is to restart the computer, then typically the first movie I watch has no mysterious dropped frames.
After that, it's the same thing.

So I guess this must be something with Win 10. In either case, it's extremely annoying.
did you try the latest insider build 10525 of Win10? maybe this works better?
__________________
Intel i5 6600, 16 GB DDR4, AMD Vega RX56 8 GB, Windows 10 x64, Kodi DS Player 17.6, MadVR (x64), LAV Filters (x64), XySubfilter .746 (x64)
LG 4K OLED (65C8D), Denon X-4200 AVR, Dali Zensor 5.1 Set
Sunset1982 is offline   Reply With Quote
Old 21st August 2015, 08:41   #32624  |  Link
aufkrawall
Registered User
 
Join Date: Dec 2011
Posts: 1,812
128 neurons NNEDI3 quadrupling on GTX 980 with 355.60
OpenCL (seems to be working fine with Zachs' new implementation):
GPU load: ~53%
clock: 1421Mhz
power consumption: ~72%
rendering time: ~25.6ms

SM 5.0 Avoid Branches:
GPU load: ~63%
clock: 1421Mhz
power consumption: ~67%
rendering time: ~31.8ms

Prefer Vector:
GPU load: ~80%
clock: 1421Mhz
power consumption: ~80%
rendering time: ~39.5ms

100% power consumption would be ~180 Watts.
aufkrawall is offline   Reply With Quote
Old 21st August 2015, 08:46   #32625  |  Link
madshi
Registered Developer
 
Join Date: Sep 2006
Posts: 9,140
So OpenCL does seem to be still faster with Maxwell GPUs, right?

Since OpenCL vs. SM5.0 is like compute power vs shader power (so to say), I suppose that the more the GPU architecture is optimized for compute performance, the higher the probability that OpenCL will be faster than SM5.0. Which also means that newer GPU architectures are likely to favor OpenCL. But then, a lot depends on compilers, too. And Intel seems to be quite different, in any case.
madshi is offline   Reply With Quote
Old 21st August 2015, 08:53   #32626  |  Link
aufkrawall
Registered User
 
Join Date: Dec 2011
Posts: 1,812
Quote:
Originally Posted by madshi View Post
So OpenCL does seem to be still faster with Maxwell GPUs, right?
Yes. However, the advantage naturally shrinks when using less neurons and further effects like AdpativeSharpen or SuperRes. So, in many situations there probably won't be a difference "perceived".

Now questions is, does the way Zachs found for OpenCL look secure for the future?
aufkrawall is offline   Reply With Quote
Old 21st August 2015, 09:09   #32627  |  Link
Arm3nian
Registered User
 
Join Date: Jul 2014
Location: Las Vegas
Posts: 177
Can someone explain why d3d9 has to be used? Why not 10, 11 or 12? Does dxva not support those or something? Isn't this microsoft's fault for not updating their stuff instead of nvidia removing backward compatibility or incompatibility in general.

Also, I don't think new gpu's like the 980ti or 950 even have drivers, on any operating system, that support d3d9 interop.
Arm3nian is offline   Reply With Quote
Old 21st August 2015, 10:55   #32628  |  Link
Xaurus
Registered User
 
Join Date: Jun 2011
Posts: 288
Quote:
Originally Posted by Sunset1982 View Post
@xAURUS:



did you try the latest insider build 10525 of Win10? maybe this works better?
I did not try that yet. But I am surprised, if this is a Win 10 issue there should be much more info on this already. But I can't find much.

What I will do today is to downgrade the Nvidia drivers to the version that can use NNEDI3.

Right now, my HTPC feels useless. Can't watch anything without pulling my hair out because of these mysterious dropped frames.
__________________
SETUP: Win 10/MPC-HC/LAV/MadVR
HARDWARE: Fractal Design Node 804 | Xeon E3-1260L v5 | Supermicro X11SSZ-TLN4F | Samsung 2x8GB DDR4 ECC | Samsung 850 EVO 1TB | MSI GTX 1650 Super | EVGA G2 750
Xaurus is offline   Reply With Quote
Old 21st August 2015, 12:59   #32629  |  Link
kalston
Registered User
 
Join Date: May 2011
Posts: 164
Quote:
Originally Posted by Arm3nian View Post
Also, I don't think new gpu's like the 980ti or 950 even have drivers, on any operating system, that support d3d9 interop.
Well my 980 ti does NNEDI3 256 neurons for 720p to 1440p with ~25ms render times on my Win 7 64 machine so I'm pretty sure that's incorrect.
kalston is offline   Reply With Quote
Old 21st August 2015, 14:04   #32630  |  Link
Zachs
Suptitle, MediaPlayer.NET
 
Join Date: Nov 2001
Posts: 1,721
Quote:
Originally Posted by madshi View Post
So OpenCL does seem to be still faster with Maxwell GPUs, right?

Since OpenCL vs. SM5.0 is like compute power vs shader power (so to say), I suppose that the more the GPU architecture is optimized for compute performance, the higher the probability that OpenCL will be faster than SM5.0. Which also means that newer GPU architectures are likely to favor OpenCL. But then, a lot depends on compilers, too. And Intel seems to be quite different, in any case.
Well it would seem Maxwell is the odd one out - the other simpler explanation could be that they simply haven't optimised the driver as much as they have for the more mature architectures.
Zachs is offline   Reply With Quote
Old 21st August 2015, 14:06   #32631  |  Link
madshi
Registered Developer
 
Join Date: Sep 2006
Posts: 9,140
Well, it's Maxwell and AMD. So basically those who're strong at computing, no? But I don't really know, I'm just guessing...
madshi is offline   Reply With Quote
Old 21st August 2015, 14:10   #32632  |  Link
nevcairiel
Registered Developer
 
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,346
Quote:
Originally Posted by Zachs View Post
Well it would seem Maxwell is the odd one out - the other simpler explanation could be that they simply haven't optimised the driver as much as they have for the more mature architectures.
If by "odd one out" you mean OpenCL is stronger, then thats expected as NVIDIA changed the architecture in Maxwell to cover a lot of ground in computing.
__________________
LAV Filters - open source ffmpeg based media splitter and decoders
nevcairiel is offline   Reply With Quote
Old 21st August 2015, 14:29   #32633  |  Link
Zachs
Suptitle, MediaPlayer.NET
 
Join Date: Nov 2001
Posts: 1,721
By "odd one out" I mean it is different to Fermi and Kepler. Fermi was strong in computing too.
But, is shader not computing? Is it not possible for Nvidia to map the shader version to the OpenCL version in its entirety in the future with driver optimizations?
We don't know any of that - all we're doing here is pure speculation, and a bit of a waste of time to be honest since there's no way to find out, or tell the future.
Zachs is offline   Reply With Quote
Old 21st August 2015, 18:18   #32634  |  Link
Asmodian
Registered User
 
Join Date: Feb 2002
Location: San Jose, California
Posts: 4,407
Quote:
Originally Posted by Arm3nian View Post
Also, I don't think new gpu's like the 980ti or 950 even have drivers, on any operating system, that support d3d9 interop.
Using 853.30 or .49 d3d9 interop worked fine on my 980ti in Windows 10.
__________________
madVR options explained
Asmodian is offline   Reply With Quote
Old 21st August 2015, 20:41   #32635  |  Link
amayra
Quality Checker
 
amayra's Avatar
 
Join Date: Aug 2013
Posts: 285
i manage to make madvr work in Wine 1.7.50 somehow but is crash later i will post log after they release "CSMT" and "staging" edition
__________________
I love Doom9
amayra is offline   Reply With Quote
Old 21st August 2015, 21:33   #32636  |  Link
Akeno
Registered User
 
Join Date: Jul 2015
Location: Seattle, Washington
Posts: 53
This has been spanning on for the past couple of pages but can anyone give a short run down on what problems madVR has with windows 10?
So far, it sounds like nnedi3 still doesn't work on nvidia cards due to missing opencl implementation and something about d3d9 interop although I don't know what effects that has on madVR.
Akeno is offline   Reply With Quote
Old 21st August 2015, 21:42   #32637  |  Link
Arm3nian
Registered User
 
Join Date: Jul 2014
Location: Las Vegas
Posts: 177
Quote:
Originally Posted by Asmodian View Post
Using 853.30 or .49 d3d9 interop worked fine on my 980ti in Windows 10.
Thanks worked for me also. Now that I've tried it, super-xbr looks identical to nnedi3 in chroma upscaling and image chroma and luma doubling watching bluray on a 4k screen. The only difference is that super-xbr doesn't turn my room into an oven.
Arm3nian is offline   Reply With Quote
Old 21st August 2015, 22:54   #32638  |  Link
noee
Registered User
 
Join Date: Jan 2007
Posts: 530
Quote:
Originally Posted by amayra View Post
i manage to make madvr work in Wine 1.7.50 somehow but is crash later i will post log after they release "CSMT" and "staging" edition
Start another thread, maybe in the Linux section....I'm trying the same thing....
__________________
Win7Ult || RX560/4G || Ryzen 5
noee is offline   Reply With Quote
Old 22nd August 2015, 14:54   #32639  |  Link
Fadeout
Registered User
 
Join Date: Dec 2009
Posts: 150
Quick question: does anyone know why MadVR deinterlacing looks much better than Yadif on VLC and other applications?

Windows Media Player gives a similar result to MadVR, it's only Yadif that makes things less smooth. And this is a problem because I'm trying to encode the video to progressive, but the only options for the decoding only produce results that look like Yadif, and so everything is less smooth.
Fadeout is offline   Reply With Quote
Old 22nd August 2015, 14:59   #32640  |  Link
nussman
Registered User
 
Join Date: Nov 2010
Posts: 238
madVR can use GPU Deinterlacing like WMP does.
nussman is offline   Reply With Quote
Reply

Tags
direct compute, dithering, error diffusion, madvr, ngu, nnedi3, quality, renderer, scaling, uhd upscaling, upsampling


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 06:22.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.