Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
6th July 2019, 03:53 | #56781 | Link |
Registered User
Join Date: May 2013
Posts: 706
|
Between H and VH on luma, if you're more than 3 feet away, the visual difference is small, but VH is indeed sharper, and you CAN see this in fine lines like animal fur. sifu from kungfu panda , closeup on his furry ears are a good test.
For chroma, I wouldn't even recommend NGU, because it'd be a waste of electricity. Even at 3 feet, it becomes very hard to see the difference between ngu VH vs lanczos. at 3feet plus, I would say it's impossible, I tested for myself, I got it wrong 50% of the time.
__________________
Ghetto | 2500k 5Ghz |
6th July 2019, 07:41 | #56782 | Link | |
Registered User
Join Date: Mar 2002
Posts: 2,323
|
Quote:
__________________
Ryzen 5 2600,Asus Prime b450-Plus,16GB,MSI GTX 1060 Gaming X 6GB(v398.18),Win10 LTSC 1809,MPC-BEx64+LAV+MadVR,Yamaha RX-A870,LG OLED77G2(2160p@23/24/25/29/30/50/59/60Hz) | madvr config |
|
6th July 2019, 08:16 | #56783 | Link | |
Registered User
Join Date: Jul 2016
Posts: 130
|
Quote:
As @Asmodian stated above, NGU very high costs nearly triple on performance (44/16ms) of high, so it must use the most GPU power for a better result. |
|
6th July 2019, 08:52 | #56784 | Link | |||
Registered User
Join Date: Mar 2002
Posts: 2,323
|
Quote:
Quote:
Quote:
- first 3 minutes of Shazam 23p 4k HDR BD remux (~75GB, video bitrate 76.7 Mb/s) on a 4K screen - external srt subtitle is used (MPC-BE internal sub filter) - LAV filters - madvr: -- hdr passthrough -- only chroma upscaling is applied: NGU Sharp High -- dithering: Error Diffusion 2 -- no trade quality option is checked -- full screen window mode -- 10 bit output if possible GPU usage results (checked with nvidiainspector): Code:
- dxva2 native: 76% - 80% - dxva2 copy-back, - crop: 83% - 87% - dxva2 copy-back, + crop: 83% - 91% - d3d11 native: 73% - 77% - d3d11 copy-back, - crop: 85% - 88% - d3d11 copy-back, + crop: 87% - 95% Interestingly enough, cropping (with copy-back modes) increases GPU usage and don't reduce it (it uses the same profile, so result is valid). I'll be curious about your results/graphs with similar test case, guys, including your system (mine is in my signature).
__________________
Ryzen 5 2600,Asus Prime b450-Plus,16GB,MSI GTX 1060 Gaming X 6GB(v398.18),Win10 LTSC 1809,MPC-BEx64+LAV+MadVR,Yamaha RX-A870,LG OLED77G2(2160p@23/24/25/29/30/50/59/60Hz) | madvr config Last edited by chros; 8th July 2019 at 11:31. |
|||
6th July 2019, 14:13 | #56787 | Link | |
Registered User
Join Date: May 2013
Posts: 706
|
Quote:
w/ NGU it sometimes crops to 3838 then you get lanzos which obviously pushes above 39ms total w/ all the other toppings.
__________________
Ghetto | 2500k 5Ghz |
|
6th July 2019, 17:37 | #56788 | Link | |
Registered User
Join Date: Oct 2016
Posts: 896
|
dxva copyback benchmarks
Quote:
So I've done some benchmarks on my HTPC... Notes: - 'GPU' and 'video' numbers are the frequency reported / usage % (e.g. 1290 MHz at 50% load = 'GPU 645') - CPU and GPU usage counters of DXVA Checker are completely wrong, I don't know how it computes them. Maybe the GPU counter reports only shader usage so it could be right but not useful, but the CPU usage is always wrong. I used HWMonitor which gives the same values as other monitoring tools. CPU @ 3500 MHz (FSB 333), RAM verified dual channel 1. 4K HEVC 10-bit. Best of 5 passes with DXVA Checker decode/playback, best of 3 runs with madVR. Playback at 1920x1080. madVR settings: scale chroma separately; no compromise on HDR quality; SSIM2D downscale; clip pre-measured for HDR; no black bars detection. RAM @ 666: Decode: 63,0 fps, CPU 65, GPU 1006, Bus 24 Playback: 34,9 fps, CPU 81, GPU 731, Bus 21 madVR: 439 dropped frames, avg 50,16 ms, max 78,17 ms, GPU 1772, CPU 95 RAM @ 800: Decode: 66,8 fps, CPU 60, GPU 1017, Bus 25 Playback: 34,5 fps, CPU 84, GPU 656, Bus 21 madVR: 315 dropped frames, avg 45,68 ms, max 63,78 ms, GPU 1772, CPU 90 For reference, with Native: Decode: 178,5 fps, CPU 46, GPU 1642, video 1467 Playback: 177,0 fps, CPU 54, GPU 1785, video 1467 madVR: 0 dropped frames, avg 34,38 ms, max 38,46 ms, GPU 1613, CPU 68 Difference of dropped frames and max render times under madVR just with 20% faster RAM is massive. With DXVA Checker, CPU is not fully loaded with decode and only 6% faster decode with 20% faster RAM. Software/platform inefficiency? 2. Same test but with madVR 'light' settings: compromise on HDR quality checked; Bicubic downscaling instead of SSIM2D copyback: avg 16,5 ms, max 25,04 ms, GPU 1136, CPU 78 native: avg 14,93 ms, max 17,78 ms, GPU 592, CPU 25 max render time is 40% better while GPU is two times less loaded, CPU three times less loaded. Massive performance impact. I understand why CPU would be loaded if it has to wait for frames to be read/written from/to system RAM, but why more GPU load? Can't the GPU render a frame it has received from the renderer while the next queued frame from the decoder is transfered over the PCIe bus and back? A single 4K P010 frame is 25 MB, at PCIe 2 x16 it should take 3,125 ms, 6,25 ms round-trip just for the time over the bus. If the rendering has stalls it could explain the difference of a few ms between copyback & native even with very high end GPUs. 3. A lighter test comparing Jellyfish clip at 1080p HEVC, same bitrate, in 8-bit and 10-bit: decode 8-bit: 266,5 fps, CPU 54, GPU 1797, video 1430, bus 11 decode 10-bit: 210,7 fps, CPU 60, GPU 1797, video 996, bus 22 playback 8-bit: 240,2 fps, CPU 75, GPU 1797, video 1141, bus 15 playback 10-bit: 181,9 fps, CPU 75, GPU 1743, video 852, bus 20 We see 10-bit decode takes up exactly two times the bus bandwidth as 8-bit, as expected. The 10-bit decode performance doesn't scale to 4x the speed of the 4K clip (would be 267 fps). for reference, 10-bit native: 299,2 fps, CPU 18, GPU 1613, video 1415, bus 2 4. Just out of curiosity I underclocked the CPU to 2100 MHz (FSB 200), to be able to test more different RAM speeds: (Jellyfish 10-bit DXVA Checker decode): RAM @ 400: 131,6 fps (native 268,4), CPU 76, GPU 1589, video 989, bus 13 RAM @ 533: 139,6 fps (native 275,7), CPU 70, GPU 1642, video 909, bus 14 RAM @ 666: 148,8 fps (native 281,1), CPU 67, GPU 1642, video 798, bus 15 RAM @ 800: 146,0 fps (native 281,8), CPU 68, GPU 1428, video 766, bus 15 for reference, CPU @ 3500 & RAM @ 800: 210,7 fps, CPU 60, GPU 1797, video 996, bus 22 With same RAM speed but 66% faster CPU, 40-45% more fps. With same (slow) CPU speed but 66% faster RAM, 13% more fps.
__________________
HTPC: Windows 10 22H2, MediaPortal 1, LAV Filters/ReClock/madVR. DVB-C TV, Panasonic GT60, Denon 2310, Core 2 Duo E7400 oc'd, GeForce 1050 Ti 536.40 |
|
6th July 2019, 19:44 | #56789 | Link |
Registered Developer
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,340
|
70% CPU usage on Copy-Back is not a typical result, really. On NVIDIA or Intel you should see extremely low CPU usage, if you have a relatively recent CPU, since both of those will use the DMA engines to copy the image, which does not result in high CPU usage.
AMD, especially on older generations, has been notoriously bad with copy-back, and I would not recommend using it there, or using it as a testing reference for any meaning beyond those cards specifically. Unfortunately I couldn't really determine from your post which hardware was used.
__________________
LAV Filters - open source ffmpeg based media splitter and decoders |
6th July 2019, 20:46 | #56790 | Link |
Registered User
Join Date: Oct 2016
Posts: 896
|
Yes it's old it's the one from my sig, Core 2 E7400.
Is the DMA method possible starting from the CPUs with integrated memory controller? Edit: LAV says 'cb direct', if that's useful.
__________________
HTPC: Windows 10 22H2, MediaPortal 1, LAV Filters/ReClock/madVR. DVB-C TV, Panasonic GT60, Denon 2310, Core 2 Duo E7400 oc'd, GeForce 1050 Ti 536.40 |
7th July 2019, 02:06 | #56791 | Link | |
Registered User
Join Date: May 2013
Posts: 706
|
Quote:
My HTPC build with g3258 @ 4.7ghz will go from 60-70% for 4K Dx11 Copyback. I would expect i3 to have something like that too, but i5 and above shouldn't
__________________
Ghetto | 2500k 5Ghz |
|
7th July 2019, 05:16 | #56792 | Link |
Registered User
Join Date: Oct 2012
Posts: 7,903
|
i get 15 % CPU usage on an i3 4130 and it isn't even using the full clock... 2.5-3 ghz playing UHD 10 bit 59p using d3d9.
with 10 bit 23p i get about 5 % at 1.1 ghz. is the copyback operation AVX2 optimised and that's why? |
8th July 2019, 05:58 | #56795 | Link |
Registered User
Join Date: Oct 2008
Posts: 13
|
Hi, guys. Trying to troubleshoot my framerate sync issue. For a couple months I've been getting frame drops/skips where I haven't had them before. ReClock stopped working and I've been having issues with custom resolutions but still testing.
My composition rate is 23.971. Is that normal? |
8th July 2019, 09:54 | #56796 | Link |
Registered User
Join Date: Jan 2018
Location: Barcelona
Posts: 50
|
I think ideal would be 23.976
__________________
Livingroom: Ryzen 7 1700@3.9ghz - Win Insiders Fast Ring - MSI RTX 2700 Gaming - Philips 65OLED803 | Bedroom Ryzen 3 1200 - Win 8.1 - GTX1060 - LG OLED EG920V 55" > All with MadVR latest test build |
8th July 2019, 11:35 | #56797 | Link | |
Registered User
Join Date: Mar 2002
Posts: 2,323
|
Quote:
We can continue here.
__________________
Ryzen 5 2600,Asus Prime b450-Plus,16GB,MSI GTX 1060 Gaming X 6GB(v398.18),Win10 LTSC 1809,MPC-BEx64+LAV+MadVR,Yamaha RX-A870,LG OLED77G2(2160p@23/24/25/29/30/50/59/60Hz) | madvr config Last edited by chros; 8th July 2019 at 11:38. |
|
8th July 2019, 19:47 | #56800 | Link |
Registered User
Join Date: Aug 2007
Posts: 87
|
I encountered a crash in Potplayer but I do not know if it is due to Madvr or potplayer.
Unhandled exception occured [0xC000000D@0x000000004A535AB4] at MadVR64.ax Problem appears if while playing the video at full screen, I press the minimize button and then the maximize button. While sound plays normally, I get a white screen after maximize and program crashes with that message. |
Tags |
direct compute, dithering, error diffusion, madvr, ngu, nnedi3, quality, renderer, scaling, uhd upscaling, upsampling |
Thread Tools | Search this Thread |
Display Modes | |
|
|