Thanks to Olli's scene we could profile a very detailed, production grade car, at very high resolutions (around 60 MPx.).
So we now have a real-world measured performance chart for bucket versus progressive, with and without (many) Render Elements, and at increasing resolutions.
Tests conducted in max 2017.
The render was set up as a 1-100, 0.025 N.T.
The reason for the max AA subdivs is to make sure the noise threshold was met, and we didn't hide from stuck buckets or long sampling.
The higher Noise Threshold, instead, was chosen so to finish the profiling in a reasonable time, and should carry absolutely no conceptual difference from the same test carried with a lower Threshold.
The image resolution was 8961 x 6723 Px (~ 60.2 MPx), while the 16 Render Elements (when on) consisted of the usual set of beauty elements, and nine multi-mattes.
So, the first three rows are our "zero", the fastest way to get to an RGBA image: bucket sampler, and no Render Elements.
The next three rows are again the bucket sampler, but with render element active: we can see the slack is between 5 and 9 percent, give or take decimals.
Now, the first thre progressive sampler rows are calculated without Render Elements, and the performance slack with the bucket sampler doing the same job is between 2 and 4%. The middle result, showing a 12% difference, should be currently treated as an outlier, as i believe windows updated mid-job (don't ask...) (EDIT: I think it was windows defender working on a schedule.).
The last triplet of rows in the table above has the worst penalty for the progressive sampler, where compared to a bucket *without* REs, it's 10% or so slower, until the resolution is very big, and then it jumps to a 24%.
Notice the differences between Samplers (instead of tasks and samplers) are *much* more contained, for all but the two outlier values.
The test is being re-run to validate the results, so for now take with a pinch of salt.
So we now have a real-world measured performance chart for bucket versus progressive, with and without (many) Render Elements, and at increasing resolutions.
Tests conducted in max 2017.
The render was set up as a 1-100, 0.025 N.T.
The reason for the max AA subdivs is to make sure the noise threshold was met, and we didn't hide from stuck buckets or long sampling.
The higher Noise Threshold, instead, was chosen so to finish the profiling in a reasonable time, and should carry absolutely no conceptual difference from the same test carried with a lower Threshold.
The image resolution was 8961 x 6723 Px (~ 60.2 MPx), while the 16 Render Elements (when on) consisted of the usual set of beauty elements, and nine multi-mattes.
Sampler Type | Render Elements Active | Resolution | Render Time (Seconds) | Slower than Quickest Task | Slower than Bucket Sampler |
Bucket | No | Quarter (~2k) | 4195.27 | 0.00% | 0.00% |
Bucket | No | Half (~4k) | 16558.8 | 0.00% | 0.00% |
Bucket | No | Full (~8k) | 64265.5 | 0.00% | 0.00% |
Bucket | Yes | Quarter (~2k) | 4589.23 | 8.58% | 0.00% |
Bucket | Yes | Half (~4k) | 17591.5 | 5.87% | 0.00% |
Bucket | Yes | Full (~8k) | 68197.3 | 5.77% | 0.00% |
Progressive | No | Quarter (~2k) | 4289.12 | 2.19% | 2.19% |
Progressive | No | Half (~4k) | 18955.7 | 12.64% | 12.64% |
Progressive | No | Full (~8k) | 67315.7 | 4.53% | 4.53% |
Progressive | Yes | Quarter (~2k) | 4671.05 | 10.19% | 1.75% |
Progressive | Yes | Half (~4k) | 18301.9 | 9.52% | 3.88% |
Progressive | Yes | Full (~8k) | 85519.2 | 24.85% | 20.25% |
So, the first three rows are our "zero", the fastest way to get to an RGBA image: bucket sampler, and no Render Elements.
The next three rows are again the bucket sampler, but with render element active: we can see the slack is between 5 and 9 percent, give or take decimals.
Now, the first thre progressive sampler rows are calculated without Render Elements, and the performance slack with the bucket sampler doing the same job is between 2 and 4%. The middle result, showing a 12% difference, should be currently treated as an outlier, as i believe windows updated mid-job (don't ask...) (EDIT: I think it was windows defender working on a schedule.).
The last triplet of rows in the table above has the worst penalty for the progressive sampler, where compared to a bucket *without* REs, it's 10% or so slower, until the resolution is very big, and then it jumps to a 24%.
Notice the differences between Samplers (instead of tasks and samplers) are *much* more contained, for all but the two outlier values.
The test is being re-run to validate the results, so for now take with a pinch of salt.
Comment