Difference between revisions of "Performance & Benchmark"
Line 78: | Line 78: | ||
|0.34 | |0.34 | ||
|1.43 | |1.43 | ||
|- | |- | ||
|[[Function:GoodFeaturesToTrack|goodFeaturesToTrack]] | |[[Function:GoodFeaturesToTrack|goodFeaturesToTrack]] | ||
Line 97: | Line 96: | ||
|11,828.72 | |11,828.72 | ||
|8,466.54 | |8,466.54 | ||
|- | |- | ||
|[[Function:BitConversion|bitConversion]] - From 8 to 16 bits | |[[Function:BitConversion|bitConversion]] - From 8 to 16 bits | ||
Line 149: | Line 147: | ||
</tab> | </tab> | ||
--> | --> | ||
<tab name="RTX 2060"> | <tab name="RTX 2060"> | ||
{|class="wikitable" | {|class="wikitable" | ||
Line 213: | Line 212: | ||
|0.34 | |0.34 | ||
|1.43 | |1.43 | ||
|- | |- | ||
|[[Function:GoodFeaturesToTrack|goodFeaturesToTrack]] | |[[Function:GoodFeaturesToTrack|goodFeaturesToTrack]] | ||
Line 232: | Line 230: | ||
|11,828.72 | |11,828.72 | ||
|8,466.54 | |8,466.54 | ||
|- | |- | ||
|[[Function:BitConversion|bitConversion]] - From 8 to 16 bits | |[[Function:BitConversion|bitConversion]] - From 8 to 16 bits | ||
Line 283: | Line 280: | ||
|} | |} | ||
</tab> | </tab> | ||
<!-- | <!-- | ||
<tab name="GTX 1080"> | <tab name="GTX 1080"> |
Revision as of 20:35, 23 October 2022
If one thing CUVI gives you, it's performance boost over competitive libraries and solutions. Using GPGPU as the underlying hardware, Imaging and Vision modules get maximum benefit due to their inherent parallel algorithms. In addition to cost cutting on CPU-based clusters, CUVI gives up to 15x speedup over Intel IPP.
Applications using CUVI are generally ten times faster than their CPU counterpart. CUVI framework also gives the ease to scale the application on more than one GPU making it as fast as you want.
Benchmark
The following benchmark via NVIDIA's Performance tools for Windows and Linux. Timing figure represents frames per second (rounded) based on only the processing time on the single GPU. The benchmarks are performed on color 8-bit per channel images except where mentioned otherwise.
Algorithm / Image Size | 720p | 1080p | 4k (3840x2160) | 8k (7680x4320) |
---|---|---|---|---|
add - 2 Images | 0.06 | 0.14 | 0.51 | 2.01 |
channelMix | 0.07 | 0.14 | 0.55 | 2.25 |
demosaic | 0.24 | 0.53 | 2.10 | 8.10 |
demosaicDFPD | 0.52 | 1.22 | 4.53 | 18.1 |
gammaCorrect | 0.12 | 0.28 | 1.02 | 4.30 |
histEq - Single Channel | 0.21 | 0.24 | 0.84 | 3.10 |
LUT | 0.03 | 0.08 | 0.29 | 1.20 |
blackGammaLUT | 0.069 | 0.16 | 0.61 | 2.50 |
rgb2gray | 0.04 | 0.09 | 0.34 | 1.43 |
goodFeaturesToTrack | 26,720.11 | 20,384.87 | 11,828.72 | 8,466.54 |
opticalFlowPyrLKDense - 2 Images | 26,720.11 | 20,384.87 | 11,828.72 | 8,466.54 |
focusStack - 2 Images | 26,720.11 | 20,384.87 | 11,828.72 | 8,466.54 |
bitConversion - From 8 to 16 bits | 0.01 | 0.24 | 0.95 | 3.81 |
crop | 0.04 | 0.12 | 0.41 | 1.70 |
resize - Scale=2.0 | 0.25 | 0.55 | 2.21 | 8.70 |
rotate - Non Cropping, Angle = -3.76f | 0.04 | 0.09 | 0.36 | 1.11 |
warpPerspective | 0.08 | 0.20 | 0.77 | 3.10 |
imageFilter - 5x5 floating point window | 0.65 | 1.56 | 5.81 | 13.7 |
underwaterFilter | 0.53 | 1.10 | 4.00 | 15.2 |
haarFwd | 0.14 | 0.30 | 1.21 | 4.90 |