Difference between revisions of "CUVI Features"
(64 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
__NOTOC__ | |||
CUVI SDK is an image processing kit written and developed on C-CUDA and C++ for over a period of ten years. The SDK includes 100+ image processing and computer vision functions that are plug-n-play for imaging applications in a plethora of verticals. Each function is hand crafted and highly optimized to run on CUDA supported GPUs. Many functions use our own proprietary approach to algorithms and processing to make the best out of hardware and input data. | |||
==Modules== | ==Modules== | ||
CUVI comes in following image processing modules. | CUVI comes in following image processing modules. | ||
# Arithmetic & Logical<!--[[Module:cuvial|Arithmetic & Logical]]--> | |||
# Color Operations <!--[[Module:cuvico|Color Operations]]--> | |||
# Computer Vision<!--[[Module:cuvicv|Computer Vision]]--> | |||
# Data Exchange<!--[[Module:cuvide|Data Exchange]]--> | |||
# Geometry Transforms<!--[[Module:cuvigt|Geometry Transforms]]--> | |||
# Image Filtering<!--[[Module:cuviif|Image Filtering]]--> | |||
# Image Statistics<!--[[Module:cuviis|Image Statistics]]--> | |||
# Image Transforms<!--[[Module:cuviit|Image Transforms]]--> | |||
# Image Codecs | |||
<br/> | <br/> | ||
Line 18: | Line 20: | ||
| | | | ||
===Arithmetic & Logical=== | ===Arithmetic & Logical=== | ||
# [[Function:Abs|abs]] - Computes absolute value of each image pixel | |||
# [[Function:AbsDiff|absDiff]] - Computes absolute difference between 2 images. | |||
# [[Function:Add|add]] - Adds two images | |||
# [[Function:AND|AND]] - Bitwise AND operation between two images. | |||
# [[Function:Cbrt|cbrt]] - Computes cube root of each image pixel | |||
# [[Function:Ceil|ceil]] - Applies ceil function to each image pixel | |||
# [[Function:Divide|divide]] - Divides pixel values of two images | |||
# [[Function:dot|dot]] | |||
# [[Function:Exp|exp]] - Computes exponential of each image pixel | |||
# [[Function:Floor|floor]] - Applies floor function to each image pixel | |||
# [[Function:InRange|inRange]] - Checks if image pixels lie between a range of values | |||
# [[Function:Invert|invert]] - Computes matrix inverse using LU factorization | |||
# [[Function:MatMul|matMul]] - Performs matrix multiplication | |||
# [[Function:MatMulTrans|matMulTrans]] - Performs matrix multiplication with its transpose | |||
# [[Function:Multiply|multiply]] - Multiplies pixel values of two images | |||
# [[Function:NOT|NOT]] - Inverts the bits of each pixel. | |||
# [[Function:OR|OR]] - Bitwise OR operation between two images. | |||
# [[Function:Pow|pow]] - Raise each pixel to a specified power. | |||
# [[Function:Round|round]] - Rounds each pixel to nearest integer value. | |||
# [[Function:SHL|SHL]] - Left shift each pixel of an image. | |||
# [[Function:SHR|SHR]] - Right shift each pixel of an image | |||
# [[Function:Sqrt|sqrt]] - Takes square root of each pixel of an image. | |||
# [[Function:Subtract|subtract]] - Subtracts pixel values of two images. | |||
# [[Function:XOR|XOR]] - Logical bitwise XOR operation between two images. | |||
===Color Operations=== | ===Color Operations=== | ||
# [[Function:Adjust|adjust]] - Equivalent to MATLAB's imadjust | |||
# [[Function:AutoColor|autoColor]] - Fixes false colors at pixel level using a propriety approach | |||
# [[Function:BorderMask|borderMask]] - Masks image borders with custom intensity | |||
# [[Function:blackGammaLUT|blackGammaLUT]] - Remaps image values according to 3 look up tables | |||
# [[Function:ChannelMix|channelMix]] - Applies a color twist matrix to an image | |||
# [[Function:ChannelMerge|channelMerge]] - Merges R, G and B Channel to form a color image | |||
# [[Function:ChannelSplit|channelSplit]] - Splits a three channel image into R, G and B channels | |||
# [[Function:ColorPick|colorPick]] - Performs color segmentation of image by selecting a single color | |||
# [[Function:Dehaze|dehaze]] - Makes sense of images taken in a blizzard | |||
# [[Function:Demosaic|demosaic]] - Restores an RGB image from a CFA Bayer image | |||
# [[Function:DemosaicDFPD|demosaicDFPD]] - Restores an RGB image from a CFA Bayer image using DFPD algorithm | |||
# [[Function:fpn|fpn]] - fixed point noise remove via camera black image | |||
# [[Function:FPNCorrection|FPNCorrection]] - fixed point noise removal via black and white images and gain | |||
# [[Function:GammaCorrect|gammaCorrect]] - Performs gamma correction of an image | |||
# [[Function:Gray2rgb|gray2rgb]] - Copy a grayscale image to each channel of an RGB image. | |||
# [[Function:HistEq|histEq]] - Performs histogram equalization | |||
# [[Function:Hsv2rgb|hsv2rgb]] - Converts image from HSV color space to RGB. | |||
# [[Function:ImageBinary|imageBinary]] - Converts input image into a binary image based on threshold | |||
# [[Function:Lab2rgb|Lab2rgb]] - Converts image from CIE Lab color space to RGB. | |||
# [[Function:LogTransform|logTransform]] - Applies log transformation for contrast enhancement of an image | |||
# [[Function:Lowlight|lowlight]] - enhances the low intensity colors in a dark photo | |||
# [[Function:LUT|LUT]] - Remaps image values according to look up table | |||
# [[Function:RGB2Gray|rgb2gray]] - Converts an RGB image to gray scale | |||
# [[Function:Rgb2hsv|rgb2hsv]] - Converts an RGB image to HSV color space | |||
# [[Function:Rgb2Lab|rgb2Lab]] - Converts an RGB image to CIE Lab color space | |||
# [[Function:Rgb2ycbcr|rgb2ycbcr]] - Converts an RGB image to YCbCr color space | |||
# [[Function:RGB2YUV|rgb2yuv]] - Converts RGB image to YUV color space | |||
# [[Function:Ycbcr2rgb|ycbcr2rgb]] - Converts YCbCr image to an RGB image | |||
# [[Function:YUV2RGB|yuv2rgb]] - Converts YUV image to an RGB image. | |||
# [[Function:YUV422TOV210|YUV422TOV210]] - Converts YUV422 image to an V210 image. | |||
# [[Function:YUV444TOV210|YUV444TOV210]] - Converts YUV444 image to an V210 image. | |||
# [[Function:V210TOYUV444|V210TOYUV444]] - Converts V210 image to an YUV444 image. | |||
===Computer Vision=== | ===Computer Vision=== | ||
# [[Function:EigenValsVecs|eigenValsVecs]] - Calculates eigen values and eigen vectors at each pixel in an image | |||
# [[Function:findTransformECC|findTransformECC]] - Calculates geometric transform between two images in terms of the ECC criterion | |||
# [[Function:GoodFeaturesToTrack|goodFeaturesToTrack]] - selects N best points on a image using KLT or Harris | |||
# [[Function:MinEigenVal|minEigenVal]] - Calculates the minimum eigen value at each pixel in an image | |||
# [[Function:OpticalFlowHS|opticalFlowHS]] - Calculates flow of each image pixel in two frames using Horn–Schunck method | |||
# [[Function:opticalFlowPyrLKDense|opticalFlowPyrLKDense]] - Calculates flow of each image pixel in two frames using pyramidal Lucas–Kanade method | |||
| | # [[Function:TrackFeatures|trackFeatures]] - Tracks input features from frame 1 onto frame 2 using KLT method | ||
# [[Function:FocusStack|focusStack]] - Stacks multiples photos with various focus into a single all focused image | |||
===Data Exchange=== | ===Data Exchange=== | ||
# [[Function:BitConversion|bitConversion]] - Converts image pixel values from one data type to another. | |||
# [[Function:ConvertScale|convertScale]] - Converts image data type with optional scaling. | |||
# [[Function:GenerateRandomImage|generateRandomImage]] - Creates a new random image. | |||
# [[Function:GetChannel|getChannel]] - Extracts a channel of an image. | |||
# [[Function:PadImage|padImage]] - Pads any side of the image with the value specified. | |||
# [[Function:Rectangle|rectangle]] - Draws a rectangle on the specified region of an image. | |||
# [[Function:Threshold|threshold]] - Clamps image pixel values between specified thresholds. | |||
===Geometry Transforms=== | ===Geometry Transforms=== | ||
# [[Function:Crop|crop]] - Crops the desired portion of an image | |||
# [[Function:Flip|flipImage]] - Mirrors an image about a horizontal or vertical axis, or both | |||
# [[Function:Resize|resize]] - Resizes input image using specified interpolation | |||
# [[Function:Rotate|rotate]] - Rotates an image about specified anchor | |||
# [[Function:RotateNoCrop|RotateNoCrop]] - Rotates an image about specified anchor | |||
# [[Function:Taper|taper]] - Smartly extends image border | |||
# [[Function:Transpose|transpose]] - Takes Transpose of the image | |||
# [[Function:WarpAffine|warpAffine]] - Applies an affine transformation to an image. | |||
# [[Function:WarpAffineBack|warpAffineBack]] - Applies an inverse affine transformation to an image. | |||
# [[Function:WarpPerspective|warpPerspective]] - Applies an affine transformation to an image. | |||
# [[Function:WarpPerspectiveBack|warpPerspectiveBack]] - Applies an inverse affine transformation to an image. | |||
===Image Filtering=== | ===Image Filtering=== | ||
# [[Function:ImageFilter|imageFilter]] - Performs linear 1D and 2D image filtering | |||
# [[Function:ImageGradients|imageGradients]] - Calculates X and Y gradients of image | |||
# [[Function:MinFilter|minFilter]] - Filters an image using a min filter | |||
# [[Function:MaxFilter|maxFilter]] - Filters an image using a max filter | |||
# [[Function:MedianFilter|medianFilter]] - Applies 2D median filter on an image | |||
# [[Function:UnderwaterFilter|underwaterFilter]] - Brings out original colors of underwater photography. | |||
===Image Statistics=== | ===Image Statistics=== | ||
# [[Function:CalcHist|calcHist]] - Computes the histogram of an 8 bit image. | |||
# [[Function:CountInRange|countInRange]] - Counts the number of pixels within the given intensity range | |||
# [[Function:Max|max]] - Computes the maximum of image pixel values | |||
# [[Function:Mean|mean]] - Computes the mean of image pixel values | |||
# [[Function:MeanStdDev|meanStdDev]] - Computes the mean and standard deviation of image pixel values | |||
# [[Function:Min|min]] - Computes the minimum of image pixel values | |||
# [[Function:MinMax|minMax]] - Computes the minimum and maximum of image pixel values | |||
# [[Function:Sum|sum]] - Computes the sum of image pixel values | |||
===Image Transforms=== | ===Image Transforms=== | ||
# [[Function:Fft2|fft2]] - Computes the 2D Fast Fourier Transform of an image. | |||
# [[Function:FftGetProperty|fftGetProperty]] - Gets magnitude, phase, real and imaginary part of an FFT image with optional scaling. | |||
# [[Function:FftShift|fftShift]] - Moves low frequency components to the center of an FFT image. | |||
# [[Function:haarFwd|haarFwd]] - Performs one-level wavelet decomposition of an image using haar basis | |||
# [[Function:haarInv|haarInv]] - Performs one-level haar wavelet reconstruction of an image | |||
===Image Codecs=== | |||
# [[Function:Encode|encode]] - Encodes image pixel array into JPEG. | |||
# [[Function:Decode|decode]] - Decodes a JPEG into an image pixel array. | |||
===Video Stabilization=== | <!--===Video Stabilization=== | ||
* [[Function:CuviMotionEstimator|CuviMotionEstimator]] - | * [[Function:CuviMotionEstimator|CuviMotionEstimator]] - Class to estimate global motion vectors between an image pair. | ||
* [[Function:CuviVideoStabilizer|CuviVideoStabilizer]] - | * [[Function:CuviVideoStabilizer|CuviVideoStabilizer]] - Class for transparent video stabilization of a video file.--> | ||
|} | |} | ||
<br/> | <br/> | ||
Line 133: | Line 155: | ||
A C++ class to hold image data on the device. It has the following methods | A C++ class to hold image data on the device. It has the following methods | ||
* CuviImage(); - default constructor | * CuviImage(); - default constructor | ||
* CuviImage(CuviSize size, | * CuviImage(CuviSize size, Cuvi32s depth, Cuvi32s channels) - Main Constructor. throws exception in case of failure | ||
* CuviImage(const CuviImage&) - copy constructor | * CuviImage(const CuviImage&) - copy constructor | ||
* | * upload(const void* pSrcHost, const Cuvi32s srcPitch) - Copy image data from host to device. | ||
* | * upload(const void* pSrcHost, const Cuvi32s srcPitch, const CuviStream& stream) - Asynchronously copy image data from host to device. | ||
* | * upload(const void* pSrcHost, CuviRect srcROI, const Cuvi32s srcPitch, CuviPoint2D<int> dstOrigin) - Copy a chunk of image data from host to device. | ||
* | * upload(const void* pSrcHost, CuviRect srcROI, const Cuvi32s srcPitch, CuviPoint2D<int> dstOrigin, const CuviStream& stream) - Asynchronously copy a chunk of image data from host to device. | ||
* download(void* pDstHost, const Cuvi32s dstPitch) - Copy image data from device to host | |||
* download(void* pDstHost, const Cuvi32s dstPitch, const CuviStream& stream) - Asynchronously copy image data from device to host | |||
* download(void* pDstHost, CuviPoint2D<int> dstOrigin, const Cuvi32s dstPitch, CuviRect srcROI) - Copy a chunk of image data from device to host | |||
* download(void* pDstHost, CuviPoint2D<int> dstOrigin, const Cuvi32s dstPitch, CuviRect srcROI, const CuviStream& stream) - Asynchronously copy a chunk of image data from device to host | |||
* copyTo(CuviImage& dst, CuviPoint2D<int> dstOrigin, CuviRect srcROI, const CuviStream& stream = CuviStream()) - Copy image data (or an ROI of it) from one Cuvi Image to another. | |||
* show(const std::string& title = "CUVI Image", Cuvi32s milliseconds = 0) - Render device image directly on the screen | |||
* ~CuviImage() - Destructor. Frees Device Memory. throws exception in case of failure | * ~CuviImage() - Destructor. Frees Device Memory. throws exception in case of failure | ||
Line 144: | Line 172: | ||
|-valign="top" | |-valign="top" | ||
|style="width:50%;"| | |style="width:50%;"| | ||
===Timer=== | ===Timer=== | ||
CUVI provides a C++ Timer class that accurately profiles your application timing | CUVI provides a C++ Timer class that accurately profiles your application timing |
Latest revision as of 15:11, 7 November 2022
CUVI SDK is an image processing kit written and developed on C-CUDA and C++ for over a period of ten years. The SDK includes 100+ image processing and computer vision functions that are plug-n-play for imaging applications in a plethora of verticals. Each function is hand crafted and highly optimized to run on CUDA supported GPUs. Many functions use our own proprietary approach to algorithms and processing to make the best out of hardware and input data.
Modules
CUVI comes in following image processing modules.
- Arithmetic & Logical
- Color Operations
- Computer Vision
- Data Exchange
- Geometry Transforms
- Image Filtering
- Image Statistics
- Image Transforms
- Image Codecs
Function List
Arithmetic & Logical
Color Operations
Computer Vision
Data Exchange
Geometry Transforms
Image Filtering
Image Statistics
Image Transforms
Image Codecs |
CUVI Core
CUVI Core includes the following helper functionality:
CuviImageA C++ class to hold image data on the device. It has the following methods
|