data.vision¶
Vision utilities.
Datasets¶
|
MNIST handwritten digits dataset from http://yann.lecun.com/exdb/mnist |
|
A dataset of Zalando’s article images consisting of fashion products, a drop-in replacement of the original MNIST dataset from https://github.com/zalandoresearch/fashion-mnist |
|
CIFAR10 image classification dataset from https://www.cs.toronto.edu/~kriz/cifar.html |
|
CIFAR100 image classification dataset from https://www.cs.toronto.edu/~kriz/cifar.html |
|
A dataset wrapping over a RecordIO file containing images. |
|
A dataset for loading image files stored in a folder structure. |
Data transformations¶
|
Sequentially composes multiple transforms. |
|
Cast input to a specific data type |
Converts an image NDArray to a tensor NDArray. |
|
|
Normalize an tensor of shape (C x H x W) with mean and standard deviation. |
|
Crop the input image with random scale and aspect ratio. |
|
Crops the image src to the given size by trimming on all four sides and preserving the center of the image. |
|
Resize an image to the given size. |
Randomly flip the input image left to right with a probability of 0.5. |
|
Randomly flip the input image top to bottom with a probability of 0.5. |
|
|
Randomly jitters image brightness with a factor chosen from [max(0, 1 - brightness), 1 + brightness]. |
|
Randomly jitters image contrast with a factor chosen from [max(0, 1 - contrast), 1 + contrast]. |
|
Randomly jitters image saturation with a factor chosen from [max(0, 1 - saturation), 1 + saturation]. |
|
Randomly jitters image hue with a factor chosen from [max(0, 1 - hue), 1 + hue]. |
|
Randomly jitters the brightness, contrast, saturation, and hue of an image. |
|
Add AlexNet-style PCA-based noise to an image. |