Learn R Programming

torchvision

torchvision is an extension for torch providing image loading, transformations, common architectures for computer vision, pre-trained weights and access to commonly used datasets.

Installation

The CRAN release can be installed with:

install.packages("torchvision")

You can install the development version from GitHub with:

remotes::install_github("mlverse/torchvision@main")

Copy Link

Version

Install

install.packages('torchvision')

Monthly Downloads

1,718

Version

0.7.0

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Daniel Falbel

Last Published

July 18th, 2025

Functions in torchvision (0.7.0)

mnist_dataset

MNIST and Derived Datasets
flowers102_dataset

Oxford Flowers 102 Dataset
model_alexnet

AlexNet Model Architecture
flickr_caption_dataset

Flickr Caption Datasets
fgvc_aircraft_dataset

FGVC Aircraft Dataset
oxfordiiitpet_dataset

Oxford-IIIT Pet Classification Datasets
nms

Non-maximum Suppression (NMS)
model_mobilenet_v2

model_resnet

ResNet implementation
model_inception_v3

Inception v3 model
model_vgg

VGG implementation
transform_color_jitter

Randomly change the brightness, contrast and saturation of an image
generalized_box_iou

Generalized Box IoU
transform_convert_image_dtype

Convert a tensor image to the given dtype and scale the values accordingly
tensor_image_browse

Display image tensor
transform_affine

Apply affine transformation on an image keeping image center invariant
tensor_image_display

Display image tensor
transform_adjust_contrast

Adjust the contrast of an image
remove_small_boxes

Remove Small Boxes
transform_center_crop

Crops the given image at the center
oxfordiiitpet_segmentation_dataset

Oxford-IIIT Pet Segmentation Dataset
transform_adjust_brightness

Adjust the brightness of an image
draw_bounding_boxes

Draws bounding boxes on image.
magick_loader

Load an Image using ImageMagick
image_folder_dataset

Create an image folder dataset
tiny_imagenet_dataset

Tiny ImageNet dataset
transform_grayscale

Convert image to grayscale
transform_adjust_hue

Adjust the hue of an image
transform_adjust_saturation

Adjust the color saturation of an image
transform_random_affine

Random affine transformation of the image keeping center invariant
transform_hflip

Horizontally flip a PIL Image or Tensor
transform_crop

Crop the given image at specified location and output size
transform_random_resized_crop

Crop image to random size and aspect ratio
transform_random_perspective

Random perspective transformation of an image with a given probability
transform_random_apply

Apply a list of transformations randomly with a given probability
transform_random_horizontal_flip

Horizontally flip an image randomly with a given probability
transform_random_order

Apply a list of transformations in a random order
transform_linear_transformation

Transform a tensor image with a square transformation matrix and a mean_vector computed offline
transform_normalize

Normalize a tensor image with mean and standard deviation
transform_resized_crop

Crop an image and resize it to a desired size
transform_rgb_to_grayscale

Convert RGB Image Tensor to Grayscale
transform_rotate

Angular rotation of an image
transform_to_tensor

Convert an image to a tensor
transform_resize

Resize the input image to the given size
transform_ten_crop

Crop an image and the flipped image each into four corners and a central crop
transform_five_crop

Crop image into four corners and a central crop
transform_pad

Pad the given image on all sides with the given "pad" value
transform_perspective

Perspective transformation of an image
transform_random_erasing

Randomly selects a rectangular region in an image and erases its pixel values
transform_random_grayscale

Randomly convert image to grayscale with a given probability
transform_adjust_gamma

Adjust the gamma of an RGB image
transform_random_choice

Apply single transformation randomly picked from a list
vision_make_grid

A simplified version of torchvision.utils.make_grid
transform_random_crop

Crop the given image at a random location
transform_vflip

Vertically flip a PIL Image or Tensor
transform_random_vertical_flip

Vertically flip an image randomly with a given probability
transform_random_rotation

Rotate the image by angle
base_loader

Base loader
box_xyxy_to_cxcywh

box_xyxy_to_cxcywh
box_iou

Box IoU
box_convert

Box Convert
box_xywh_to_xyxy

box_xywh_to_xyxy
box_area

Box Area
caltech_dataset

Caltech Datasets
batched_nms

Batched Non-maximum Suppression (NMS)
box_cxcywh_to_xyxy

box_cxcywh_to_xyxy
box_xyxy_to_xywh

box_xyxy_to_xywh
coco_polygon_to_mask

Convert COCO polygon to mask tensor (Robust Version)
fer_dataset

FER-2013 Facial Expression Dataset
draw_segmentation_masks

Draw segmentation masks
eurosat_dataset

EuroSAT datasets
clip_boxes_to_image

Clip Boxes to Image
cifar10_dataset

CIFAR datasets
coco_caption_dataset

COCO Caption Dataset
draw_keypoints

Draws Keypoints
coco_detection_dataset

COCO Detection Dataset