Pytorch Basics - Transforms – Liam's Learning

Why do we need transform?

Data comes in many different formats. On the other hand, PyTorch can only do machine learning with one data type, the tensor. Transforms can convert any data to a tensor. In this post, we will look at how to transform images. I will assume that you are familiar with PyTorch Datasets. If you are not, I recommend reading this post before you continue.

How do PyTorch transforms work?

All built-in datasets from the torchvision module take the parameters transform and target_tranform. They take in a function that transforms input data into a tensor, following predefined steps. To avoid having to write these functions ourselves, the torchvision.transforms module come with an image-to-tensor transform, called ToTensor out of the box.

Let’s see an example through the FashionMNIST dataset.

import torch
from torchvision.datasets import FashionMNIST
from torchvision.transforms import ToTensor, Lambda

def our_own_transformation(target):
    """
    Transformes target label to a one-hot tensor
    example:

    >>> our_own_transformation(3)
    >>> torch.tensor([0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0])
    """

    zeros_list = torch.zeros(10, dtype=torch.float)
    one_hot_index = torch.tensor(target)
    one_hot_tensor = zeros_list.scatter_(0, one_hot_index, value=1)
    return one_hot_tensor

ds_train = FashionMNIST(
    root="data",
    train=True,
    download=True,
    transform=ToTensor(),
    target_transform=Lambda(our_own_transformation)
)

ds_test = FashionMNIST(
    root="data",
    train=True,
    download=True,
    transform=ToTensor(),
    target_transform=Lambda(our_own_transformation)
)

In this code, we specified that we want to convert our training data to a tensor using the ToTensor method, and the target label to a tensor using our_own_transformation.

Day 3: Pytorch Basics - Transforms

Why do we need transform?

How do PyTorch transforms work?

Further reading