Torch Core¶
This module contains all the basic functions needed by the other modules of the fastai library (it is the counterpart of core, which contains the functions that do not require pytorch). Its documentation can safely be skipped on a first read, unless you want to know what a given function does.
Global constants¶
AdamW = partial(optim.Adam, betas=(0.9,0.99))
bn_types = (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)
defaults.device = torch.device('cuda') if torch.cuda.is_available() else torch.device('cpu')
If you are trying to make fastai run on the CPU, simply change the default device: defaults.device = torch.device('cpu').
Alternatively, if you are not using wildcard imports: fastai.torch_core.defaults.device = torch.device('cpu').
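For instance, a minimal sketch that forces CPU execution, run before any model or data is created:
import torch
import fastai.torch_core

# Make everything default to the CPU before any fastai objects are built.
fastai.torch_core.defaults.device = torch.device('cpu')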
Conversion functions¶
Flattens all the layers of m
into a flat list. This gives easy access to the layers of the model and lets you manipulate the model as if it were a list.
m = simple_cnn([3,6,12])
m
flatten_model(m)
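Since the result is a plain list, ordinary list operations apply. As a small sketch, here is how you could freeze every layer except the last one (using fastai's requires_grad setter, documented below):
layers = flatten_model(m)
for layer in layers[:-1]:
    requires_grad(layer, False)  # freeze all but the final layer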
Converting model parameters to half precision allows us to leverage fast FP16
arithmetic, which can speed up computations by 2-8x. It also reduces memory consumption, allowing us to train deeper models.
Note: Batchnorm layers are not converted to half precision as that may lead to instability in training.
m = simple_cnn([3,6,12], bn=True)
def show_params_dtype(state_dict):
    """Simple function to pretty-print the dtypes of the model parameters"""
    for wt_name, param in state_dict.items():
        print("{:<30}: {}".format(wt_name, str(param.dtype)))
    print()
print("dtypes of model parameters before model2half: ")
show_params_dtype(m.state_dict())
# Converting model to half precision
m_half = model2half(m)
print("dtypes of model parameters after model2half: ")
show_params_dtype(m_half.state_dict())
This is a wrapper on top of Pytorch's torch.as_tensor, which converts a numpy array to a torch tensor. It additionally attempts to map all floats to torch.float32
and all integers to torch.int64
for consistency in model data. Below is an example demonstrating its functionality for floating-point numbers; the same applies to integers.
a1 = np.ones((2, 3)).astype(np.float16)
a2 = np.ones((2, 3)).astype(np.float32)
a3 = np.ones((2, 3)).astype(np.float64)
b1 = np2model_tensor(a1) # Maps to torch.float32
b2 = np2model_tensor(a2) # Maps to torch.float32
b3 = np2model_tensor(a3) # Maps to torch.float32
print(f"Datatype of as': {a1.dtype}, {a2.dtype}, {a3.dtype}")
print(f"Datatype of bs': {b1.dtype}, {b2.dtype}, {b3.dtype}")
Performs both getting and setting of the requires_grad
attribute of a model's parameters, which decides whether gradients are accumulated or not.
If b is None: the function gets the requires_grad of the model; more specifically, it returns the requires_grad of the first parameter in the model.
Else, if b is passed (a boolean value): the requires_grad of all parameters of the model is set to b.
# Any Pytorch model
m = simple_cnn([3, 6, 12], bn=True)
# Get the requires_grad of model
print("requires_grad of model: {}".format(requires_grad(m)))
# Set requires_grad of all params in model to False
requires_grad(m, False)
# Get the requires_grad of model
print("requires_grad of model: {}".format(requires_grad(m)))
Handy function when you want to convert any list-like object to a tensor, initialize your weights manually, and other similar cases.
NB: When passing multiple vectors, all vectors must have the same dimensions. (Obvious, but easy to forget.)
# Conversion from any numpy array
b = tensor(np.array([1, 2, 3]))
print(b, type(b))
# Passing as multiple parameters
b = tensor(1, 2, 3)
print(b, type(b))
# Passing a single list
b = tensor([1, 2, 3])
print(b, type(b))
# Can work with multiple vectors / lists
b = tensor([1, 2], [3, 4])
print(b, type(b))
A wrapper on top of Pytorch's torch.Tensor.cpu()
function, which creates and returns a copy of a tensor (or even a list of tensors) on the CPU. As described in Pytorch's docs, if the tensor or list of tensors is already on the CPU, the exact same data is returned and no copy is made.
Useful to move all the parameters of a model to the CPU in a single call.
if torch.cuda.is_available():
    a = [torch.randn((1, 1)).cuda() for i in range(3)]
    print(a)
    print("Id of tensors in a: ")
    for i in a: print(id(i))
    # Getting a CPU version of the tensors on the GPU
    b = to_cpu(a)
    print(b)
    print("Id of tensors in b:")
    for i in b: print(id(i))
    # Trying to perform to_cpu on a list of tensors already on the CPU
    c = to_cpu(b)
    print(c)
    # The tensors in c have the exact same ids as those in b: no copy was performed.
    print("Id of tensors in c:")
    for i in c: print(id(i))
Returns the data attribute of an object (or collection of objects) that inherits from the ItemBase
class. Useful to examine the exact values of the data, and to work with the data outside of fastai
classes.
# Default example examined
from fastai import *
from fastai.vision import *
path = untar_data(URLs.MNIST_SAMPLE)
data = ImageDataBunch.from_folder(path)
# Examine the labels
ys = list(data.y)
print("Category display names: ", [ys[0], ys[-1]])
print("Unique classes internally represented as: ", to_data([ys[0], ys[-1]]))
Converts a tensor (or list of tensors) to FP16, resulting in less memory consumption and faster computations with the tensor. It does not convert torch.int
types to half precision.
a1 = torch.tensor([1, 2], dtype=torch.int64)
a2 = torch.tensor([1, 2], dtype=torch.int32)
a3 = torch.tensor([1, 2], dtype=torch.int16)
a4 = torch.tensor([1, 2], dtype=torch.float64)
a5 = torch.tensor([1, 2], dtype=torch.float32)
a6 = torch.tensor([1, 2], dtype=torch.float16)
print("dtype of as: ", a1.dtype, a2.dtype, a3.dtype, a4.dtype, a5.dtype, a6.dtype, sep="\t")
b1, b2, b3, b4, b5, b6 = to_half([a1, a2, a3, a4, a5, a6])
print("dtype of bs: ", b1.dtype, b2.dtype, b3.dtype, b4.dtype, b5.dtype, b6.dtype, sep="\t")
Internally moves the data to the CPU and converts it to the numpy.ndarray
equivalent of the torch.Tensor
by calling torch.Tensor.numpy()
.
a = torch.tensor([1, 2], dtype=torch.float64)
if torch.cuda.is_available():
    a = a.cuda()
print(a, type(a), a.device)
b = to_np(a)
print(b, type(b))
# Converts floating-point numbers to integers
print(try_int(12.5), type(try_int(12.5)))
# This is a rank-1 ndarray, which ideally should not be converted to int
print(try_int(np.array([1.5])), try_int(np.array([1.5])).dtype)
# A scalar (rank-0) numpy array is converted to int
print(try_int(np.array(1.5)), type(try_int(np.array(1.5))))
print(try_int(torch.tensor(2.5)), type(try_int(torch.tensor(2.5))))
# Strings are not converted to int (of course)
print(try_int("12.5"), type(try_int("12.5")))
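to_float is the counterpart of to_half: it recursively maps a tensor (or list of tensors) back to full precision (torch.float32), leaving integer types untouched. A minimal sketch, reusing the tensors from the to_half example above:
c4, c5, c6 = to_float([b4, b5, b6])
print(c4.dtype, c5.dtype, c6.dtype)  # expected: torch.float32 torch.float32 torch.float32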
Functions to deal with model initialization¶
Functions to get information of a model¶
Functions to deal with BatchNorm layers¶
This is used by the optimizer to determine which parameters should have weight decay applied when the option bn_wd=False
is used in a Learner
.
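As an illustration (a minimal sketch, not the fastai implementation), batchnorm parameters can be collected with the bn_types tuple defined above, so the optimizer can skip weight decay for them:
def bn_params(model):
    "Collect the parameters of every batchnorm layer in model."
    return [p for m in model.modules() if isinstance(m, bn_types)
              for p in m.parameters()]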
Functions to get random tensors¶
log_uniform(0.5,2,(8,))
rand_bool(0.5, 8)
uniform(0,1,(8,))
uniform_int(0,2,(8,))
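These helpers sample random tensors of the given size. For instance, log_uniform samples uniformly in log-space; roughly (a sketch of the behavior, not the exact implementation):
import numpy as np

def log_uniform_sketch(low, high, size=None):
    "Sample so that the log of the result is uniform between log(low) and log(high)."
    return np.exp(np.random.uniform(np.log(low), np.log(high), size))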
Other functionality¶
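The example below uses fastai's Module, which (as the example relies on) works like nn.Module but does not require subclasses to call super().__init__():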
class _T(Module):
    def __init__(self): self.f = nn.Linear(2,1)
    def forward(self,x): return self.f(x)

t = _T()
t(tensor(1.,2))
If splits
are layers, the model is split sequentially at those layers (the split layers are not included in the preceding group). If want_idxs
is True, the corresponding indexes are also returned. If splits
are lists of layers, the model is split according to those lists.
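A minimal sketch, assuming fastai v1's simple_cnn and split_model:
m = simple_cnn([3, 6, 12])
groups = split_model(m, splits=[m[1]])                 # two groups: before m[1], and from m[1] onwards
groups, idxs = split_model(m, [m[1]], want_idxs=True)  # also return the split indexes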