Einops tutorial (ported)#

This tutorial is a port using xarray-einstats of the einops basics tutorial

Einops meets xarray!#

We don’t write

y = x.transpose(0, 2, 3, 1)

nor we write the more comprehensible alternative

y = rearrange(x, 'b c h w -> b h w c')

we write comprehensible code and use labeled arrays

y_da = rearrange(x_da, 'batch height width channel')
# or, also equivalent
y_da = x_da.einops.rearrange('batch height width channel')

x_da is a xarray.DataArray whose dimensions are already labeled, thus, we can skip the left side that defines the names of the dimensions.

xarray-einstats wraps einops functions to extend them to work on xarray objects.

What’s in this tutorial?#

fundamentals: reordering, composition and decomposition of axes
operations: rearrange, reduce, repeat
how much you can do with a single operation!

Preparations#

# Examples are given for numpy. This code also setups ipython/jupyter
# so that numpy arrays in the output are displayed as images
import numpy
import xarray
from xarray_einstats.tutorial import display_np_arrays_as_images

display_np_arrays_as_images()

Note

This cell above configures jupyter to display numpy arrays as images, which is a great visual help to understand the operations performed by einops. To take advantage of this we use .values. In some specific cases, we also omit it in order to show the values of the dimensions of the DataArray. If you are running this yourself we encourage you to try both views

Load a batch of images to play with#

Download the data for local use

The images are stored as a zarr store in the xarray-einstats repo. You can download it directly from your browser using https://download-directory.github.io/ and pasting https://github.com/arviz-devs/xarray-einstats/tree/main/docs/source/tutorials/einops-image.zarr there. Then move the file if necessary, uncompress and rename it so you can load it with open_zarr as shown below.

ds = xarray.open_zarr("einops-image.zarr").load()
ims = ds["ims"]
# There are 6 images of shape 96x96 with 3 color channels packed into the ims DataArray
ds

<xarray.Dataset>
Dimensions:  (batch: 6, height: 96, width: 96, channel: 3)
Dimensions without coordinates: batch, height, width, channel
Data variables:
    ims      (batch, height, width, channel) float64 1.0 0.902 ... 1.0 0.8039

display the first image (whole 4d tensor can’t be rendered)

ims.sel(batch=0).values

../_images/e6e7de9e4042ba65776da838c1002adb5714d803b24136dd0e6fef5f3749dbfc.png

second image in a batch

ims.sel(batch=1).values

../_images/7fcef7afebb6c88cc7e92738c124388b4a088675df5d28aa51c3f96c1ecf7682.png

we’ll use three operations

from xarray_einstats.einops import rearrange, reduce #, repeat

rearrange, as its name suggests, rearranges elements.

Below we swapped height and width. This can be seen as transposing the first two dimensions, but in xarray, by design, the order of the dimensions should not matter, and that exact code below should (and will work) if the input object has the same dimension names but different order (in which case the operation might be transposing 1st and 3rd dims) or doing nothing. Having said that, rearranging is still a valuable operation on xarray objects, especially if done right before accessing the underlying numpy or dask array.

By rearranging, we are enforcing the width, height, channel order.

ims.sel(batch=0).einops.rearrange('width height channel').values

../_images/84ac4fb0d2d810dec8ee3a3b7496a515227dd872efd0ba8007557c1766b24d59.png

Composition of axes#

transposition is very common and useful, but let’s move to other capabilities provided by einops

einops allows seamlessly composing batch and height to a new height dimension, something that tends to be tricky in xarray once we move from a standard stack.

We just rendered all images by collapsing to 3d tensor!

da = rearrange(ims, '(batch height) width channel')
da.values

../_images/9c0475fd10c6462f483940aa508a79dfb0b44cad29a11b2efcc22d0e37ca613a.png

but wait, dimensions must be named in xarray, so what happened with this stacked dimension we have just created?

xarray_einstats assigns it a dimension name based on the names of the parent dimensions:

da.dims

('batch-height', 'width', 'channel')

we can also compose a new dimension of batch and width. And now we will name it manually (strongly recommended over relying on the automatic names)

ims.einops.rearrange('(batch width)=batched_widths channel').values

../_images/af739ece801e10a8f4a5ad25b74300399b934b0cb3df6cae179c15a247b2b8a7.png

note that here we have skipped the height dimension not only from the input but also from the output expression. xarray_einstats follows xarray convention of adding the new or modified (aka present in the output expression) dimensions at the end.

As dimensions are already named, we can skip dimensions not only from the input as we have been doing but also from the output if we don’t mind the new dimensions being moved to the right of the omitted ones.

Resulting dimensions are computed very simply. The length of newly composed axis is a product of components: [6, 96, 96, 3] -> [96, (6 * 96), 3]

rearrange(ims, '(batch width) channel').shape

(96, 576, 3)

We can compose more than two axes. Let’s flatten 4d array into 1d, resulting array has as many elements as the original

rearrange(ims, '(batch height width channel)').shape

(165888,)

Note

Everything we have done so far could have been done with transpose or with stack, so choosing between those methods or einops is a matter of personal choice. We’d recommend you stick with the original xarray methods, especially if working with dask arrays, their defaults will be much more convenient than the lack of automatic dask handling in xarray_einstats.

The rearrangements below this point however can’t be reproduced by a single xarray method.

Decomposition of axis#

Decomposition is the inverse process, we represent an axis as a combination of new axes.

There will always be several decompositions possible, so we specify b1=2 to decompose batch to two dimensions b1 and b2 of lengths 2 and 3 respectively.

In addition, we also need to specify the name of the dimension we want to decompose.

ims.einops.rearrange('(b1 b2)=batch -> b1 b2 height width channel ', b1=2).shape

(2, 3, 96, 96, 3)

Finally, combine composition and decomposition:

da = rearrange(ims, '(b1 b2)=batch -> (b1 height) (b2 width) channel ', b1=2)
da.values

../_images/536195f4f94f2d5b01d60ae91a96750f6ada4ddbc6cce8ceb113ecca5d4fa237.png

Again, we skip naming the output dimensions so they are named by xarray_einstats

da.dims

('b1-height', 'b2-width', 'channel')

Slightly different composition: b1 is merged with width, b2 with height

… so letters are ordered by w then by h

rearrange(ims, '(b1 b2)=batch -> (b2 height) (b1 width) channel ', b1=2).values

../_images/415a9c978f75961d2d93b9efd0be57d7c26fe3545add4435e354f281c7a4567c.png

Wove part of width dimension to height. We should call this width-to-height as image width shrunk by 2 and height doubled.

but all pixels are the same!

Can you write reverse operation (height-to-width)?

rearrange(ims, '(w1 w2)=width -> (height w2) (batch w1) channel', w2=2).values

../_images/02b80fd10b8cebc25da2e4fe81c6c7e437a806cd80e5a8388b9355a59c66a34c.png

Order of axes matters#

Compare with the next two examples

rearrange(ims, '(batch width) channel').values

rearrange(ims, '(width batch) channel').values

../_images/a934ba5b307b8c521f6a760cba5443dfc569d986613fb24b357e10ef1e38eaf1.png

The order of axes in the composition is different.

The rule is just as for digits in the number: the leftmost digit is the most significant, while neighboring numbers differ in the rightmost axis. You can also think of this as lexicographic sort

And what if b1 and b2 are reordered before composing to width?

rearrange(ims, '(b1 b2)=batch -> (b1 b2 width) channel ', b1=2).values # produces 'einops'
rearrange(ims, '(b1 b2)=batch -> (b2 b1 width) channel ', b1=2).values # produces 'eoipns'

../_images/605cc18540aa1631dd2a71cce00f19c24f08a4d70ec98b405690eb2da5945142.png

Meet `reduce`#

From einops documentation:

In einops-land you don’t need to guess what happened
x.mean(-1)
Because you write what the operation does
reduce(x, 'b h w c -> b h w', 'mean')
if an axis is not present in the output — you guessed it — that axis is reduced.

using xarray objects, you already don’t have to guess what happened. Much like with rearrange, the first examples (three in this case) using reduce can be reproduced with a single xarray method. See the operation above with pure xarray and with reduce:

da.mean("channel")
reduce(x, "batch width height", "mean")

However, again much like with rearrange, reduce also opens the door to many other operations that go beyond what single xarray methods can do. These cases are worth making reduce work, and once it’s working the simple operations are possible automatically.

If you prefer thinking in terms of the dimensions that are kept instead of the ones that are reduced, using reduce can be more convenient even for simple operations.

# average over batch
ims.einops.reduce('height width channel', 'mean').values

../_images/279273dd39852c9d9ff1594dc927e4c2fe75c134261210ad65350de19d8387b4.png

# the previous is identical to:
ims.mean(dim="batch")    # as xarray operation
ims.values.mean(axis=0)  # as numpy operation

# Example of reducing of several axes 
# besides mean, there are also min, max, sum, prod
reduce(ims, 'height width', 'min').values

../_images/8ca2f573c548a6a4eb5883469eb3da0b2209231e20a70f3391297b42594ea6a8.png

# this is mean-pooling with 2x2 kernel
# image is split into 2x2 patches, each patch is averaged
reduce(ims, '(h h2)=height (w w2)=width -> h (batch w) channel', 'mean', h2=2, w2=2).values

../_images/fa71752fed71219a1075faae3eb3ea22b9e3e0179f94bc1ba01bacbf932bf7f5.png

# max-pooling is similar
# result is not as smooth as for mean-pooling
ims.einops.reduce('(h h2)=height (w w2)=width -> h (batch w) channel', 'max', h2=2, w2=2).values

../_images/f8e8ea90e132eafd34013d703e20c5d6bc6a25c636046254084b01a5f86b611e.png

# yet another example. Can you compute result shape?
reduce(ims, '(b1 b2)=batch -> (b2 height) (b1 width)', 'mean', b1=2).values

../_images/d3ebdab5628a2dd1496356bcbd5953ef8d26fcd4ca0d09743c551104b7349cdc.png

We skip the section about numpy-like stacking and concatenating because they aren’t relevant to xarray objects.

We have also reimagined the next section. We are working with xarray objects so adding new axis of length 1 to ensure broadcastability is not relevant either. We have therefore modified the section to a showcase of xarray automatic broadcasting and alignment.

Broadcasting and alignment#

As we are using xarray objects, we can directly operate between original and reduced inputs without the need for adding new dimensions, be it with [np.newaxis, :] or with 1 and () placeholders in einops expressions. See for yourself:

# compute max in each image individually, then show a difference 
x = reduce(ims, 'batch channel', 'max') - ims
rearrange(x, '(batch width) channel').values

../_images/db6f1ef731e60d7c43a914d55c80864aff2a756d81f6e641207e812fe9ccbf34.png

Repeating elements#

coming soon, for now jump to Fancy examples in random order

Third operation we introduce is repeat

# repeat along a new axis. New axis can be placed anywhere
repeat(ims[0], 'h w c -> h new_axis w c', new_axis=5).shape

# shortcut
repeat(ims[0], 'h w c -> h 5 w c').shape

# repeat along w (existing axis)
repeat(ims[0], 'h w c -> h (repeat w) c', repeat=3)

# repeat along two existing axes
repeat(ims[0], 'h w c -> (2 h) (2 w) c')

# order of axes matters as usual - you can repeat each element (pixel) 3 times 
# by changing order in parenthesis
repeat(ims[0], 'h w c -> h (w repeat) c', repeat=3)

Note: repeat operation covers functionality identical to numpy.repeat, numpy.tile and actually more than that.

Reduce ⇆ repeat#

reduce and repeat are like opposite of each other: first one reduces amount of elements, second one increases.

In the following example each image is repeated first, then we reduce over new axis to get back original tensor. Notice that operation patterns are “reverse” of each other

repeated = repeat(ims, 'b h w c -> b h new_axis w c', new_axis=2)
reduced = reduce(repeated, 'b h new_axis w c -> b h w c', 'min')
assert numpy.array_equal(ims, reduced)

Fancy examples in random order#

(a.k.a. mad designer gallery)

# interweaving pixels of different pictures
# all letters are observable
rearrange(ims, '(b1 b2)=batch -> (height b1) (width b2) channel ', b1=2).values

../_images/deb23e3856c4172993f85f1b1ce5295e7a5583cdfd0edae4a231d90e9f6b6247.png

# interweaving along vertical for couples of images
ims.einops.rearrange('(b1 b2)=batch -> (height b1) (b2 width) channel', b1=2).values

../_images/4ed7065f1f83a39e8c2cfe24199a32b3c5ab10c7acdec433b97ed5fef36ba9e9.png

# interweaving lines for couples of images
# exercise: achieve the same result without einops in your favourite framework
reduce(ims, '(b1 b2)=batch -> height (b2 width) channel', 'max', b1=2).values

../_images/3449994b9ce833aed8ac6d74f12dcf5e9eb417932cfebd8b62406869eeb6bf30.png

# color can be also composed into dimension
# ... while image is downsampled
reduce(ims, '(h h2)=height (w w2)=width -> (channel h) (batch w)', 'mean', h2=2, w2=2).values

../_images/a5d477ed5ba746d614745fe4f728b9b9fc30a03e37dbad05c031e528a7b099fd.png

# disproportionate resize
reduce(ims, '(h h4)=height (w w3)=width -> h (batch w)', 'mean', h4=4, w3=3).values

../_images/3d03ea7514d4aacfe67c476c1ec67ad586f5b4271d4de7a599f309e87ce81417.png

# spilt each image in two halves, compute mean of the two
ims.einops.reduce('(h1 h2)=height -> h2 (batch width)', 'mean', h1=2).values

../_images/5d23566a9d4fd4fd27fd0c497d02135d23e54929c91637120c15bc29a8eb5913.png

# split in small patches and transpose each patch
rearrange(ims, '(h1 h2)=height (w1 w2)=width -> (h1 w2) (batch w1 h2) channel', h2=8, w2=8).values

../_images/8622de973ab91c218747bfff14e949213227c0b2ecd92e3110b262c8fc8c1971.png

# stop me someone!
rearrange(
    ims, 
    '(h1 h2 h3)=height (w1 w2 w3)=width -> (h1 w2 h3) (batch w1 h2 w3) channel', 
    h2=2, w2=2, w3=2, h3=2
).values

../_images/d1fa643fda458fcdfd1a9bef1ffd08a5c6aead62dbaa87670be302c97a7cfaac.png

rearrange(
    ims, 
    '(b1 b2)=batch (h1 h2)=height (w1 w2)=width -> (h1 b1 h2) (w1 b2 w2) channel', 
    h1=3, w1=3, b2=3
).values

../_images/19d1dad6709c20768182c69fa4f8d0c94fcb48df78c72f58e088e2389bd88272.png

# patterns can be arbitrarily complicated
ims.einops.reduce(
    '(b1 b2)=batch (h1 h2 h3)=height (w1 w2 w3)=width -> (h1 w1 h3) (b1 w2 h2 w3 b2) channel', 
    'mean', h2=2, w1=2, w3=2, h3=2, b2=2
).values

../_images/f0ddee5031434809ebdc1621a60ab477500d56d0d6be19c2f07860f9f9935eb4.png

# subtract background in each image individually and normalize
im2 = reduce(ims, 'batch channel', 'max') - ims
im2 /= reduce(im2, 'batch channel', 'max')
rearrange(im2, '(batch width) channel').values

../_images/375210c02e626c16ee44f0e462a47caa2fb1aec8b4f34f2911c821271a5a1813.png

##### no repeat yet ####
# pixelate: first downscale by averaging, then upscale back using the same pattern
averaged = reduce(ims, 'b (h h2) (w w2) c -> b h w c', 'mean', h2=6, w2=8)
repeat(averaged, 'b h w c -> (h h2) (b w w2) c', h2=6, w2=8)

rearrange(ims, '(batch height) channel').values

../_images/c38cedecb357d1116fe15a6ecced492a6e963eb52299ba7e515ce23be836d410.png

# let's bring color dimension as part of horizontal axis
# at the same time horizontal axis is downsampled by 2x
reduce(ims, '(h h2)=height (w w2)=width -> (h w2) (batch w channel)', 'mean', h2=3, w2=3).values

../_images/9f18d399323cc62f0a78d696142319735ccd15dbd488c73d6c81c28fdbaa1ae7.png

Summary#

rearrange doesn’t change number of elements and covers different numpy functions (like transpose, reshape, stack, concatenate, squeeze and expand_dims)
reduce combines same reordering syntax with reductions (mean, min, max, sum, prod, and any others)
repeat additionally covers repeating and tiling
composition and decomposition of axes are a corner stone, they can and should be used together

%load_ext watermark
%watermark -n -u -v -iv -w -p einops,xarray_einstats

Last updated: Wed Jan 17 2024

Python implementation: CPython
Python version       : 3.11.7
IPython version      : 8.18.1

einops         : 0.7.0
xarray_einstats: 0.7.0

sys   : 3.11.7 | packaged by conda-forge | (main, Dec 15 2023, 08:38:37) [GCC 12.3.0]
numpy : 1.26.2
xarray: 2023.12.0

Watermark: 2.4.3