Getting Started

Understanding `Dim()`

Before diving in, a quick note on Dim(). It is a convenience function for creating named dimension tuples from a compact string:

from torchlinops import Dim

Dim("MN")    # -> ("M", "N"): each uppercase letter starts a new dimension
Dim("NxNy")  # -> ("Nx", "Ny"): an uppercase letter plus the lowercase letters after it form one name
Dim("M")     # -> ("M",)
Dim("")      # -> (): empty string for scalar/batch dims

You can also pass a plain tuple ("M", "N") anywhere Dim("MN") is accepted. Both forms are equivalent.
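The splitting rule described above can be sketched with a regular expression. This is a minimal illustration and not the library's implementation: `parse_dims` is a hypothetical helper, and it only handles the uppercase-plus-lowercase pattern shown in the examples.

```python
import re


def parse_dims(spec: str) -> tuple:
    """Split a compact string into dimension names.

    Each name is one uppercase letter followed by any lowercase letters.
    Hypothetical helper for illustration; not part of torchlinops.
    """
    return tuple(re.findall(r"[A-Z][a-z]*", spec))


print(parse_dims("MN"))
print(parse_dims("NxNy"))
print(parse_dims("M"))
print(parse_dims(""))
```

The result is an ordinary tuple of strings, which is why a plain tuple such as ("M", "N") can stand in for the compact string form.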

weightshape vs ishape/oshape

When creating operators like Dense, you'll encounter both:

weightshape — describes the dimensions of the matrix itself (the storage). For a matrix with shape (M, N), this is Dim("MN").
ishape / oshape — describe what the operator maps from and to. These are the dimensions of the input and output vectors.

For example, with a weight matrix W of shape (3, 7):

A = Dense(W, weightshape=Dim("MN"), ishape=Dim("N"), oshape=Dim("M"))
#                   ↑ matrix dims (M=3, N=7)       ↑ input (7)  ↑ output (3)
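One way to read the three arguments is that weightshape attaches a name to each axis of the stored matrix, while ishape and oshape pick out which of those named axes the input and output use. The sketch below works through that bookkeeping in plain Python. `dim_sizes` is a hypothetical helper, and the code does not call torchlinops.

```python
def dim_sizes(names, shape):
    """Pair each dimension name with its size.

    Hypothetical helper for illustration; not part of torchlinops.
    """
    if len(names) != len(shape):
        raise ValueError("need exactly one name per axis")
    return dict(zip(names, shape))


# The weight matrix has shape (3, 7) and is labeled ("M", "N").
weight = dim_sizes(("M", "N"), (3, 7))

# ishape=("N",) and oshape=("M",) then determine the vector sizes.
input_shape = tuple(weight[d] for d in ("N",))
output_shape = tuple(weight[d] for d in ("M",))

print("input shape:", input_shape)
print("output shape:", output_shape)
```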

A simple example

Start with the imports:

import torch
from torchlinops import Dense, FFT, Dim

Create a dense operator that performs a matrix-vector multiply:

M, N = (3, 7)
w = torch.randn(M, N)
A = Dense(w, Dim("MN"), ishape=Dim("N"), oshape=Dim("M"))
print(A)

Create a test input and verify that everything checks out:

# Input to A should have size N
x = torch.randn(N)
y_ref = w @ x
y = A(x)
y2 = A @ x  # Alternative syntax
print("y_ref matches y:", torch.allclose(y_ref, y))
print("y_ref matches y2:", torch.allclose(y_ref, y2))

Compute the adjoint and apply it:

b = torch.randn(M)
c = A.H(b)
print(A.H)
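A common way to check that an adjoint is correct is the dot-product test: for any x and b, the inner product of A x with b should equal the inner product of x with A^H b. The sketch below runs that test in plain Python on a small complex matrix. The helpers `matvec`, `adjoint`, and `inner` are illustrative names and not torchlinops functions.

```python
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def adjoint(W):
    """Conjugate transpose of a matrix stored as a list of rows."""
    return [[W[m][n].conjugate() for m in range(len(W))] for n in range(len(W[0]))]


def inner(u, v):
    """Complex inner product, conjugating the first argument."""
    return sum(ui.conjugate() * vi for ui, vi in zip(u, v))


rng = random.Random(0)


def rand_complex():
    return complex(rng.gauss(0, 1), rng.gauss(0, 1))


M, N = 3, 7
W = [[rand_complex() for _ in range(N)] for _ in range(M)]
x = [rand_complex() for _ in range(N)]
b = [rand_complex() for _ in range(M)]

lhs = inner(matvec(W, x), b)           # <A x, b>
rhs = inner(x, matvec(adjoint(W), b))  # <x, A^H b>
adjoint_ok = abs(lhs - rhs) < 1e-9
print("adjoint test passed:", adjoint_ok)
```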

Compute the normal operator and apply it:

u = torch.randn(N)
v = A.N(u)
print(A.N)
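Mathematically, the normal operator is A^H A: it maps a length-N vector to a length-N vector. The plain-Python sketch below, with a small real matrix chosen for illustration, checks that applying A and then its adjoint gives the same result as multiplying by the precomputed Gram matrix. It does not use torchlinops.

```python
def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def transpose(W):
    """Transpose of a real matrix; for real entries this is the adjoint."""
    return [list(col) for col in zip(*W)]


def matmul(A, B):
    """Matrix-matrix product for matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


W = [[1.0, 2.0, 0.0],
     [0.0, 1.0, 3.0]]  # M = 2, N = 3
u = [1.0, -1.0, 2.0]

two_step = matvec(transpose(W), matvec(W, u))  # apply A, then A^H
gram = matmul(transpose(W), W)                 # N x N matrix A^H A
one_step = matvec(gram, u)

print(two_step)
print(one_step)
```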

Composing linops

Linops can be composed with the @ operator, creating a Chain.

A note on printing linops

When several linops are chained together, they are printed from top to bottom in order of execution, which matches the behavior of nn.Sequential. For A = F @ D, D is applied first and is therefore printed first.

D = Dense(w, Dim("MN"), ishape=Dim("N"), oshape=Dim("M"))
F = FFT(ndim=1, grid_shapes=(Dim("M"), Dim("K")), batch_shape=Dim())
A = F @ D
x = torch.randn(N)
y = A(x)
print(A)
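The relationship between the @ syntax and execution order can be sketched with a toy chain. `ToyOp` and `ToyChain` are hypothetical classes written for this illustration; they are not the library's Chain and only mimic the ordering convention described above.

```python
class ToyOp:
    """A named callable that supports composition with @."""

    def __init__(self, name, fn):
        self.name = name
        self.fn = fn

    def __call__(self, x):
        return self.fn(x)

    def __matmul__(self, other):
        # (self @ other)(x) == self(other(x)), so `other` runs first.
        return ToyChain(_ops(other) + _ops(self))


class ToyChain(ToyOp):
    """Holds operators in execution order."""

    def __init__(self, ops):
        self.ops = list(ops)

    def __call__(self, x):
        for op in self.ops:
            x = op(x)
        return x

    def __repr__(self):
        # One operator per line, first-executed at the top.
        return "\n".join(op.name for op in self.ops)


def _ops(op):
    """Flatten a chain into its operators; wrap a single operator in a list."""
    return op.ops if isinstance(op, ToyChain) else [op]


D = ToyOp("D", lambda x: 2 * x)
F = ToyOp("F", lambda x: x + 1)
A = F @ D
print(A(3))
print(A)
```

Written as F @ D, the chain reads right to left like a matrix product, but it is stored and printed in the order the operators actually run.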

Advanced: `torchlinops.config`

Reducing `Identity` inside normal

from torchlinops import config
config.reduce_identity_in_normal = False
print(A.N) # Contains Identity in the middle since FFT.H @ FFT = Id

config.reduce_identity_in_normal = True
print(A.N) # No longer contains Identity

What does reduce_identity_in_normal do?

When computing the normal operator \(A^H A\), some operators have the property that \(F^H F = I\) (e.g., FFT with orthonormal normalization), so their normal operator is an Identity. This setting controls whether such Identity operators are removed from the normal chain.

True (default): Removes the Identity that results from FFT.H @ FFT, producing a more compact operator graph.
False: Keeps the Identity in the chain, which can be useful for debugging or when you need to inspect the exact computation.

Note that you must call reset_adjoint_and_normal() after changing this setting to clear the cached operators.
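The property \(F^H F = I\) for an orthonormally scaled DFT can be checked numerically. The sketch below uses a direct O(n^2) transform in plain Python rather than the library's FFT. The function `dft` and its `adjoint` flag are illustrative names.

```python
import cmath
import math


def dft(x, adjoint=False):
    """Orthonormal DFT of a sequence.

    Scaling by 1/sqrt(n) makes the adjoint (conjugated exponent) equal
    to the inverse transform.
    """
    n = len(x)
    sign = 1 if adjoint else -1
    return [
        sum(x[j] * cmath.exp(sign * 2j * math.pi * j * k / n) for j in range(n))
        / math.sqrt(n)
        for k in range(n)
    ]


x = [1 + 0j, 2 - 1j, 0.5j, -3 + 0j]
roundtrip = dft(dft(x), adjoint=True)  # F^H (F x)
max_err = max(abs(a - b) for a, b in zip(x, roundtrip))
print("max |F^H F x - x| =", max_err)
```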

Temporarily modifying config

Use config.using() as a context manager to temporarily modify config values:

from torchlinops import config

with config.using(reduce_identity_in_normal=False):
    # reduce_identity_in_normal is False here
    print(A.N)
# original value restored

You can modify multiple values at once:

with config.using(reduce_identity_in_normal=False, log_device_transfers=False):
    # multiple values temporarily changed
    pass
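For readers curious how a context manager of this kind can restore values on exit, here is a minimal sketch built on `contextlib`. `ToyConfig` is a hypothetical stand-in and not the torchlinops implementation. The only setting it carries is reduce_identity_in_normal, with the default stated above.

```python
from contextlib import contextmanager


class ToyConfig:
    """Hypothetical settings object; not the torchlinops config."""

    def __init__(self):
        self.reduce_identity_in_normal = True

    @contextmanager
    def using(self, **overrides):
        unknown = [k for k in overrides if not hasattr(self, k)]
        if unknown:
            raise AttributeError(f"unknown config option(s): {unknown}")
        saved = {k: getattr(self, k) for k in overrides}
        for key, value in overrides.items():
            setattr(self, key, value)
        try:
            yield self
        finally:
            # Restore the previous values even if the body raised.
            for key, value in saved.items():
                setattr(self, key, value)


toy_config = ToyConfig()
with toy_config.using(reduce_identity_in_normal=False):
    inside = toy_config.reduce_identity_in_normal
after = toy_config.reduce_identity_in_normal
print(inside, after)
```

Restoring in a `finally` block means an exception inside the `with` body does not leave the setting changed.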

Splitting linops

Linops can be split across sub-problems (e.g., for multi-GPU processing) using split_linop. See Multi-GPU Splitting for the full explanation.

from torchlinops import split_linop
# splits = split_linop(A, ...)  # Splits A into sub-operators
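The idea behind splitting can be illustrated without the library: a matrix-vector product can be computed in independent pieces by splitting the matrix along its output dimension and concatenating the partial results. The sketch below is plain Python and does not call split_linop; `split_rows` is a hypothetical helper.

```python
def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def split_rows(W, chunk):
    """Split a matrix into row blocks of at most `chunk` rows each."""
    return [W[i:i + chunk] for i in range(0, len(W), chunk)]


W = [[1.0, 2.0],
     [3.0, 4.0],
     [5.0, 6.0]]
x = [1.0, -1.0]

blocks = split_rows(W, 2)  # each block could run on its own device
y_split = [value for block in blocks for value in matvec(block, x)]
y_full = matvec(W, x)

print(y_split)
print(y_full)
```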

Creating a custom linop

You can create your own NamedLinop by subclassing it and implementing fn() and adj_fn() as static methods. See the Custom Linops How-To Guide for the full requirements and a complete example.
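To show the shape of the pattern, here is a toy sketch in plain Python. It does not subclass NamedLinop: `ToyLinop`, `Scale`, and the `(linop, x)` signature used for the static methods are assumptions made for illustration, so consult the how-to guide for the real requirements.

```python
class ToyLinop:
    """Hypothetical base class; forward and adjoint maps are static methods."""

    @staticmethod
    def fn(linop, x):
        raise NotImplementedError

    @staticmethod
    def adj_fn(linop, x):
        raise NotImplementedError

    def __call__(self, x):
        return self.fn(self, x)

    def adjoint_apply(self, x):
        return self.adj_fn(self, x)


class Scale(ToyLinop):
    """Elementwise multiplication by fixed weights (a diagonal operator)."""

    def __init__(self, weights):
        self.weights = list(weights)

    @staticmethod
    def fn(linop, x):
        return [w * xi for w, xi in zip(linop.weights, x)]

    @staticmethod
    def adj_fn(linop, x):
        # The adjoint of a diagonal operator conjugates the weights.
        return [w.conjugate() * xi for w, xi in zip(linop.weights, x)]


def inner(u, v):
    """Complex inner product, conjugating the first argument."""
    return sum(ui.conjugate() * vi for ui, vi in zip(u, v))


S = Scale([2j, 1 - 1j])
x = [1 + 1j, 2 + 0j]
y = [3 + 0j, -1j]
lhs = inner(S(x), y)                # <S x, y>
rhs = inner(x, S.adjoint_apply(y))  # <x, S^H y>
print(abs(lhs - rhs) < 1e-12)
```

Checking that the two inner products agree is a quick way to catch a mismatch between a forward function and its adjoint.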