
Add a custom optimizer that implements AdamW with proper magnitude computation for complex tensors #420

Merged · 15 commits into neuraloperator:main on Aug 26, 2024

Conversation

dhpitt (Member) commented on Aug 20, 2024

torch.optim optimizers view all complex parameters as real before computing update steps. Even after the 2021 update that added complex-tensor support, the gradient magnitudes accumulated for the squared-gradient (second-moment) term are thrown off: computing grad * grad.conj() on the real view, where each complex entry is stored as a stacked pair of two real components, just squares the real and imaginary components separately. That is not the same as the actual complex product grad * grad.conj(), which yields the squared magnitude re² + im²:

```python
>>> x = torch.randn((2,2), dtype=torch.cfloat)
>>> x
tensor([[-1.1954+0.7736j, -0.3455+0.1481j],
        [-1.2845+0.6057j,  0.1566+0.3581j]])
>>> y = torch.view_as_real(x)
>>> y
tensor([[[-1.1954,  0.7736],
         [-0.3455,  0.1481]],

        [[-1.2845,  0.6057],
         [ 0.1566,  0.3581]]])
>>> torch.isclose(y**2, y * y.conj()).all()
tensor(True)
>>> torch.isclose(x**2, x * x.conj()).all()
tensor(False)
```
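For context, below is a minimal sketch of an AdamW-style step that keeps complex tensors complex. It is an illustrative outline, not the code merged in this PR; the function name and signature are hypothetical. The one substantive change from the stock torch.optim.AdamW recipe is that the second-moment accumulator uses the true squared magnitude (grad * grad.conj()).real, i.e. re² + im², computed on the complex tensor itself rather than on its component-wise real view.

```python
import torch

def adamw_step_complex(param, grad, exp_avg, exp_avg_sq, step,
                       lr=1e-3, betas=(0.9, 0.999), eps=1e-8,
                       weight_decay=1e-2):
    """One AdamW-style update for a complex parameter (illustrative sketch)."""
    beta1, beta2 = betas
    # Decoupled weight decay (the "W" in AdamW): shrink the parameter directly.
    param.mul_(1 - lr * weight_decay)
    # First moment stays complex, so the update direction is preserved.
    exp_avg.mul_(beta1).add_(grad, alpha=1 - beta1)
    # Second moment uses the true squared magnitude |grad|^2 = re^2 + im^2,
    # taken from the complex product grad * grad.conj(), not from the real view.
    exp_avg_sq.mul_(beta2).add_((grad * grad.conj()).real, alpha=1 - beta2)
    bias_correction1 = 1 - beta1 ** step
    bias_correction2 = 1 - beta2 ** step
    denom = (exp_avg_sq / bias_correction2).sqrt().add_(eps)  # real-valued
    param.add_(exp_avg / denom, alpha=-lr / bias_correction1)
    return param

# Usage on a small complex parameter; note the second moment is real-valued.
p = torch.randn(4, dtype=torch.cfloat)
g = torch.randn(4, dtype=torch.cfloat)
m = torch.zeros_like(p)   # first moment (complex)
v = torch.zeros(4)        # second moment (real)
adamw_step_complex(p, g, m, v, step=1)
```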

dhpitt marked this pull request as ready for review on August 20, 2024 at 16:54.
dhpitt merged commit 3c95d8e into neuraloperator:main on Aug 26, 2024 (2 checks passed).