
Cannot use torch.jit.trace to trace LightningModule in Lightning v1.7 #14036

Closed
J-shang opened this issue Aug 5, 2022 · 18 comments
Labels: bug (Something isn't working), lightningmodule (pl.LightningModule)

Comments

J-shang commented Aug 5, 2022

🐛 Bug

When I use torch.jit.trace to trace a LightningModule,
I get RuntimeError: XXX (LightningModule class name) is not attached to a Trainer.

This is because in Lightning 1.7.0 the trainer property raises a RuntimeError if the module is not attached to a Trainer:

https://github.com/Lightning-AI/lightning/blob/12a061f2aaefaa9ed9ccf81ab6f378835b675a7e/src/pytorch_lightning/core/module.py#L179

but torch.jit probes each attribute with hasattr:

https://github.com/pytorch/pytorch/blob/de0e03001d31523ef86c3d7852c87cdad6d96632/torch/_jit_internal.py#L749

and per the hasattr docstring: "Return whether the object has an attribute with the given name. This is done by calling getattr(obj, name) and catching AttributeError."
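This interaction can be shown with a minimal, stdlib-only sketch (the class name here is hypothetical, standing in for a LightningModule under v1.7.0): hasattr only swallows AttributeError, so a property that raises RuntimeError crashes the probe instead of making hasattr return False.

```python
# hasattr() calls getattr() and catches only AttributeError, so a
# property that raises RuntimeError propagates out of the check
# instead of making hasattr() return False.

class Probed:
    """Hypothetical stand-in for a LightningModule in v1.7.0."""
    @property
    def trainer(self):
        raise RuntimeError("Probed is not attached to a `Trainer`.")

try:
    hasattr(Probed(), "trainer")
except RuntimeError as e:
    # The exception escapes hasattr() and crashes torch.jit's probing
    print("hasattr propagated:", e)
```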

To Reproduce

Initialize any LightningModule under Lightning v1.7.0 and trace it with torch.jit.trace, without attaching a trainer to the module.

To Fix

Replace the RuntimeError with an AttributeError.

This fix works for me, but I don't know whether it would cause other problems.
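A sketch of the proposed change, using a hypothetical stand-in class rather than the actual Lightning source: raising AttributeError makes the property look absent to hasattr-based probing while still signaling the error to ordinary callers.

```python
class ModuleSketch:
    """Hypothetical stand-in for LightningModule's `trainer` property."""
    _trainer = None

    @property
    def trainer(self):
        if self._trainer is None:
            # AttributeError instead of RuntimeError, so torch.jit's
            # hasattr()-based attribute probing returns False instead
            # of crashing the trace.
            raise AttributeError(
                f"{type(self).__qualname__} is not attached to a `Trainer`."
            )
        return self._trainer

m = ModuleSketch()
print(hasattr(m, "trainer"))  # False — safe for hasattr-based probing
```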

Environment

  • Lightning Component (e.g. Trainer, LightningModule, LightningApp, LightningWork, LightningFlow): LightningModule
  • PyTorch Lightning Version (e.g., 1.5.0): 1.7.0
  • PyTorch Version (e.g., 1.10): 1.10
  • Python version (e.g., 3.9): 3.7.12

cc @carmocca @justusschock @awaelchli @Borda @ananthsub @ninginthecloud @jjenniferdai @rohitgr7

@J-shang J-shang added the needs triage Waiting to be triaged by maintainers label Aug 5, 2022
@J-shang J-shang changed the title Cannot use torch.jit.trace to trace LightningModule in Lightning v1.7 Cannot use torch.jit.trace to trace LightningModule in Lightning v1.7 Aug 5, 2022
@carmocca carmocca added bug Something isn't working lightningmodule pl.LightningModule and removed needs triage Waiting to be triaged by maintainers labels Aug 5, 2022
@carmocca carmocca added this to the pl:1.7.x milestone Aug 5, 2022
@carmocca carmocca self-assigned this Aug 5, 2022

carmocca commented Aug 5, 2022

Hi! Unfortunately, this is caused by a bug in PyTorch where properties are not correctly ignored: pytorch/pytorch#67146

As a workaround, you can use model.to_torchscript(method="trace")

@carmocca carmocca removed this from the pl:1.7.x milestone Aug 5, 2022

J-shang commented Aug 9, 2022

Hi! Unfortunately, this is caused by a bug in PyTorch where properties are not correctly ignored: pytorch/pytorch#67146

As a workaround, you can use model.to_torchscript(method="trace")

Thank you for your reply; model.to_torchscript(method="trace") works for me.

@Animesh081005

Hi @carmocca, I am getting the same error when using torch.jit.trace to trace a LightningModule. Unfortunately, I cannot use the above workaround, as torch.jit.trace is called internally by a library I am using with my LightningModule. Do you have any suggestions for making torch.jit.trace work with a LightningModule?

FYI, I am using pytorch-lightning version 1.7.3


rohitgr7 commented Sep 1, 2022

@Animesh081005 can you open an issue with a reproducible script?


kyoungrok0517 commented Sep 26, 2022

UPDATE: Solved in 1.7.7

The same happens with me.

@Erland366

I am still having this issue. I am using Lightning Flash ObjectDetector with a YOLOv5 backbone, and neither scripting nor tracing the model works. The error says

ModelAdapter is not attached to a Trainer.


Stack-Attack commented Nov 24, 2022

I am also still seeing this error on the most recent versions:
pytorch-lightning 1.8.3.post0
pytorch 1.13.0

@carmocca

Can you share the full error stacktrace?


kishcs commented Dec 16, 2022

I am also having the same issue with pytorch_lightning version 1.8.4.post0.
I am trying to convert the parseq torch model (https://github.com/baudm/parseq/releases/download/v1.0.0/parseq-bb5792a6.pt) to trt_ts with the sample code given here (https://pytorch.org/TensorRT/getting_started/getting_started_with_python_api.html).

Error:

Traceback (most recent call last):
  File "parseq_to_trt.py", line 18, in <module>
    trt_ts_module = torch_tensorrt.compile(model, inputs=inputs, enabled_precisions=enabled_precisions)
  File "/usr/local/lib/python3.8/dist-packages/torch_tensorrt/_compile.py", line 124, in compile
    ts_mod = torch.jit.script(module)
  File "/usr/local/lib/python3.8/dist-packages/torch/jit/_script.py", line 1286, in script
    return torch.jit._recursive.create_script_module(
  File "/usr/local/lib/python3.8/dist-packages/torch/jit/_recursive.py", line 473, in create_script_module
    concrete_type = get_module_concrete_type(nn_module, share_types)
  File "/usr/local/lib/python3.8/dist-packages/torch/jit/_recursive.py", line 424, in get_module_concrete_type
    concrete_type = concrete_type_store.get_or_create_concrete_type(nn_module)
  File "/usr/local/lib/python3.8/dist-packages/torch/jit/_recursive.py", line 365, in get_or_create_concrete_type
    concrete_type_builder = infer_concrete_type_builder(nn_module)
  File "/usr/local/lib/python3.8/dist-packages/torch/jit/_recursive.py", line 273, in infer_concrete_type_builder
    overloads.update(get_overload_name_mapping(get_overload_annotations(nn_module, ignored_properties)))
  File "/usr/local/lib/python3.8/dist-packages/torch/jit/_recursive.py", line 639, in get_overload_annotations
    item = getattr(mod, name, None)
  File "/usr/local/lib/python3.8/dist-packages/pytorch_lightning/core/module.py", line 179, in trainer
    raise RuntimeError(f"{self.__class__.__qualname__} is not attached to a `Trainer`.")
RuntimeError: PARSeq is not attached to a `Trainer`.
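The failing frame in the traceback is item = getattr(mod, name, None). A stdlib-only illustration (the class here is a hypothetical stand-in, not the actual model) of why even the three-argument form of getattr does not help: it falls back to the default only on AttributeError, so Lightning's RuntimeError escapes.

```python
class Mod:
    """Hypothetical stand-in for a LightningModule whose
    `trainer` property raises RuntimeError."""
    @property
    def trainer(self):
        raise RuntimeError("Mod is not attached to a `Trainer`.")

# The default is returned only when the attribute is missing
# (i.e. an AttributeError is raised internally):
print(getattr(Mod(), "missing", None))  # None

# Any other exception from a property propagates past the default:
try:
    getattr(Mod(), "trainer", None)
except RuntimeError as e:
    print("propagated:", e)
```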

@carmocca

You will need to torchscript the model first with model.to_torchscript(), and then pass the result to torch_tensorrt.compile()


naveenkumarkr723 commented Dec 21, 2022

Hi @carmocca,
I converted the parseq model with to_torchscript(). Can you now provide code to convert it to TRT using this link (https://pytorch.org/TensorRT/getting_started/getting_started_with_python_api.html)?

Duplicated in #16157

@laclouis5

This issue is still not solved. Is a fix to be expected soon?

@carmocca

@laclouis5 No, and most likely never, since PyTorch no longer works on TorchScript now that torch.compile has been released


hjp709394 commented Sep 4, 2023

Can we just create a dummy object and attach it to the LightningModule? @carmocca


eval-dev commented Dec 5, 2023

My workaround is to give it a dummy trainer: model._trainer = pl.Trainer()

@Apolloxyy

I am still having this issue. I am using Lightning Flash ObjectDetector with YOLOv5 backbone and neither script or trace the model works. The error says

ModelAdapter is not attached to a Trainer.

Hi, have you solved this problem? I encountered the same problem when I tried to load a Lightning model.

@Apolloxyy

(quotes @kishcs's comment and traceback above)

Hi, have you solved this problem? I encountered the same problem when I saved the trained Lightning model.

@williamgao07

Adding a dummy trainer addressed my issue, per this post: #17517 (comment)
