
Add support for INT64 types in TensorRT constant layer calibration #21041

Closed

Conversation

kevinch-nv
Contributor

Description

  • TensorRT supports INT64 types since TRT 10, so add support for them when calibrating constant layers.
  • Add more logging statements for helpful debug messages to users when there is an issue reading from a calibration cache.

Motivation and Context

  • This change is necessary for functional parity with TRT 10.
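
To make the first bullet concrete: the change amounts to a new kINT64 case in the switch that reads constant-layer weights while computing dynamic ranges, compiled only against TensorRT 10 and newer. Below is a minimal, self-contained sketch of that pattern; the helper name `MaxAbsWeight`, the surrounding loop, and the extra cases are illustrative assumptions, not the PR's exact code (the real diff to `SetDynamicRange` is reviewed later in this thread).

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>

#include <NvInfer.h>

// Sketch (assumed structure, not the PR's code): find the maximum absolute
// value of a constant layer's weights so it can be used as a dynamic range
// during calibration. The kINT64 case only exists on TensorRT 10+, hence the
// version guard.
static double MaxAbsWeight(const nvinfer1::Weights& trt_weights) {
  double max_weight = 0.0;
  for (int64_t k = 0; k < trt_weights.count; ++k) {
    double weight = 0.0;
    switch (trt_weights.type) {
      case nvinfer1::DataType::kFLOAT:
        weight = static_cast<const float*>(trt_weights.values)[k];
        break;
      case nvinfer1::DataType::kINT32:
        weight = static_cast<const int32_t*>(trt_weights.values)[k];
        break;
#if NV_TENSORRT_MAJOR >= 10
      case nvinfer1::DataType::kINT64:
        // Explicit cast avoids MSVC C4244 (int64_t -> double narrowing); see
        // the review discussion further down in this thread.
        weight = static_cast<double>(static_cast<const int64_t*>(trt_weights.values)[k]);
        break;
#endif
      default:
        break;  // other data types are not handled in this sketch
    }
    max_weight = std::max(max_weight, std::abs(weight));
  }
  return max_weight;
}
```
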

Signed-off-by: Kevin Chen <kevinch@nvidia.com>
@chilo-ms
Contributor

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline

@chilo-ms
Contributor

/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, ONNX Runtime React Native CI Pipeline, Windows x64 QNN CI Pipeline

@chilo-ms
Contributor

/azp run Linux MIGraphX CI Pipeline, orttraining-amd-gpu-ci-pipeline


Azure Pipelines successfully started running 2 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).

@chilo-ms
Contributor

/azp run Big Models, Linux CPU Minimal Build E2E CI Pipeline


Azure Pipelines successfully started running 2 pipeline(s).

@chilo-ms
Contributor

Please help fix the Lint / Python format check.

Signed-off-by: Kevin Chen <kevinch@nvidia.com>
@chilo-ms
Contributor

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline

@chilo-ms
Contributor

/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, ONNX Runtime React Native CI Pipeline, Windows x64 QNN CI Pipeline

@chilo-ms
Contributor

/azp run Big Models, Linux CPU Minimal Build E2E CI Pipeline


Azure Pipelines successfully started running 9 pipeline(s).


Azure Pipelines successfully started running 2 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).

@chilo-ms
Contributor

/azp run Linux Android Emulator QNN CI Pipeline


Azure Pipelines successfully started running 1 pipeline(s).

chilo-ms previously approved these changes Jun 14, 2024
@@ -108,13 +111,19 @@ bool SetDynamicRange(nvinfer1::INetworkDefinition& network, std::unordered_map<s
      case nvinfer1::DataType::kINT32:
        weight = static_cast<const int32_t*>(trt_weights.values)[k];
        break;
#if NV_TENSORRT_MAJOR >= 10
      case nvinfer1::DataType::kINT64:
        weight = static_cast<const int64_t*>(trt_weights.values)[k];
Contributor

Converting "const int64_t" to "double" may lost data.

warning C4244: '=': conversion from 'const int64_t' to 'double', possible loss of data

Member

@kevinch-nv, fyi this is causing the build to fail.

Contributor Author

Updated
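
The follow-up commit itself isn't shown in this excerpt; a common way to resolve C4244 is to make the int64_t-to-double conversion explicit. A hedged sketch (hypothetical helper name, not necessarily the PR's final code):

```cpp
#include <cstdint>

// Illustration of silencing MSVC warning C4244: the implicit conversion in
//     weight = static_cast<const int64_t*>(trt_weights.values)[k];
// narrows int64_t to double; making the cast explicit keeps the behavior and
// silences the warning.
inline double Int64WeightToDouble(const void* values, int64_t k) {
  return static_cast<double>(static_cast<const int64_t*>(values)[k]);
}
```
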

@chilo-ms
Contributor

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline

@chilo-ms
Contributor

/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, ONNX Runtime React Native CI Pipeline, Windows x64 QNN CI Pipeline


Azure Pipelines successfully started running 9 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).

@chilo-ms
Contributor

/azp run Linux MIGraphX CI Pipeline, orttraining-amd-gpu-ci-pipeline Big Models, Linux CPU Minimal Build E2E CI Pipeline


Azure Pipelines successfully started running 2 pipeline(s).

@chilo-ms
Contributor

/azp run Big Models, Linux Android Emulator QNN CI Pipeline


Azure Pipelines successfully started running 2 pipeline(s).

Signed-off-by: Kevin Chen <kevinch@nvidia.com>
@chilo-ms
Contributor

Please help fix the format.

BTW, I created a duplicate PR in case this PR can't be updated in time for the patch release.

@jywu-msft
Member

> Please help fix the format.
>
> BTW, I created a duplicate PR in case this PR can't be updated in time for the patch release.

#21101

jywu-msft pushed a commit that referenced this pull request Jun 20, 2024
…21101)

This PR is a duplicate of #21041. It was created in case the original one can't be updated in time for the patch release.
yf711 pushed a commit that referenced this pull request Jun 21, 2024
…21101)

This PR is a duplicate of #21041. It was created in case the original one can't be updated in time for the patch release.
@jywu-msft
Member

superseded by #21101

jywu-msft closed this Jun 24, 2024