initial tensorrt ep commit #921

manickavela29 · 2024-05-26T17:59:40Z

Adds the TensorRT execution provider (EP).
Observed that the encoder model runs much faster with the TensorRT backend than with the CUDA backend.
Initializing the ONNX Runtime session takes longer, because the TensorRT engines are generated and converted from the models at startup.

ToDo :

Validate the accuracy of model inference
Reduce the startup time
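
For the accuracy item above, one simple check is to feed the same input to a CUDA session and a TensorRT session and compare the flattened outputs element-wise. The sketch below only covers the comparison step; the helper names and the default tolerance are illustrative assumptions, not code from this PR.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical helper: largest absolute element-wise difference between two
// flattened output tensors. Returns +infinity when the sizes differ and NaN
// when any difference is NaN, so neither case can pass a tolerance check.
inline float MaxAbsDiff(const std::vector<float> &a,
                        const std::vector<float> &b) {
  if (a.size() != b.size()) {
    return std::numeric_limits<float>::infinity();
  }
  float max_diff = 0.0f;
  for (std::size_t i = 0; i < a.size(); ++i) {
    const float d = std::fabs(a[i] - b[i]);
    if (std::isnan(d)) {
      return d;
    }
    max_diff = std::max(max_diff, d);
  }
  return max_diff;
}

// Hypothetical check: true when the two backends agree within `tol`.
// The default tolerance is an assumption; FP16 engines may need a looser one.
inline bool OutputsMatch(const std::vector<float> &reference,
                         const std::vector<float> &candidate,
                         float tol = 1e-3f) {
  return MaxAbsDiff(reference, candidate) <= tol;
}
```

In practice `reference` would hold the CUDA output and `candidate` the TensorRT output for the same utterance.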

Ref : #40, #41
CC : @csukuangfj

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

manickavela29 · 2024-06-03T10:30:22Z

I will send a separate PR to handle the ONNX Runtime EP configs, along with the TensorRT config options.

I am in the middle of something else, so it might take some time.
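
As a rough idea of what such TensorRT config options could look like, the sketch below turns a small config struct into the string key/value pairs that ONNX Runtime's `UpdateTensorRTProviderOptions` accepts. The struct and function names are hypothetical. The key names (`device_id`, `trt_fp16_enable`, `trt_engine_cache_enable`, `trt_engine_cache_path`) are the ones the ONNX Runtime TensorRT EP documentation lists, to the best of my knowledge, and should be checked against the ONNX Runtime version in use. Engine caching is included because it is the usual way to avoid rebuilding engines on every start.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical config struct; field names are illustrative only.
struct TrtConfigSketch {
  int device_id = 0;
  bool fp16 = false;
  bool engine_cache = true;
  std::string engine_cache_path = ".";
};

using KeyValue = std::pair<std::string, std::string>;

// Converts the struct into provider-option key/value strings.
// The cache path is only emitted when engine caching is enabled.
inline std::vector<KeyValue> ToProviderOptions(const TrtConfigSketch &c) {
  std::vector<KeyValue> kv;
  kv.emplace_back("device_id", std::to_string(c.device_id));
  kv.emplace_back("trt_fp16_enable", c.fp16 ? "1" : "0");
  kv.emplace_back("trt_engine_cache_enable", c.engine_cache ? "1" : "0");
  if (c.engine_cache) {
    kv.emplace_back("trt_engine_cache_path", c.engine_cache_path);
  }
  return kv;
}
```

The resulting pairs would then be split into parallel key and value arrays before being handed to the provider options object.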

manickavela29 · 2024-06-03T18:47:51Z

I think the build failures come from the Python dependency on nvinfer and similar libraries. I will try adding some libraries and check.

csukuangfj · 2024-06-04T02:21:54Z

https://github.com/k2-fsa/sherpa-onnx/actions/runs/9354678305/job/25748084556#step:5:1121

[ 62%] Linking CXX executable ../../bin/sherpa-onnx
/opt/rh/devtoolset-10/root/usr/libexec/gcc/x86_64-redhat-linux/10/ld: ../../lib/libsherpa-onnx-core.so: undefined reference to `OrtSessionOptionsAppendExecutionProvider_Tensorrt'
collect2: error: ld returned 1 exit status
make[2]: *** [bin/sherpa-onnx] Error 1
make[1]: *** [sherpa-onnx/csrc/CMakeFiles/sherpa-onnx.dir/all] Error 2
make: *** [all] Error 2

I think an extra library needs to be linked for TensorRT.

manickavela29 · 2024-06-04T07:51:13Z

Yes, it seems that calling OrtSessionOptionsAppendExecutionProvider_Tensorrt() directly as the interface does not work,
and AppendExecutionProvider_TensorRT_V2 is actually a wrapper around that function.

So it should be all right now.
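
One plausible reason the switch removed the link error: the legacy function is an exported symbol that the linker must find in the ONNX Runtime library, whereas the C++ `AppendExecutionProvider_TensorRT_V2` method is, as far as the ONNX Runtime headers show, dispatched through the runtime API table, so a missing provider is reported when the call runs rather than when the binary is linked. The mock below illustrates only that pattern; it is not ONNX Runtime code, and every name in it is hypothetical.

```cpp
#include <string>

// Mock of a runtime API table: every entry is a function pointer that is
// always present, so callers never reference an optional exported symbol.
struct MockApi {
  // Returns an empty string on success, or an error message on failure.
  std::string (*AppendTensorrt)(int device_id);
};

// What a build without TensorRT support could install in the table.
inline std::string AppendTensorrtUnsupported(int /*device_id*/) {
  return "TensorRT execution provider is not enabled in this build";
}

// What a TensorRT-enabled build could install instead.
inline std::string AppendTensorrtSupported(int /*device_id*/) { return ""; }

// Caller-side helper: goes through the table and reports failure at runtime.
inline bool TryEnableTensorrt(const MockApi &api, int device_id,
                              std::string *error) {
  *error = api.AppendTensorrt(device_id);
  return error->empty();
}
```

With this shape, the same caller code links against either build, and only the table contents differ.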

csukuangfj · 2024-06-05T02:15:35Z

Please merge the latest master into your branch and the CI should pass or you can just ignore the failed tests.

Could you describe what users need to do to build sherpa-onnx with TensorRT support?

csukuangfj · 2024-06-05T02:17:38Z

please leave a comment if you think it is ready to merge.

manickavela29 · 2024-06-05T03:30:15Z

Regarding what is required to run TensorRT:

Hardware: NVIDIA GPU 😄 (not sure how compatible AMD GPUs are)
Software libraries: TensorRT requires libnvinfer, libnvinfer-dispatch, libnvinfer-plugin, and libnvonnxparsers8 (from my experience)

But since the CI/CD runs are working fine, I think ONNX Runtime is able to handle these;
as long as someone has a compatible GPU, it should work fine.

Just setting the appropriate provider argument should be enough.
If there is a GitHub page or doc for this, it can be updated there.

--provider=trt
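
A minimal sketch of how a `--provider` string such as `trt` could be mapped to a backend choice, with a CPU fallback for unknown values. The enum and function names are hypothetical and are not taken from the sherpa-onnx sources.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical backend enum for illustration.
enum class ProviderSketch { kCPU, kCUDA, kTRT };

// Maps a provider string to a backend, ignoring case.
// Unknown strings fall back to the CPU backend.
inline ProviderSketch ParseProvider(std::string s) {
  std::transform(s.begin(), s.end(), s.begin(), [](unsigned char ch) {
    return static_cast<char>(std::tolower(ch));
  });
  if (s == "trt") {
    return ProviderSketch::kTRT;
  }
  if (s == "cuda") {
    return ProviderSketch::kCUDA;
  }
  return ProviderSketch::kCPU;
}
```

Falling back to CPU keeps a mistyped provider name from aborting the program, at the cost of silently running slower; logging a warning in that branch would be a reasonable addition.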

manickavela29 · 2024-06-05T03:43:25Z

It can be merged
once we have confirmed that the basic builds are working fine.

Thank you for the review 😄

manickavela29 · 2024-06-05T09:24:04Z

Regarding what is required to run TensorRT:

Hardware: NVIDIA GPU 😄 (not sure how compatible AMD GPUs are)
Software libraries: TensorRT requires libnvinfer, libnvinfer-dispatch, libnvinfer-plugin, and libnvonnxparsers8 (from my experience)

But since the CI/CD runs are working fine, I think ONNX Runtime is able to handle these; as long as someone has a compatible GPU, it should work fine.

Just setting the appropriate provider argument should be enough. If there is a GitHub page or doc for this, it can be updated there.

--provider=trt

I updated this comment: by mistake I had typed 'don't think onnxrt', which I have now edited.

manickavela29 · 2024-06-05T17:53:57Z

Please merge the latest master into your branch and the CI should pass or you can just ignore the failed tests.

Could you describe what users need to do to build sherpa-onnx with TensorRT support

I think the builds are also good enough.
Shall we merge it, @csukuangfj?

csukuangfj · 2024-06-06T02:44:29Z

Thank you for your contribution!

manickavela29 added 2 commits May 26, 2024 17:55

initial tensorrt commit

0e50b64

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

fixing cpplint

b714817

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

manickavela29 force-pushed the trt branch from 5c41063 to b714817 Compare May 27, 2024 12:58

csukuangfj reviewed May 28, 2024

sherpa-onnx/csrc/session.cc (outdated)

csukuangfj reviewed May 28, 2024

sherpa-onnx/csrc/session.cc (outdated)

csukuangfj reviewed May 28, 2024

sherpa-onnx/csrc/session.cc (outdated)

manickavela29 marked this pull request as draft June 3, 2024 09:52

Clean Tenssort provider

04ca048

-- cleaner implementation
-- releasing memory leak

Signed-off-by: manickavela1998@gmail.com <manickavela.arumugam@uniphore.com>

manickavela29 changed the title ~~initial tensorrt commit~~ initial tensorrt ep commit Jun 3, 2024

manickavela29 requested a review from csukuangfj June 3, 2024 10:27

manickavela29 marked this pull request as ready for review June 3, 2024 10:27

csukuangfj reviewed Jun 3, 2024

sherpa-onnx/csrc/session.cc

csukuangfj reviewed Jun 3, 2024

sherpa-onnx/csrc/session.cc (outdated)

manickavela29 added 2 commits June 3, 2024 10:42

adding comments

7f4b3c5

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

updating to static function

93f6fdf

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

manickavela29 requested a review from csukuangfj June 3, 2024 11:06

manickavela29 added 2 commits June 3, 2024 16:48

cpp lint

957996d

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

build errors fix attempt

db321c1

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

stensorrt interface rewamp

b208537

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

yuekaizhang reviewed Jun 5, 2024

sherpa-onnx/csrc/session.cc

csukuangfj reviewed Jun 5, 2024

sherpa-onnx/csrc/session.cc (outdated)

sherpa-onnx/csrc/session.cc (outdated)

sherpa-onnx/csrc/session.cc (outdated)

manickavela29 and others added 3 commits June 5, 2024 08:37

Update sherpa-onnx/csrc/session.cc

486c687

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

Update session.cc

e063d8c

Update sherpa-onnx/csrc/session.cc

f01c836

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

Merge branch 'k2-fsa:master' into trt

7752d0c

fix bug

62cd503

Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>

manickavela29 requested review from csukuangfj and yuekaizhang June 5, 2024 03:40

csukuangfj merged commit 69347ff into k2-fsa:master Jun 6, 2024
180 of 209 checks passed

manickavela29 deleted the trt branch June 6, 2024 04:00
