ort-perf.html - simple benchmark tool to automate perf testing for onnxruntime-web

Install

Dependencies

npm install

Models

We mostly use models provided by transformers.js.

The steps to download the models we use:

Set ORT_PERF_DIR to the root of this project.

Set TJS_DIR to the root of the transformers.js repo.

git clone https://github.com/xenova/transformers.js/

pip install optimum

cd $TJS_DIR
$ORT_PERF_DIR/download-models.sh

ort-perf assumes all models are in the models directory. Copy or link the transformers.js models to that models directory:

cd $ORT_PERF_DIR
mkdir models/
ln -s $TJS_DIR/models models/tjs

Interactive Run

npx light-server -s . -p 8888

Point your browser to http://localhost:8888/ort-perf.html

Automated run

npx light-server -s . -p 8888
npx playwright test

The Playwright configuration is currently somewhat hard-coded. Take a look at playwright.config.js and ort-perf.spec.js, and change the settings to suit your needs.
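As a rough illustration of how an automated run could cover several configurations, the sketch below generates one ort-perf.html URL per model and provider pair. The helper name buildRunUrls is hypothetical and this is not the content of ort-perf.spec.js; the base URL is taken from the light-server command above and the parameter names from the Options section.

```javascript
// Hypothetical helper, not part of this repo: builds one benchmark URL
// per (model, provider) pair using the documented URL options.
function buildRunUrls(models, providers, base = "http://localhost:8888/ort-perf.html") {
  const urls = [];
  for (const model of models) {
    for (const provider of providers) {
      const url = new URL(base);
      // searchParams.set percent-encodes values, so "/" in the model path
      // becomes "%2F"; URLSearchParams decodes it again when the URL is read.
      url.searchParams.set("model", model);
      url.searchParams.set("provider", provider);
      url.searchParams.set("go", "1");  // start the benchmark directly
      url.searchParams.set("csv", "1"); // generate csv output
      urls.push(url.toString());
    }
  }
  return urls;
}

const urls = buildRunUrls(
  ["tjs/t5-small/onnx/encoder_model.onnx"],
  ["wasm", "webgpu"]
);
console.log(urls.join("\n"));
```

A Playwright spec could loop over such a list and open each URL in turn; how results are collected depends on the settings in ort-perf.spec.js.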

Options

ort-perf is configured through query parameters on the URL. The following are currently supported:

model

The model path, e.g. tjs/t5-small/onnx/encoder_model.onnx

name

The model name.

filter

Filter the pre-configured models to the given set. The currently supported sets are default and tjs-demo. The latter contains all models used by https://xenova.github.io/transformers.js/ with similar parameters.

provider

wasm|webgpu|webnn

device

Only valid for webnn: cpu|gpu

threads

Number of threads to use.

profiler

1 = capture onnxruntime profile output for 20 runs

gen

How the input data for the model is generated. See ort-perf.html for supported types.

verbose

1 = verbose onnxruntime output

min_query_count

Minimum number of queries to run (default=30).

min_query_time

Minimum time to run (default=10sec)

seqlen

Sequence length if model supports it (default=128)

enc_seqlen

Encoder sequence length if the model supports it (default=128)

go

1 = start benchmark directly

csv

1 = generate csv output
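
To make the option handling concrete, here is a minimal sketch of how a page could read these URL parameters and fall back to the documented defaults. The names DOCUMENTED_DEFAULTS and parseOptions are hypothetical, and this is not the code used by ort-perf.html; only the option names and default values are taken from the list above.

```javascript
// Defaults as documented above. Options without a documented default
// are simply left unset.
const DOCUMENTED_DEFAULTS = {
  min_query_count: 30,
  min_query_time: 10, // seconds
  seqlen: 128,
  enc_seqlen: 128,
};

// Hypothetical parser: collects all query parameters as strings, then
// converts the numeric options and applies the documented defaults.
function parseOptions(href) {
  const params = new URL(href).searchParams;
  const opts = {};
  for (const [key, value] of params) {
    opts[key] = value;
  }
  for (const [key, fallback] of Object.entries(DOCUMENTED_DEFAULTS)) {
    const raw = params.get(key); // null when the parameter is absent
    const n = raw === null || raw === "" ? NaN : Number(raw);
    opts[key] = Number.isFinite(n) ? n : fallback;
  }
  return opts;
}

const opts = parseOptions(
  "http://localhost:8888/ort-perf.html?provider=webgpu&seqlen=64&go=1"
);
console.log(opts);
```

In this sketch, options that are not numeric (such as provider or go) stay as strings, so a flag like go would be compared against "1".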

