Skip to content

Commit

Permalink
Add the ability to benchmark multiple models concurrently (#850)
Browse files Browse the repository at this point in the history
* Add the ability to benchmark multiple models concurrently.
This is useful for benchmarking multiple LoRA adapters.
- Also fix the latency_throughput_curve.sh to parse non-integer request
  rate properly.
- Also added "errors" to the benchmark results.

* Re-sample requests for each model
  • Loading branch information
liu-cong authored Oct 23, 2024
1 parent 8d48829 commit a3401f2
Show file tree
Hide file tree
Showing 7 changed files with 229 additions and 115 deletions.
Loading

0 comments on commit a3401f2

Please sign in to comment.