Skip to content

Commit

Permalink
Add the ability to benchmark multiple models concurrently.
Browse files Browse the repository at this point in the history
This is useful for benchmarking multiple LoRA adapters.
- Also fix the latency_throughput_curve.sh to parse non-integer request
  rate properly.
- Also added "errors" to the benchmark results.
  • Loading branch information
liu-cong committed Oct 15, 2024
1 parent b0588cc commit e5ac29f
Show file tree
Hide file tree
Showing 2 changed files with 165 additions and 103 deletions.
Loading

0 comments on commit e5ac29f

Please sign in to comment.