Enabled Qwen2-MoE Tensor Parallelism (TP) inference #6551

…() for uniform code management. Both have the same function and the same result.

Commits on Oct 8, 2024

Merge branch 'master' into qwen2-moe

loadams authored Oct 8, 2024

Configuration menu

View commit details

Copy full SHA for 932d4b2

Browse repository at this point

Copy the full SHA

932d4b2 View commit details

Browse the repository at this point in the history