Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[LLM Inference] Support qwen2 #8893

Merged
merged 9 commits into from
Aug 12, 2024
Merged

Commits on Aug 6, 2024

  1. stage 1

    yuanlehome committed Aug 6, 2024
    Configuration menu
    Copy the full SHA
    e410e4a View commit details
    Browse the repository at this point in the history
  2. update

    yuanlehome committed Aug 6, 2024
    Configuration menu
    Copy the full SHA
    6f9d819 View commit details
    Browse the repository at this point in the history

Commits on Aug 7, 2024

  1. update

    yuanlehome committed Aug 7, 2024
    Configuration menu
    Copy the full SHA
    244802a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    bc75f59 View commit details
    Browse the repository at this point in the history
  3. support qwen2 bf16/wint8

    yuanlehome committed Aug 7, 2024
    Configuration menu
    Copy the full SHA
    a6bde28 View commit details
    Browse the repository at this point in the history

Commits on Aug 8, 2024

  1. add qwen2 ptq map

    yuanlehome committed Aug 8, 2024
    Configuration menu
    Copy the full SHA
    1b063ac View commit details
    Browse the repository at this point in the history

Commits on Aug 12, 2024

  1. Configuration menu
    Copy the full SHA
    febe651 View commit details
    Browse the repository at this point in the history
  2. update

    yuanlehome committed Aug 12, 2024
    Configuration menu
    Copy the full SHA
    6091ccb View commit details
    Browse the repository at this point in the history
  3. fix tune_cublaslt_gemm.cu

    yuanlehome committed Aug 12, 2024
    Configuration menu
    Copy the full SHA
    cc76b25 View commit details
    Browse the repository at this point in the history