Skip to content

cosmo3769/Quantized-LLMs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Quantized-LLMs

Visit the docs link for Literature Review.

Check all the colab notebooks here.

Check the document on LLM Quantization and Benchmarking here. It includes all my implementation details, challenges and adaptations.

Check the document on Deploying LLM to Mobile here. It includes all my implementation details, challenges and adaptations.

Published studio at Lightning AI

To-Dos

  • Benchmark GGUF format quantized model using lm-evaluation-harness and llama-cpp-python
  • HumanEval benchmark (non-quantized, quantized (GPTQ, GGUF))
  • Research SmoothQuant
  • Pruning
  • Distillation

About

Quantization of LLMs and benchmarking.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published