
Commit

Add papers
emphasis10 committed Sep 6, 2024
1 parent a92232c commit 3c381d0
Showing 2 changed files with 6 additions and 0 deletions.
1 change: 1 addition & 0 deletions README.md
@@ -1731,6 +1731,7 @@
#### [Manifold Diffusion Fields](summaries/2305.15586.md)
#### [QLoRA: Efficient Finetuning of Quantized LLMs](summaries/2305.14314.md)
#### [Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization](summaries/2305.14152.md)
#### [GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints](summaries/2305.13245.md)
#### [RWKV: Reinventing RNNs for the Transformer Era](summaries/2305.13048.md)
#### [Accurate Knowledge Distillation with n-best Reranking](summaries/2305.12057.md)
#### [LLM-Pruner: On the Structural Pruning of Large Language Models](summaries/2305.11627.md)
5 changes: 5 additions & 0 deletions summaries/2305.13245.md
@@ -0,0 +1,5 @@
# GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
## TL;DR
## Summary
- [https://arxiv.org/pdf/2305.13245.pdf](https://arxiv.org/pdf/2305.13245.pdf)
