Trying to understand LLMs. This is my journey so far:
- A failed experiment with LISA: "Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning", code, paper
- 🛠️ Memory-efficient LLM Training with GaLore, yet another PEFT approach, code
- ⚖️ Evaluating LLMs with Semantic Similarity, code
- 🛠️ Finetune TinyLlama and StableLM 2, code
- 🛠️ Finetune Microsoft's Phi-2, code
- 🛠️ Finetune Mamba, code
- 🛠️ Finetune Llama2 and Mistral using QLoRA, code
- ⚖️ Evaluate LLM language capabilities with Meta's Belebele benchmark, code
- ⚖️ Evaluate LLM language capabilities with BLEU, code
- ⚖️ Llama2-70B as a judge of LLMs performs almost as well as GPT-4, code
- ⚖️ Validation loss is not a good metric for chatbot quality
- ⚖️ Use GPT-3.5 as a judge of open-source LLMs, code
- 🛠️ Finetune Llama on podcast transcripts with QLoRA, code
- 🎨 Use Stable Diffusion for sketch-guided image generation, code
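The semantic-similarity evaluation above boils down to embedding a model answer and a reference answer, then scoring them by cosine similarity. Here is a minimal, self-contained sketch — the `embed` function is a toy bag-of-words stand-in, not the projects' actual code, which would presumably use a neural sentence-embedding model:

```python
import math
from collections import Counter

def embed(text):
    # toy bag-of-words "embedding"; a real eval would use a sentence-embedding model
    return Counter(text.lower().split())

def norm(vec):
    return math.sqrt(sum(c * c for c in vec.values()))

def cosine(a, b):
    # dot product over shared tokens, normalized by vector lengths
    dot = sum(count * b[token] for token, count in a.items())
    return dot / (norm(a) * norm(b))
```

With this toy embedding, `cosine(embed("Paris is the capital of France"), embed("The capital of France is Paris"))` is 1.0 because word order is ignored; a neural embedder would also score genuine paraphrases highly even without word overlap, which is the point of the approach.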
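The BLEU evaluation entry rests on a simple idea: score a candidate sentence by the geometric mean of its n-gram precisions against a reference, discounted by a brevity penalty. A minimal sentence-level sketch (with add-one smoothing on higher-order n-grams; real evals typically use a library implementation such as sacrebleu):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidate, reference, max_n=4):
    """Sentence-level BLEU: geometric mean of n-gram precisions times a brevity penalty."""
    cand, ref = candidate.split(), reference.split()
    precisions = []
    for n in range(1, max_n + 1):
        cand_counts = Counter(ngrams(cand, n))
        ref_counts = Counter(ngrams(ref, n))
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = max(sum(cand_counts.values()), 1)
        # add-one smoothing on higher-order n-grams keeps short sentences scorable
        precisions.append((overlap + 1) / (total + 1) if n > 1 else overlap / total)
    if min(precisions) == 0:
        return 0.0
    # brevity penalty punishes candidates shorter than the reference
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / max(len(cand), 1))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)
```

An exact match scores 1.0, a correct but truncated candidate scores somewhere in between, and a candidate with no unigram overlap scores 0.0 — which also illustrates BLEU's known weakness for chatbot-style answers, where a valid reply may share few words with the reference.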
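The LLM-as-judge entries (Llama2-70B and GPT-3.5 as judges) share one mechanical core: build a pairwise comparison prompt, send it to the judge model, and parse a verdict out of its free-form reply. A sketch of that scaffolding — the prompt template and verdict tokens are hypothetical, not the projects' actual ones, and the model call itself is omitted:

```python
def judge_prompt(question, answer_a, answer_b):
    # hypothetical pairwise-comparison template for a judge model
    return (
        "You are an impartial judge. Compare the two answers to the question.\n"
        f"Question: {question}\n"
        f"Answer A: {answer_a}\n"
        f"Answer B: {answer_b}\n"
        "Reply with exactly one token: A, B, or TIE."
    )

def parse_verdict(reply):
    # map the judge's raw reply to a verdict, defaulting to TIE on anything unexpected
    token = reply.strip().upper()
    return token if token in {"A", "B", "TIE"} else "TIE"
```

Defaulting malformed replies to TIE is a conservative choice; in practice, judge pipelines also swap the A/B order across runs to control for the position bias these models exhibit.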