A Bayesian Interpretation of Adaptive Low-Rank Adaptation

This repository contains the source code for the paper "A Bayesian Interpretation of Adaptive Low-Rank Adaptation" by Haolin Chen and Philip N. Garner.

It comprises three components:

1. run_glue_no_trainer.py: the main Python script, adapted from Hugging Face Transformers version 4.40.0.
2. peft: a customized Python package based on Hugging Face PEFT version 0.11.0. It includes the implementation of the importance scores for AdaLoRA.
3. ivon: a slightly modified implementation of Improved Variational Online Newton (IVON).

Licenses: 1 and 2 are licensed under Apache-2.0; 3 is licensed under GPL-3.0.

Setup

Follow the instructions from Transformers to set up the Python environment.
Install the customized peft and ivon packages.

Fine-tuning

Scripts for fine-tuning are in the scripts directory.

File name	Model	Optimizer	Criterion
full.sh	Full fine-tuning	Adam	N/A
lora_all.sh	LoRA	Adam	$r=2/4$
adalora.sh	AdaLoRA	Adam	Sensitivity
adalora_ivon{_clr}.sh	AdaLoRA	IVON	Sensitivity
vilora{_clr}.sh	AdaLoRA	IVON	$\mathrm{SNR}(\|\theta\|)$
vilora{_clr}_criterion.sh	AdaLoRA	IVON	$\mathrm{SNR}(\|\theta\|), \|\mu\|/\sigma, \|\mu\|, 1/\sigma$

clr stands for customized learning rate schedule, which is used with IVON on CoLA, STS-B, MRPC, and RTE.
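
As a rough illustration of the four criteria listed for vilora{_clr}_criterion.sh, the sketch below computes them for a single scalar parameter. It assumes the parameter has a Gaussian posterior with mean mu and standard deviation sigma (IVON maintains a diagonal Gaussian posterior), and it reads SNR(|theta|) as the mean of |theta| divided by its standard deviation, i.e. the SNR of a folded normal. That reading, and the function and key names, are assumptions made for this example; they are not taken from the customized peft package.

```python
import math


def folded_normal_snr(mu: float, sigma: float) -> float:
    """SNR of |theta| for theta ~ N(mu, sigma^2), assuming SNR = mean / std.

    Hypothetical helper for illustration; not part of peft or ivon.
    """
    # Standard normal CDF evaluated at -mu / sigma, via the error function.
    cdf = 0.5 * (1.0 + math.erf((-mu / sigma) / math.sqrt(2.0)))
    # Mean of the folded normal distribution.
    mean = (
        sigma * math.sqrt(2.0 / math.pi) * math.exp(-(mu ** 2) / (2.0 * sigma ** 2))
        + mu * (1.0 - 2.0 * cdf)
    )
    # Variance of the folded normal: E[theta^2] - (E|theta|)^2.
    var = mu ** 2 + sigma ** 2 - mean ** 2
    return mean / math.sqrt(var)


def criteria(mu: float, sigma: float) -> dict:
    """Return the four candidate importance criteria for one parameter."""
    return {
        "snr_abs_theta": folded_normal_snr(mu, sigma),  # SNR(|theta|), assumed form
        "abs_mu_over_sigma": abs(mu) / sigma,           # |mu| / sigma
        "abs_mu": abs(mu),                              # |mu|
        "inv_sigma": 1.0 / sigma,                       # 1 / sigma
    }


if __name__ == "__main__":
    for mu, sigma in [(0.0, 1.0), (0.5, 1.0), (5.0, 1.0)]:
        print(mu, sigma, criteria(mu, sigma))
```

Under this reading, SNR(|theta|) approaches |mu|/sigma when |mu| is much larger than sigma, and the two differ most for parameters whose mean is close to zero.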

Evaluation

Evaluation is conducted automatically after fine-tuning.
