MedINST: Meta Dataset of Biomedical Instructions

Meta Dataset of Biomedical Instructions (MedINST) is a novel multi-domain, multi-task instructional meta-dataset. MedINST comprises 133 biomedical NLP tasks and over 7 million training samples, making it the most comprehensive biomedical instruction dataset to date. Using MedINST as the meta dataset, we curate MedINST32, a challenging benchmark with different task difficulties aiming to evaluate LLMs' generalization ability.

Data Access

Complete dataset MedINST can be accessed at [LiinXemmon/MedINST].

MedINST32 Benchmark Dataset can be checked at [LiinXemmon/MedINST32].

Models

LLaMA3-MI: Further fine-tune LLaMA-3-8B-Instruct on 100K samples from MedINST.

MMedL3-MI: Further fine-tune MMed-Llama-3-8B on 100K samples from MedINST. [Google Drive]

Evaluation

We access and evaluate LLMs using OpenAI compatible APIs.

Evaluate ChatGPT

python evaluation.py --name <SAVE_NAME> --dir <SAVE_DIR> --model 'gpt-3.5-turbo' --key <YOUR_KEY>

Add --zero option to evaluate the model in the zero-shot setting.

Other Models

You may use vLLM to deploy a model at local.

Specify the URL of the deployed API by the option --base_url. For example

python evaluation.py --name <SAVE_NAME> --dir <SAVE_DIR> --model <YOUR_MODEL> --key <YOUR_KEY> --base_url "http://localhost:8000/v1"

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
evaluation.py		evaluation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MedINST: Meta Dataset of Biomedical Instructions

Data Access

Models

Evaluation

Evaluate ChatGPT

Other Models

About

Releases

Packages

Contributors 2

Languages

aialt/MedINST

Folders and files

Latest commit

History

Repository files navigation

MedINST: Meta Dataset of Biomedical Instructions

Data Access

Models

Evaluation

Evaluate ChatGPT

Other Models

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages