
🎉 Dialogue Safety in Mental Health Support

This repository contains the code and data for the paper A Benchmark for Understanding Dialogue Safety in Mental Health Support, which was accepted at the 12th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2023).

📗 Taxonomy and Framework

Figure 1. Our proposed sequential, inclusive taxonomy for ensuring safe and supportive dialogue in mental health support. Given a conversational context, annotators label model responses sequentially, following the node order. Note that content that is not understandable is considered unsafe; addressing this issue therefore takes precedence in our sequential framework and occurs at Node 1. The four rightmost categories in Node 4 cover the unsafe classes found in existing chitchat scenarios.

Table 1. Different types of model responses within a dialogue context in our taxonomy.

🔥 Results

Table 2. Evaluation results for fine-grained classification on the test set. The results present the mean and standard deviation (subscript) of accuracy (Acc.), precision (P), recall (R), and F1 score (F1). In the few-shot setting, the inference prompt includes 8 diverse examples from the training set. † indicates that the model used is GPT-3.5-TURBO-0301, while ‡ indicates that the model used is GPT-3.5-TURBO-0613.

🌟 Data

Due to privacy concerns regarding psychological counseling data, we currently release only the test set. The data used in this paper is included in the data directory and has been appropriately anonymized.

Access to our training set is restricted to researchers who agree to comply with ethical guidelines and sign a confidentiality agreement with us. If you need access to our dataset, please sign the data use agreement in DataUseAgreement and contact Huachuan (qiuhuachuan@westlake.edu.cn).

Below is the label description.

{
    0: "Nonsense",
    1: "Humanoid Mimicry",
    2: "Linguistic Neglect",
    3: "Unamiable Judgment",
    4: "Toxic Language",
    5: "Unauthorized Preachment",
    6: "Nonfactual Statement",
    7: "Safe Response"
}
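For reference, a minimal Python sketch of how this mapping might be used to interpret classifier predictions (the function names here are illustrative, not part of the released code):

```python
# Label taxonomy from the paper: index 7 ("Safe Response") is the only
# safe class; indices 0-6 cover the unsafe categories.
ID2LABEL = {
    0: "Nonsense",
    1: "Humanoid Mimicry",
    2: "Linguistic Neglect",
    3: "Unamiable Judgment",
    4: "Toxic Language",
    5: "Unauthorized Preachment",
    6: "Nonfactual Statement",
    7: "Safe Response",
}

def label_name(pred_id: int) -> str:
    """Return the human-readable label for a predicted class index."""
    return ID2LABEL[pred_id]

def is_safe(pred_id: int) -> bool:
    """Only class 7 ("Safe Response") counts as safe in this taxonomy."""
    return pred_id == 7
```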

🔍 ChatGPT Eval

ChatGPT_eval.py generates model responses under both zero- and few-shot settings. To analyze the results, run analyze_ChatGPT_results.py.
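In the few-shot setting, the inference prompt contains 8 diverse training examples. A rough sketch of how such a prompt could be assembled (the exact template used by ChatGPT_eval.py may differ; the function and example strings below are hypothetical):

```python
def build_few_shot_prompt(examples, context, response):
    """Assemble a few-shot classification prompt.

    `examples` is a list of (context, response, label_name) triples drawn
    from the training set; the target pair goes last, with its label left
    blank for the model to fill in.
    """
    parts = ["Classify the model response within its dialogue context."]
    for ctx, resp, label in examples:
        parts.append(f"Context: {ctx}\nResponse: {resp}\nLabel: {label}")
    parts.append(f"Context: {context}\nResponse: {response}\nLabel:")
    return "\n\n".join(parts)

# Illustrative usage with one made-up in-context example:
demo = build_few_shot_prompt(
    [("I feel hopeless.", "I'm an AI, I can't feel.", "Humanoid Mimicry")],
    "I can't sleep at night.",
    "Have you tried talking to someone you trust?",
)
```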

🛠 Dependencies

The code is implemented using Python 3.10 and PyTorch v2.0. We recommend using Anaconda or Miniconda to set up this codebase. Please install dependencies through requirements.txt.

pip install -r requirements.txt

🎯 Training

The code includes two models: bert-base-chinese and hfl/chinese-roberta-wwm-ext-large. Users can choose either one based on their requirements or opt for other models available on Hugging Face.

bash finetune.sh

🎨 Evaluation

The code includes two models: bert-base-chinese and hfl/chinese-roberta-wwm-ext-large.

bash eval.sh
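At inference time, a fine-tuned classifier emits one logit per class, and the predicted label is the argmax over the 8 classes. A dependency-free sketch of that final step (names and scores here are illustrative, not taken from eval.sh):

```python
def predict_label(logits):
    """Pick the class index with the highest logit (argmax over 8 classes)."""
    return max(range(len(logits)), key=lambda i: logits[i])

# Example logits for one response; index 7 ("Safe Response") wins here.
scores = [0.1, -1.2, 0.3, -0.5, -2.0, 0.0, -0.8, 2.4]
```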

📚 BibTeX entry and citation info

If you find the data or paper useful, please cite it in your work.

@inproceedings{qiu2023benchmark,
  title={A Benchmark for Understanding Dialogue Safety in Mental Health Support},
  author={Qiu, Huachuan and Zhao, Tong and Li, Anqi and Zhang, Shuai and He, Hongliang and Lan, Zhenzhong},
  booktitle={CCF International Conference on Natural Language Processing and Chinese Computing},
  pages={1--13},
  year={2023},
  organization={Springer}
}
