ChatGLM3 Multi-Turn Dialogue Training Data

2020-09-13

Resource Description

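This repository provides a multi-turn dialogue dataset for fine-tuning the ChatGLM3 model. It contains the raw data, the data-processing code, and the train.json, dev.json, and test.json files used for training, validation, and testing.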

The training file train.json is located at finetune_demo/data/JDMulConversations/train.json.
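Before training, it can help to sanity-check the data files. The sketch below is not part of this repository. It assumes that each non-empty line of train.json is a JSON object with a "conversations" list whose items carry "role" and "content" fields, which is the multi-turn layout used by the ChatGLM3 fine-tuning demo. The role set and the function names validate_record and validate_file are illustrative assumptions; adjust them if the files here are laid out differently.

```python
import json

# Assumption: roles accepted by the ChatGLM3 conversation format.
ALLOWED_ROLES = {"system", "user", "assistant", "tool", "observation"}


def validate_record(record):
    """Return a list of problems found in one conversation record (empty if none)."""
    conversations = record.get("conversations") if isinstance(record, dict) else None
    if not isinstance(conversations, list) or not conversations:
        return ["missing or empty 'conversations' list"]

    problems = []
    for index, turn in enumerate(conversations):
        if not isinstance(turn, dict):
            problems.append(f"turn {index}: not a JSON object")
            continue
        role = turn.get("role")
        if not isinstance(role, str) or role not in ALLOWED_ROLES:
            problems.append(f"turn {index}: unexpected role {role!r}")
        if not isinstance(turn.get("content"), str):
            problems.append(f"turn {index}: 'content' is not a string")
    return problems


def validate_file(path):
    """Check a JSON Lines file; return (line_number, problems) for each bad line."""
    report = []
    with open(path, encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            if not line.strip():
                continue  # blank lines are skipped
            try:
                record = json.loads(line)
            except json.JSONDecodeError as exc:
                report.append((line_number, [f"invalid JSON: {exc.msg}"]))
                continue
            problems = validate_record(record)
            if problems:
                report.append((line_number, problems))
    return report
```

For example, validate_file("finetune_demo/data/JDMulConversations/train.json") returns an empty list when every line passes these checks.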

Before training with this dataset, update the data_config section of the LoRA configuration file (configs/lora.yaml) as follows:

data_config:
  train_file: train.json
  val_file: dev.json
  test_file: test.json
  num_proc: 16
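The file names in data_config are relative names, so they only resolve if the files sit in the data directory passed to the training script. The following sketch checks this up front. It assumes the script joins the data directory with each configured file name; missing_data_files is a hypothetical helper, and the dictionary simply mirrors the YAML above.

```python
from pathlib import Path


def missing_data_files(data_dir, data_config):
    """Return the paths named in data_config that do not exist under data_dir."""
    missing = []
    for key in ("train_file", "val_file", "test_file"):
        name = data_config.get(key)
        if name is None:
            continue  # this split is not configured
        path = Path(data_dir) / name
        if not path.is_file():
            missing.append(str(path))
    return missing


if __name__ == "__main__":
    # Mirrors the data_config section shown above.
    config = {
        "train_file": "train.json",
        "val_file": "dev.json",
        "test_file": "test.json",
        "num_proc": 16,
    }
    for path in missing_data_files("data/JDMulConversations", config):
        print("missing:", path)
```

An empty result means all three configured files were found in the data directory.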

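Train the model with the following command, run from the finetune_demo directory. The three arguments are the data directory, the path to the base model, and the LoRA configuration file; CUDA_VISIBLE_DEVICES=1 restricts the run to the GPU with index 1. Adjust the model path to match your environment.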

CUDA_VISIBLE_DEVICES=1 python finetune_hf.py data/JDMulConversations/ /root/autodl-tmp/model/chatglm3-6b configs/lora.yaml

Issues and Pull Requests that improve this dataset and the related code are welcome.