Dataset:
TigerResearch/tigerbot-OIG-multichat-en-50k
Language:
en
License:
apache-2.0
A multi-turn dialogue SFT dataset that Tigerbot generated by processing the open-source OIG dataset.
Original source: https://huggingface.co/datasets/laion/OIG
import datasets
ds_sft = datasets.load_dataset('TigerResearch/tigerbot-OIG-multichat-en-50k')
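Once the dataset is loaded, a quick look at its splits and field names is often useful. This card does not document the split names or the record schema, so the sketch below discovers both at runtime rather than assuming them. The helper names `summarize_splits` and `load_and_summarize` are hypothetical and introduced only for illustration.

```python
def summarize_splits(dataset_dict):
    """Return {split_name: (num_rows, sorted field names of the first row)}.

    Works on any mapping of split name -> indexable collection of dict-like
    records, which includes the object returned by datasets.load_dataset
    when no split is requested.
    """
    summary = {}
    for split_name, split in dataset_dict.items():
        num_rows = len(split)
        # Peek at the first record only to discover field names;
        # no particular schema is assumed.
        fields = sorted(split[0].keys()) if num_rows > 0 else []
        summary[split_name] = (num_rows, fields)
    return summary


def load_and_summarize(repo_id="TigerResearch/tigerbot-OIG-multichat-en-50k"):
    # Imported lazily so summarize_splits can be used without the library.
    # Calling this function downloads the dataset and needs network access.
    import datasets

    return summarize_splits(datasets.load_dataset(repo_id))
```

Calling `load_and_summarize()` returns a plain dictionary that can be printed to see which splits exist and which fields each record carries.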