数据集:

Thaweewat/databricks-dolly-15k-th

中文

Summary

This is a Thai ??-instructed dataset translated from databricks-dolly-15k using Google Cloud Translation. databricks-dolly-15k is an open-source dataset of instruction-following records generated by thousands of Databricks employees in several behavioral categories outlined in the InstructGPT paper, including brainstorming, classification, closed QA, generation, information extraction, open QA, and summarization.

This dataset can be used for any purpose, whether academic or commercial, under the terms of the Creative Commons Attribution-ShareAlike 3.0 Unported License .

Supported Tasks:

  • Training LLMs
  • Synthetic Data Generation
  • Data Augmentation

Languages: Thai Version: 1.0