Dataset:
emrecan/nli_tr_for_simcse
This dataset is a modified version of the NLI-TR dataset. It is intended for training supervised SimCSE models that produce sentence embeddings. The steps followed to produce this dataset are listed below:
See this Colab Notebook for training and evaluation on Turkish sentences.
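For readers unfamiliar with the training objective, here is a minimal sketch of the supervised SimCSE contrastive loss, written in plain Python. In the SimCSE paper's supervised setting, each anchor sentence is paired with an entailment hypothesis as its positive and a contradiction hypothesis as its hard negative. That this dataset follows the same triplet layout is an assumption here. The function name `supervised_simcse_loss`, the toy 2-d vectors, and the temperature of 0.05 are illustrative choices, not taken from this dataset card or the linked notebook.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def supervised_simcse_loss(anchors, positives, hard_negatives, temperature=0.05):
    """Mean contrastive loss over a batch of embedding triplets.

    For anchor i, the target is positives[i]; every other positive and
    every hard negative in the batch acts as a negative candidate.
    """
    losses = []
    for i, anchor in enumerate(anchors):
        # Scaled similarities to all candidates in the batch.
        logits = [cosine(anchor, p) / temperature for p in positives]
        logits += [cosine(anchor, n) / temperature for n in hard_negatives]
        # Numerically stable log-sum-exp for the denominator.
        top = max(logits)
        log_denominator = top + math.log(sum(math.exp(x - top) for x in logits))
        # Negative log-probability of picking the matching positive.
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


# Toy embeddings standing in for encoder outputs (made up for illustration).
anchors = [[1.0, 0.0], [0.0, 1.0]]
positives = [[0.9, 0.1], [0.1, 0.9]]
hard_negatives = [[-1.0, 0.0], [0.0, -1.0]]

good = supervised_simcse_loss(anchors, positives, hard_negatives)
# Swapping positives and hard negatives should make the loss much larger.
bad = supervised_simcse_loss(anchors, hard_negatives, positives)
print(good < bad)
```

In real training the vectors would come from a sentence encoder and the loss would be computed with an autodiff framework. The sketch only shows how the three elements of each triplet enter the objective.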