================================
This dataset is to measure gender fairness in the downstream task of sentiment analysis. This dataset is a subset of the SST data that was filtered to have only the sentences that contain gender information. The python code used to create this dataset can be found in the prepare_sst.ipyth file.
==============================
Each sentence has two existing labels:
The 'gender' label has two tags:
For each sentence, you are to annotate if the sentence's sentiment is directed toward a gendered person i.e. the gender label is correct.
There are two primary ways the gender label can be incorrect: 1) the sentiment is not directed toward a gendered person/character, or 2) the sentiment is directed toward a gendered person/character but the gender is incorrect.
Please annotate 1 if the sentence is correctly labeled and 0 if not.
(The sentiment labels should be high quality, so mostly we're checking that the gender is correctly labeled.)
Some clarifying notes:
Sentiment is directed at the 'she'.
Sentiment is directed at the male-gendered director.
Sentiment is directed at Davis, who is gendered with the pronoun 'she'.
This sentence was labeled 'femm' because it had the word 'miss' in it, but the sentiment is not actually directed towards a feminine person (we don't know the gender of the director).
The sentiment is directed towards the movie, or maybe the director, but not the male-gendered author.
Sentiment is directed at the acting, not a person or character.
Similar to above, the sentiment is directed towards the movie's focus---though the focus may be gendered, we are only keeping sentences where the sentiment is directed towards a gendered person or character.
=====================
The final dataset conatina the following columns:
Sentnces: the sentence that contain a sentiment.
label: the sentiment label if hte sentience is positve or negative.
gender: the gender of hte target of the sentiment in the sentence.
A1: the annotation of the first annotator. ("1" means that the gender in the "gender" colum is correctly the target of the sentnce. "0" means otherwise)
A2: the annotation of the second annotator. ("1" means that the gender in the "gender" colum is correctly the target of the sentnce. "0" means otherwise)
A3: the annotation of the third annotator. ("1" means that the gender in the "gender" colum is correctly the target of the sentnce. "0" means otherwise)
Keep: a boolean indicating wheather to keeep this sentnce or not. "Keep" means that the gender of this sentence was labelled by more than one annotator as correct.
agreement: the number of annotators who agreeed o nteh label.
correct: the number of annotators who gave the majority of labels.
incorrect: the number of annotators who gave the minority labels.
This dataset is ready to use as the majority of the human annotators agreed that the sentiment of these sentences is targeted at the gender mentioned in the "gender" column
==============
@misc{sst-sentiment-fainress-dataset, title={A dataset to measure fairness in the sentiment analysis task}, author={Gero, Katy and Butters, Nathan and Bethke, Anna and Elsafoury, Fatma}, howpublished={ https://github.com/efatmae/SST_sentiment_fairness_data} , year={2023} }