d4data/bias-detection-model | ATYUN.COM 官网-人工智能教程资讯全方位服务平台

模型:

d4data/bias-detection-model

任务:

文本分类

类库:

TensorFlow Transformers

语言:

其他:

distilbert Text Classification Carbon Emissions Text+Classification

模型介绍文件清单

中文

About the Model

An English sequence classification model, trained on MBAD Dataset to detect bias and fairness in sentences (news articles). This model was built on top of distilbert-base-uncased model and trained for 30 epochs with a batch size of 16, a learning rate of 5e-5, and a maximum sequence length of 512.

Dataset : MBAD Data
Carbon emission 0.319355 Kg

Train Accuracy	Validation Accuracy	Train loss	Test loss
76.97	62.00	0.45	0.96

Usage

The easiest way is to load the inference api from huggingface and second method is through the pipeline object offered by transformers library.

from transformers import AutoTokenizer, TFAutoModelForSequenceClassification
from transformers import pipeline
tokenizer = AutoTokenizer.from_pretrained("d4data/bias-detection-model")
model = TFAutoModelForSequenceClassification.from_pretrained("d4data/bias-detection-model")

classifier = pipeline('text-classification', model=model, tokenizer=tokenizer) # cuda = 0,1 based on gpu availability
classifier("The irony, of course, is that the exhibit that invites people to throw trash at vacuuming Ivanka Trump lookalike reflects every stereotype feminists claim to stand against, oversexualizing Ivanka’s body and ignoring her hard work.")

Author

This model is part of the Research topic "Bias and Fairness in AI" conducted by Deepak John Reji, Shaina Raza. If you use this work (code, model or dataset), please star at:

Bias & Fairness in AI, (2022), GitHub repository, https://github.com/dreji18/Fairness-in-AI

作者:

D 4 Data Community

数据集大小:

255.76 MB