A dataset from kaggle with duplicate data removed.
The data instances have the following fields:
{
"cat": 0,
"dog": 1,
}
| train | test | |
|---|---|---|
| # of examples | 8000 | 2000 |
>>> from datasets import load_dataset
>>> dataset = load_dataset("Bingsu/Cat_and_Dog")
>>> dataset
DatasetDict({
train: Dataset({
features: ['image', 'labels'],
num_rows: 8000
})
test: Dataset({
features: ['image', 'labels'],
num_rows: 2000
})
})
>>> dataset["train"].features
{'image': Image(decode=True, id=None), 'labels': ClassLabel(num_classes=2, names=['cat', 'dog'], id=None)}