A dataset from kaggle with duplicate data removed.
The data instances have the following fields:
{ "cat": 0, "dog": 1, }
train | test | |
---|---|---|
# of examples | 8000 | 2000 |
>>> from datasets import load_dataset >>> dataset = load_dataset("Bingsu/Cat_and_Dog") >>> dataset DatasetDict({ train: Dataset({ features: ['image', 'labels'], num_rows: 8000 }) test: Dataset({ features: ['image', 'labels'], num_rows: 2000 }) }) >>> dataset["train"].features {'image': Image(decode=True, id=None), 'labels': ClassLabel(num_classes=2, names=['cat', 'dog'], id=None)}