数据集:
jordyvl/rvl_cdip_easyocr
许可:
other预印本库:
arxiv:1502.07058源数据集:
extended|iit_cdip批注创建人:
found语言创建人:
found大小:
100K<n<1M计算机处理:
monolingual语言:
en任务:
图像分类The data loader provides support for loading easyOCR files together with the images It is not included under '../data', yet is available upon request via email firstname@contract.fit .
The RVL-CDIP (Ryerson Vision Lab Complex Document Information Processing) dataset consists of 400,000 grayscale images in 16 classes, with 25,000 images per class. There are 320,000 training images, 40,000 validation images, and 40,000 test images. The images are sized so their largest dimension does not exceed 1000 pixels.
All the classes and documents use English as their primary language.
A sample from the training set is provided below :
{ 'image': <PIL.TiffImagePlugin.TiffImageFile image mode=L size=754x1000 at 0x7F9A5E92CA90>, 'label': 15 }
{ "0": "letter", "1": "form", "2": "email", "3": "handwritten", "4": "advertisement", "5": "scientific report", "6": "scientific publication", "7": "specification", "8": "file folder", "9": "news article", "10": "budget", "11": "invoice", "12": "presentation", "13": "questionnaire", "14": "resume", "15": "memo" }
train | test | validation | |
---|---|---|---|
# of examples | 320000 | 40000 | 40000 |
The dataset was split in proportions similar to those of ImageNet.
From the paper:
This work makes available a new labelled subset of the IIT-CDIP collection, containing 400,000 document images across 16 categories, useful for training new CNNs for document analysis.
The same as in the IIT-CDIP collection.
Who are the source language producers?The same as in the IIT-CDIP collection.
The same as in the IIT-CDIP collection.
Who are the annotators?The same as in the IIT-CDIP collection.
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
The dataset was curated by the authors - Adam W. Harley, Alex Ufkes, and Konstantinos G. Derpanis.
RVL-CDIP is a subset of IIT-CDIP, which came from the Legacy Tobacco Document Library , for which license information can be found here .
@inproceedings{harley2015icdar, title = {Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval}, author = {Adam W Harley and Alex Ufkes and Konstantinos G Derpanis}, booktitle = {International Conference on Document Analysis and Recognition ({ICDAR})}}, year = {2015} }
Thanks to @dnaveenr for adding this dataset.