You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
PP-Structure Model list
1. Layout Analysis
model name |
description |
inference model size |
download |
dict path |
picodet_lcnet_x1_0_fgd_layout |
The layout analysis English model trained on the PubLayNet dataset based on PicoDet LCNet_x1_0 and FGD . the model can recognition 5 types of areas such as Text, Title, Table, Picture and List |
9.7M |
inference model / trained model |
PubLayNet dict |
ppyolov2_r50vd_dcn_365e_publaynet |
The layout analysis English model trained on the PubLayNet dataset based on PP-YOLOv2 |
221.0M |
inference_moel / trained model |
same as above |
picodet_lcnet_x1_0_fgd_layout_cdla |
The layout analysis Chinese model trained on the CDLA dataset, the model can recognition 10 types of areas such as Table、Figure、Figure caption、Table、Table caption、Header、Footer、Reference、Equation |
9.7M |
inference model / trained model |
CDLA dict |
picodet_lcnet_x1_0_fgd_layout_table |
The layout analysis model trained on the table dataset, the model can detect tables in Chinese and English documents |
9.7M |
inference model / trained model |
Table dict |
ppyolov2_r50vd_dcn_365e_tableBank_word |
The layout analysis model trained on the TableBank Word dataset based on PP-YOLOv2, the model can detect tables in English documents |
221.0M |
inference model |
same as above |
ppyolov2_r50vd_dcn_365e_tableBank_latex |
The layout analysis model trained on the TableBank Latex dataset based on PP-YOLOv2, the model can detect tables in English documents |
221.0M |
inference model |
same as above |
2. OCR and Table Recognition
2.1 OCR
model name |
description |
inference model size |
download |
en_ppocr_mobile_v2.0_table_det |
Text detection model of English table scenes trained on PubTabNet dataset |
4.7M |
inference model / trained model |
en_ppocr_mobile_v2.0_table_rec |
Text recognition model of English table scenes trained on PubTabNet dataset |
6.9M |
inference model / trained model |
If you need to use other OCR models, you can download the model in PP-OCR model_list or use the model you trained yourself to configure to det_model_dir
, rec_model_dir
field.
2.2 Table Recognition
model |
description |
inference model size |
download |
en_ppocr_mobile_v2.0_table_structure |
English table recognition model trained on PubTabNet dataset based on TableRec-RARE |
6.8M |
inference model / trained model |
en_ppstructure_mobile_v2.0_SLANet |
English table recognition model trained on PubTabNet dataset based on SLANet |
9.2M |
inference model / trained model |
ch_ppstructure_mobile_v2.0_SLANet |
Chinese table recognition model based on SLANet |
9.3M |
inference model / trained model |
3. KIE
On XFUND_zh dataset, Accuracy and time cost of different models on V100 GPU are as follows.
- Note: The above time cost information just considers inference time without preprocess or postprocess, test environment:
V100 GPU + CUDA 10.2 + CUDNN 8.1.1 + TRT 7.2.3.4
On wildreceipt dataset, the algorithm result is as follows: