Name | Last updated | |
---|---|---|
kie_dict | 8 months ago | |
layout_dict | 8 months ago | |
README.md | 8 months ago | |
ar_dict.txt | 8 months ago | |
arabic_dict.txt | 8 months ago | |
be_dict.txt | 8 months ago | |
bengali_dict.txt | 8 months ago | |
bg_dict.txt | 8 months ago | |
bm_dict.txt | 8 months ago | |
bm_dict_add.txt | 8 months ago | |
bn_dict.txt | 8 months ago | |
chinese_cht_dict.txt | 8 months ago | |
confuse.pkl | 8 months ago | |
cyrillic_dict.txt | 8 months ago | |
devanagari_dict.txt | 8 months ago | |
en_dict.txt | 8 months ago | |
fa_dict.txt | 8 months ago | |
french_dict.txt | 8 months ago | |
german_dict.txt | 8 months ago | |
gujarati_dict.txt | 8 months ago | |
hi_dict.txt | 8 months ago | |
it_dict.txt | 8 months ago | |
japan_dict.txt | 8 months ago | |
ka_dict.txt | 8 months ago | |
kazakh_dict.txt | 8 months ago | |
korean_dict.txt | 8 months ago | |
latex_ocr_tokenizer.json | 8 months ago | |
latex_symbol_dict.txt | 8 months ago | |
latin_dict.txt | 8 months ago | |
mr_dict.txt | 8 months ago | |
ne_dict.txt | 8 months ago | |
oc_dict.txt | 8 months ago | |
parseq_dict.txt | 8 months ago | |
pu_dict.txt | 8 months ago | |
rs_dict.txt | 8 months ago | |
rsc_dict.txt | 8 months ago | |
ru_dict.txt | 8 months ago | |
spin_dict.txt | 8 months ago | |
ta_dict.txt | 8 months ago | |
table_dict.txt | 8 months ago | |
table_master_structure_dict.txt | 8 months ago | |
table_structure_dict.txt | 8 months ago | |
table_structure_dict_ch.txt | 8 months ago | |
te_dict.txt | 8 months ago | |
ug_dict.txt | 8 months ago | |
uk_dict.txt | 8 months ago | |
ur_dict.txt | 8 months ago | |
vi_dict.txt | 8 months ago | |
xi_dict.txt | 8 months ago |
Dictionary and Corpus
Dictionary files (usually character-level vocabularies) are included here for easier configuration. Corpora contributed by OSS contributors are listed below; please respect their copyrights, and use them at your own risk.
- Burmese corpus: https://github.com/1chimaruGin/BurmeseCorpus
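
As a rough illustration of how a character-level dictionary file might be consumed, the sketch below builds a character-to-index mapping. It assumes one vocabulary entry per line in UTF-8, which may not hold for every file in this directory (the `.json` and `.pkl` files clearly use other formats). The helper names `build_char_index` and `load_char_dict` are hypothetical and not part of this repository.

```python
def build_char_index(lines):
    """Map each vocabulary entry to an integer index.

    Assumption: each element of `lines` holds exactly one entry.
    Only the line terminator is stripped, so an entry that is a
    single space is preserved. Blank lines are skipped, and the
    first occurrence of a duplicated entry keeps its index.
    """
    index = {}
    for line in lines:
        entry = line.rstrip("\r\n")
        if entry and entry not in index:
            index[entry] = len(index)
    return index


def load_char_dict(path, encoding="utf-8"):
    """Read a dictionary file from disk and build its index (hypothetical helper)."""
    with open(path, "r", encoding=encoding) as f:
        return build_char_index(f)
```

For example, `load_char_dict("en_dict.txt")` would return a mapping from each entry in that file to its position, provided the file follows the one-entry-per-line layout assumed here.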