Input Formats
Aryn DocParse supports the following input formats: .pdf, .docx, .doc, .pptx, .ppt, .csv, .jpg (.jpeg), .png, .bmp, .tiff, .html, .odt, .rtf, .txt, .xls, .xlsx, .xml, .svg, .webp, .wmf, .emf, .mml, .ods, .xhtml, .odp, .odg, .odf, .ots, .xltx, .fods, .xlt, .slk
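A client may want to reject unsupported files before uploading them. The following is a minimal sketch of such a pre-check, not part of the Aryn SDK: the names `SUPPORTED_INPUT_EXTENSIONS` and `is_supported_input` are hypothetical, and the case-insensitive match on the file extension is an assumption (the service itself may decide based on file contents rather than the name).

```python
from pathlib import Path

# Extensions copied from the input-format list above.
SUPPORTED_INPUT_EXTENSIONS = frozenset({
    ".pdf", ".docx", ".doc", ".pptx", ".ppt", ".csv", ".jpg", ".jpeg",
    ".png", ".bmp", ".tiff", ".html", ".odt", ".rtf", ".txt", ".xls",
    ".xlsx", ".xml", ".svg", ".webp", ".wmf", ".emf", ".mml", ".ods",
    ".xhtml", ".odp", ".odg", ".odf", ".ots", ".xltx", ".fods", ".xlt",
    ".slk",
})


def is_supported_input(path: str) -> bool:
    """Return True if the file extension is in the list above.

    Assumption: extensions are compared case-insensitively.
    """
    return Path(path).suffix.lower() in SUPPORTED_INPUT_EXTENSIONS
```

For example, `is_supported_input("report.PDF")` returns `True`, while `is_supported_input("archive.zip")` returns `False`.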
Output Formats
Aryn DocParse supports the following output formats: .json, .md, .html
OCR Languages
Aryn DocParse supports the following OCR languages. Each entry gives the language name followed by its identifier. The default is english:
- Abaza: abaza
- Adyghe: adyghe
- Afrikaans: afrikaans
- Albanian: albanian
- Angika: angika
- Arabic: arabic
- Avar: avar
- Azerbaijani: azerbaijani
- Belarusian: belarusian
- Bhojpuri: bhojpuri
- Bihari: bihari
- Bosnian: bosnian
- Bulgarian: bulgarian
- Chinese: chinese
- Chinese (Traditional): chinese_traditional
- Croatian: croatian
- Czech: czech
- Danish: danish
- Dargwa: dargwa
- Dutch: dutch
- English: english
- Estonian: estonian
- French: french
- German: german
- Hindi: hindi
- Hungarian: hungarian
- Icelandic: icelandic
- Indonesian: indonesian
- Ingush: ingush
- Irish: irish
- Italian: italian
- Japanese: japanese
- Kabardian: kabardian
- Konkani: konkani
- Korean: korean
- Kurdish: kurdish
- Lak: lak
- Latvian: latvian
- Lezghian: lezghian
- Lithuanian: lithuanian
- Magahi: magahi
- Maithili: maithili
- Malay: malay
- Maltese: maltese
- Maori: maori
- Marathi: marathi
- Mongolian: mongolian
- Nagpuri: nagpuri
- Nepali: nepali
- Newari: newari
- Norwegian: norwegian
- Occitan: occitan
- Persian: persian
- Polish: polish
- Portuguese: portuguese
- Romanian: romanian
- Russian: russian
- Serbian (Cyrillic): serbian_cyrillic
- Serbian (Latin): serbian_latin
- Slovak: slovak
- Slovenian: slovenian
- Spanish: spanish
- Swahili: swahili
- Swedish: swedish
- Tabassaran: tabassaran
- Tagalog: tagalog
- Tamil: tamil
- Telugu: telugu
- Turkish: turkish
- Ukrainian: ukrainian
- Urdu: urdu
- Uyghur: uyghur
- Uzbek: uzbek
- Vietnamese: vietnamese
- Welsh: welsh
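The identifiers above follow a simple pattern: the language name in lowercase, with spaces and parentheses collapsed into underscores. A client could use that pattern to turn a display name into an identifier and validate it before sending a request. The sketch below is illustrative only; `SUPPORTED_OCR_LANGUAGES` and `resolve_ocr_language` are hypothetical names, not part of the Aryn SDK, and how the identifier is passed to DocParse is not shown here.

```python
import re
from typing import Optional

# Identifiers copied from the OCR language list above.
SUPPORTED_OCR_LANGUAGES = frozenset("""
abaza adyghe afrikaans albanian angika arabic avar azerbaijani belarusian
bhojpuri bihari bosnian bulgarian chinese chinese_traditional croatian czech
danish dargwa dutch english estonian french german hindi hungarian icelandic
indonesian ingush irish italian japanese kabardian konkani korean kurdish lak
latvian lezghian lithuanian magahi maithili malay maltese maori marathi
mongolian nagpuri nepali newari norwegian occitan persian polish portuguese
romanian russian serbian_cyrillic serbian_latin slovak slovenian spanish
swahili swedish tabassaran tagalog tamil telugu turkish ukrainian urdu uyghur
uzbek vietnamese welsh
""".split())

DEFAULT_OCR_LANGUAGE = "english"  # documented default


def resolve_ocr_language(name: Optional[str] = None) -> str:
    """Map a display name or identifier to a supported OCR language identifier.

    Falls back to the documented default when no name is given and raises
    ValueError for anything not in the list above.
    """
    if name is None:
        return DEFAULT_OCR_LANGUAGE
    # Lowercase, then collapse runs of non-alphanumerics into one underscore.
    code = re.sub(r"[^a-z0-9]+", "_", name.lower()).strip("_")
    if code not in SUPPORTED_OCR_LANGUAGES:
        raise ValueError(f"unsupported OCR language: {name!r}")
    return code
```

For example, `resolve_ocr_language("Chinese (Traditional)")` returns `"chinese_traditional"`, and calling it with no argument returns `"english"`.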
