Here are the input and output formats, and OCR languages supported by Aryn DocParse
.pdf
.docx
.doc
.pptx
.ppt
.csv
.jpg
(.jpeg
).png
.bmp
.tiff
.html
.odt
.rtf
.txt
.xls
.xlsx
.xml
.svg
.webp
.wmf
.emf
.mml
.ods
.xhtml
.odp
.odg
.odf
.ots
.xltx
.fods
.xlt
.slk
.json
.md
english
:
abaza
adyghe
afrikaans
albanian
angika
arabic
avar
azerbaijani
belarusian
bhojpuri
bihari
bosnian
bulgarian
chinese
chinese_traditional
croatian
czech
danish
dargwa
dutch
english
estonian
french
german
hindi
hungarian
icelandic
indonesian
ingush
irish
italian
japanese
kabardian
konkani
korean
kurdish
lak
latvian
lezghian
lithuanian
magahi
maithili
malay
maltese
maori
marathi
mongolian
nagpuri
nepali
newari
norwegian
occitan
persian
polish
portuguese
romanian
russian
serbian_cyrillic
serbian_latin
slovak
slovenian
spanish
swahili
swedish
tabassaran
tagalog
tamil
telugu
turkish
ukrainian
urdu
uyghur
uzbek
vietnamese
welsh