html in the output_format option. This is useful when converting documents for RAG and other LLM-based applications.

image_extraction_options: {'associate_captions': true} and you get images expanded to encompass the captions. For more information, take a look at the Image Extraction tutorial.

table_mode = vision) and VLM-based OCR (text_mode = vision_ocr), allowing you to process twice the number of pages in the same time budget. Try it out on your most complex docs.
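As a rough illustration, the option names and values mentioned above can be gathered into a single settings dictionary. This is only a sketch: the helper name build_partition_options and its arguments are hypothetical, and the notes above do not say how the options are passed to the service or whether all of them may be combined in one request, so both are assumptions here. Only the keys and values themselves (output_format, image_extraction_options, table_mode, text_mode) come from the text.

```python
import json


def build_partition_options(html_output=True, associate_captions=True,
                            vision_tables=True, vlm_ocr=True):
    """Hypothetical helper: collect the options named in the notes above.

    Combining all four in one request is an assumption, not something
    the notes confirm.
    """
    options = {}
    if html_output:
        # Ask for HTML output.
        options["output_format"] = "html"
    if associate_captions:
        # Expand extracted images to encompass their captions.
        options["image_extraction_options"] = {"associate_captions": True}
    if vision_tables:
        # table_mode = vision
        options["table_mode"] = "vision"
    if vlm_ocr:
        # VLM-based OCR: text_mode = vision_ocr
        options["text_mode"] = "vision_ocr"
    return options


options = build_partition_options()
# Python's True is serialized as JSON true, matching the notation used above.
print(json.dumps(options, indent=2))
```

Each flag can be switched off independently, for example build_partition_options(html_output=False) to keep the default output format while still using the vision settings.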