Use Aryn DocParse to extract an image from a battery manual
If you inspect partitioned_file, you’ll notice that it’s a large JSON object with details about all the parsed elements in the PDF (check out this page to understand the schema of the returned JSON object in detail). Below, we show the first few elements of partitioned_file:
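A quick way to look at the first few elements without scrolling through the whole object is a small summary helper. The sketch below is only an illustration: it runs on a hand-made stand-in dictionary, not real DocParse output, and it assumes that the parsed elements live under an elements key and that each one carries type, bbox, and properties fields. The helper name summarize_elements and every sample value are hypothetical.

```python
from typing import Any

# Hand-made stand-in for DocParse output (hypothetical values, assumed shape).
sample_partitioned_file: dict[str, Any] = {
    "status": ["placeholder status message"],
    "elements": [
        {
            "type": "Text",
            "bbox": [0.10, 0.05, 0.90, 0.10],
            "properties": {"page_number": 1},
            "text_representation": "Placeholder heading text",
        },
        {
            "type": "Image",
            "bbox": [0.20, 0.30, 0.80, 0.70],
            "properties": {"page_number": 1},
            "binary_representation": "aGVsbG8=",
        },
    ],
}


def summarize_elements(partitioned: dict[str, Any], limit: int = 5) -> list[str]:
    """Return one short line per element for the first `limit` elements."""
    lines = []
    for index, element in enumerate(partitioned.get("elements", [])[:limit]):
        page = element.get("properties", {}).get("page_number", "?")
        kind = element.get("type", "Unknown")
        lines.append(f"{index}: {kind} (page {page}) bbox={element.get('bbox')}")
    return lines


for line in summarize_elements(sample_partitioned_file):
    print(line)
```

Passing the real partitioned_file in place of the sample should work the same way, provided the assumed keys match the schema of the returned JSON object.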
Among these elements is an Image element that contains the information about the first schematic image in the file. You can see key properties of the image, including its bounding box (which indicates the coordinates of the image on the page) and a base64-encoded binary representation of the image.
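To make these two properties concrete, here is a minimal sketch that decodes a base64 payload and converts a bounding box to pixel coordinates. It rests on several assumptions: the field name binary_representation, a bounding box given as fractional (x1, y1, x2, y2) values relative to the page, and a made-up element whose payload is just the 8-byte PNG signature. In real output, the format of the decoded bytes depends on the image extraction settings, so check it before handing the bytes to an image library.

```python
import base64


def decode_image_bytes(element: dict) -> bytes:
    """Decode the base64 string stored on an image element into raw bytes."""
    return base64.b64decode(element["binary_representation"])


def bbox_to_pixels(bbox, page_width: int, page_height: int) -> tuple:
    """Scale a fractional (x1, y1, x2, y2) box to integer pixel coordinates."""
    x1, y1, x2, y2 = bbox
    return (
        round(x1 * page_width),
        round(y1 * page_height),
        round(x2 * page_width),
        round(y2 * page_height),
    )


# Hypothetical image element: the payload is only the 8-byte PNG signature.
png_signature = b"\x89PNG\r\n\x1a\n"
image_element = {
    "type": "Image",
    "bbox": [0.25, 0.5, 0.75, 0.75],
    "binary_representation": base64.b64encode(png_signature).decode("ascii"),
}

raw = decode_image_bytes(image_element)
print(raw.startswith(b"\x89PNG"))  # True
print(bbox_to_pixels(image_element["bbox"], page_width=1000, page_height=2000))
# (250, 1000, 750, 1500)
```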
To attach captions to images, call DocParse with the associate_captions parameter within image_extraction_options set to True.
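As a rough sketch of what that setting might look like in Python, the snippet below builds the options mapping and passes it to the SDK. Only the names associate_captions and image_extraction_options come from the text. The helper names are hypothetical, and the assumption that partition_file accepts image_extraction_options as a keyword argument should be checked against the SDK reference, as should any other options your version needs to enable image extraction.

```python
def build_image_options(associate_captions: bool = True) -> dict:
    """Build the image_extraction_options mapping described in the text."""
    return {"associate_captions": associate_captions}


def partition_with_captions(pdf_path: str, api_key: str) -> dict:
    """Hypothetical wrapper; needs aryn-sdk installed and a valid API key."""
    # Imported lazily so the rest of this sketch runs without the SDK.
    from aryn_sdk.partition import partition_file

    with open(pdf_path, "rb") as pdf:
        # Assumption: image_extraction_options is accepted as a keyword argument.
        return partition_file(
            pdf,
            aryn_api_key=api_key,
            image_extraction_options=build_image_options(),
        )


print(build_image_options())  # {'associate_captions': True}
```

Keeping the options in a small builder makes it easy to toggle caption association on and off and compare the two outputs.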
With the associate_captions parameter set, you’ll notice that the image is now associated with a caption, as seen below.
The caption is stored in the caption field of the element.
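A minimal sketch of reading that field back, assuming the same elements list and type key as in the JSON object described earlier. The helper name collect_image_captions and the sample data are hypothetical, and the sketch makes no assumption about the value stored under caption: it returns whatever is there, which in real output may be a nested object and not a plain string.

```python
def collect_image_captions(partitioned: dict) -> list:
    """Return (element index, caption) pairs for Image elements with a caption."""
    pairs = []
    for index, element in enumerate(partitioned.get("elements", [])):
        if element.get("type") == "Image" and element.get("caption"):
            pairs.append((index, element["caption"]))
    return pairs


# Hand-made stand-in data (placeholder values, not real DocParse output).
sample = {
    "elements": [
        {"type": "Text", "text_representation": "Placeholder body text"},
        {"type": "Image", "caption": "Placeholder caption"},
        {"type": "Image"},  # an image with no associated caption
    ]
}

print(collect_image_captions(sample))  # [(1, 'Placeholder caption')]
```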