Partition Document
curl --request POST \
  --url https://api.aryn.cloud/v1/document/partition \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form 'options={
    "selected_pages": [123],
    "extract_images": true,
    "extract_image_format": "ppm",
    "extract_table_structure": true,
    "table_extraction_options": {
      "include_additional_text": true,
      "model_selection": "<string>"
    },
    "summarize_images": true,
    "use_ocr": true,
    "text_extraction_options": {
      "ocr_text_mode": "vision"
    },
    "ocr_language": "abaza",
    "threshold": "auto",
    "chunking_options": {
      "strategy": "context_rich",
      "tokenizer": "openai_tokenizer",
      "tokenizer_options": {},
      "max_tokens": 123,
      "merge_across_pages": true
    },
    "output_format": "json",
    "output_label_options": {
      "title_candidate_elements": ["<string>"],
      "promote_title": true,
      "orientation_correction": true
    },
    "markdown_options": {
      "include_pagenum": true,
      "include_headers": true,
      "include_footers": true
    }
  }'
{
  "status": ["<string>"],
  "status_code": 123,
  "error": "<string>",
  "elements": [
    {
      "type": "<string>",
      "bbox": [123],
      "properties": {},
      "text_representation": "<string>"
    }
  ],
  "markdown": "<string>"
}
This is the Aryn DocParse API for partitioning (and optionally chunking) a document synchronously.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Headers
Body
selected_pages: An array containing single integers (e.g., 1) and/or arrays of exactly two integers representing a page range (e.g., [1, 10]).
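To make the accepted shape concrete, here is a small hypothetical helper (not part of the API) that expands a selected_pages value into the individual page numbers it covers:

```python
def expand_selected_pages(selected_pages):
    """Expand a selected_pages value, a mix of single integers and
    [start, end] ranges, into a sorted list of page numbers."""
    pages = set()
    for entry in selected_pages:
        if isinstance(entry, int):
            pages.add(entry)
        elif isinstance(entry, list) and len(entry) == 2:
            start, end = entry
            pages.update(range(start, end + 1))  # ranges are inclusive
        else:
            raise ValueError(f"Invalid selected_pages entry: {entry!r}")
    return sorted(pages)

# page 1 plus the range 3 through 5
print(expand_selected_pages([1, [3, 5]]))  # [1, 3, 4, 5]
```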
extract_images: A boolean value indicating whether to crop images detected in the document and return them, converted to base64 in the specified format, within the binary_representation of the returned image elements.
extract_image_format: The format to use for extracted images. Defaults to ppm. Available options: ppm, png, jpeg.
extract_table_structure: A boolean value indicating whether to extract table structure from the document, i.e., detect the cells of each table broken into rows and columns.
table_extraction_options: Options for table extraction.
table_extraction_options.include_additional_text: Attempts to merge text that falls within the table bounding box but was missed by table extraction due to misalignment issues.
table_extraction_options.model_selection: An expression instructing DocParse how to select the table model to use for extraction. Defaults to "pixels > 500 -> deformable_detr; table_transformer", which means "if the largest dimension of the table is more than 500 pixels, use deformable_detr; otherwise use table_transformer." To use only one model, set model_selection="deformable_detr" or model_selection="table_transformer". Refer to the full documentation for more details.
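To illustrate how such an expression reads, here is a hedged Python sketch of evaluating the default expression. The parsing rules shown are an assumption based only on the description above, not the actual DocParse implementation, and only the "pixels > N" condition form is handled:

```python
def choose_table_model(expression, largest_dim_pixels):
    """Evaluate a model-selection expression such as
    'pixels > 500 -> deformable_detr; table_transformer':
    clauses are separated by ';', and each clause is either
    'condition -> model' or a bare fallback model name."""
    for clause in expression.split(";"):
        clause = clause.strip()
        if "->" in clause:
            condition, model = (part.strip() for part in clause.split("->"))
            # assumption: condition is of the 'pixels > N' form
            threshold = int(condition.split(">")[1])
            if largest_dim_pixels > threshold:
                return model
        else:
            return clause  # unconditional fallback
    return None

default = "pixels > 500 -> deformable_detr; table_transformer"
print(choose_table_model(default, 800))  # deformable_detr
print(choose_table_model(default, 300))  # table_transformer
```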
summarize_images: (PAYG only) A boolean value indicating whether to summarize images detected in the document and return the summaries as the text representation of the image elements.
use_ocr: A boolean value indicating whether to use OCR on the document.
text_extraction_options: Options for text extraction.
text_extraction_options.ocr_text_mode: The mode to use for OCR text extraction on non-table elements. Defaults to standard. Note that vision mode is only available for PAYG users. Available options: vision, standard.
ocr_language: The language to use for OCR. Defaults to english. Available options: abaza, adyghe, afrikaans, albanian, angika, arabic, avar, azerbaijani, belarusian, bhojpuri, bihari, bosnian, bulgarian, chinese, chinese_traditional, croatian, czech, danish, dargwa, dutch, english, estonian, french, german, hindi, hungarian, icelandic, indonesian, ingush, irish, italian, japanese, kabardian, korean, konkani, kurdish, lak, latvian, lezghian, lithuanian, magahi, maithili, malay, maltese, maori, marathi, mongolian, nagpuri, nepali, newari, norwegian, occitan, persian, polish, portuguese, romanian, russian, serbian_cyrillic, serbian_latin, slovak, slovenian, spanish, swahili, swedish, tabassaran, tagalog, tamil, telugu, turkish, ukrainian, urdu, uyghur, uzbek, vietnamese, welsh.
threshold: A number between 0 and 1, or the string auto, indicating the threshold for document segmentation. Defaults to auto, which uses an automatic threshold.
chunking_options: The options for chunking the document. If not specified, chunking is not performed.
chunking_options.strategy: The strategy to use for merging chunks. Defaults to context_rich. Available options: context_rich, mixed_multi_column, maximize_within_limit.
chunking_options.tokenizer: The tokenizer to use for chunking. Defaults to openai_tokenizer. Available options: openai_tokenizer, character_tokenizer, huggingface_tokenizer.
chunking_options.tokenizer_options: The options for the tokenizer. See the full documentation for details.
chunking_options.max_tokens: The maximum number of tokens per chunk. Defaults to 512.
chunking_options.merge_across_pages: A boolean value indicating whether to merge chunks across pages. Defaults to true. Not supported for the mixed_multi_column strategy.
output_format: The format of the output. Defaults to json. Available options: json, markdown.
output_label_options: A dictionary of options specifying which heuristics to apply to enforce certain label outputs.
output_label_options.title_candidate_elements: An array of strings representing the element types that should be considered title candidates. Defaults to ["Section-header", "Caption"].
output_label_options.promote_title: A boolean specifying whether to promote an element to title. Defaults to false.
output_label_options.orientation_correction: A boolean value indicating whether to correct the orientation of the pages. Defaults to false.
markdown_options: A dictionary of options specifying what to include in the markdown output.
markdown_options.include_pagenum: A boolean value indicating whether to include page numbers in the markdown output. Defaults to false.
markdown_options.include_headers: A boolean value indicating whether to include page headers in the markdown output. Defaults to false.
markdown_options.include_footers: A boolean value indicating whether to include page footers in the markdown output. Defaults to false.
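Putting the body parameters together, here is a minimal Python sketch of the same request as the curl example above. The option values are illustrative placeholders, the third-party requests package is assumed to be installed, and the name of the file form field ("file") is an assumption that is not shown in the curl example:

```python
import json

ARYN_URL = "https://api.aryn.cloud/v1/document/partition"

# Body parameters assembled as a dict; json.dumps produces the string
# passed as the 'options' form field in the curl example above.
options = {
    "selected_pages": [1, [3, 5]],
    "extract_table_structure": True,
    "use_ocr": True,
    "chunking_options": {"strategy": "context_rich", "max_tokens": 512},
    "output_format": "json",
}

def partition_document(pdf_path, token):
    """POST a document plus options to DocParse synchronously.
    Hypothetical wrapper: requires 'requests', and the 'file'
    form-field name is an assumption."""
    import requests
    with open(pdf_path, "rb") as f:
        resp = requests.post(
            ARYN_URL,
            headers={"Authorization": f"Bearer {token}"},
            files={"file": f},
            data={"options": json.dumps(options)},
        )
    resp.raise_for_status()
    return resp.json()

# The serialized options string is what curl passes inline:
print(json.dumps(options))
```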
Response
elements[].type: The type of the element.
elements[].bbox: The bounding box of the element.
elements[].properties: The properties of the element.
elements[].text_representation: The text representation of the element.
elements[].binary_representation: The binary representation of the element.
error: The error message if the partitioning is not successful.
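As a usage sketch, the element list in a successful response can be post-processed like this. The sample response below is a made-up illustration of the schema above, not real API output:

```python
# Hypothetical sample following the response schema above.
sample_response = {
    "status": [],
    "status_code": 200,
    "elements": [
        {"type": "Title", "bbox": [0.1, 0.05, 0.9, 0.1],
         "properties": {"page_number": 1},
         "text_representation": "Quarterly Report"},
        {"type": "Text", "bbox": [0.1, 0.2, 0.9, 0.6],
         "properties": {"page_number": 1},
         "text_representation": "Revenue grew."},
    ],
}

def full_text(response):
    """Concatenate the text_representation of every element that has one."""
    return "\n".join(
        el["text_representation"]
        for el in response.get("elements", [])
        if el.get("text_representation")
    )

print(full_text(sample_response))
```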