> ## Documentation Index
> Fetch the complete documentation index at: https://docs.aryn.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Introduction

> Welcome to Aryn DocParse!

Aryn DocParse is a compound AI system for parsing, chunking, enriching, and storing unstructured documents at scale. It uses a set of purpose-built AI models for document segmentation, optical character recognition (OCR), and extracting tables, images, metadata, properties, and more.

**Key Features**

* Return the structured output of each document in JSON, Markdown or HTML, and provide labeled bounding boxes for titles, tables, table rows and columns, images, and regular text.

* High quality AI models for complex table extraction, optical character recognition (OCR), image summarization, and more.

* Process over 30 types of document formats, including PDFs, Microsoft Word, Microsoft PowerPoint, text, and more.

* Store and index processed documents, extract metadata using GenAI, search your documents at scale with vector (semantic) or keyword search.

* Optional integration with Python document ETL pipelines using the open source [Sycamore document ETL library](https://github.com/aryn-ai/sycamore). Customize your pipeline with additional data transforms, LLM-based entity extraction, data enrichment, data cleaning, and loading vector databases and search engines.

You can use DocParse to prepare complex, unstructured data for retrieval-augmented generation (RAG) applications, document processing workflows, extracting content from documents (like tables), and semantic search systems.

[Sign-up here for free](https://console.aryn.ai/signup/) to use DocParse. You can use the DocParse UI to visualize your parsing and extraction, or get an API key and use the Aryn SDK.

You can learn more from our [introduction video](https://www.aryn.ai/?name=ArynPartitioningService_Intro)) or get started with a [Quickstart](docparse/quickstart).

If you are interested in the Aryn Platform -- an agentic unstructured data warehouse -- visit the [Aryn Platform documentation](./introduction). Aryn uses DocParse under the hood to parse and process documents when ingesting them.

## Getting started

[Sign-up here for free](https://console.aryn.ai/signup/)) to get started with DocParse.

<CardGroup cols={2}>
  <Card title="Quickstart" icon="play" href="/docparse/quickstart">
    Get Started with Aryn DocParse
  </Card>

  <Card title="Use the Aryn-SDK" icon="rectangle-terminal" href="docparse/aryn_sdk">
    Using the Aryn-SDK to call DocParse
  </Card>

  <Card title="Use DocParse UI" icon="palette" href="http://app.aryn.ai/docparse/">
    Access the DocParse UI to visualize how your documents will be partitioned
  </Card>

  <Card title="Slack Community" icon="slack" href="https://join.slack.com/t/aryn-community/shared_invite/zt-36vhennsx-mN3UsqD6PT2vxVZxpqdHsw">
    Join the Slack community for any questions
  </Card>

  <Card title="API Reference" icon="webhook" href="api-reference/introduction">
    Aryn DocParse API Reference
  </Card>

  <Card title="Aryn DocParse SDK Reference" icon="python" href="sdk-reference/partition">
    Aryn DocParse Python SDK Reference
  </Card>
</CardGroup>
