
Unstructured.io

Open-source library and API for converting unstructured documents into LLM-ready data.

Intelligent Document Processing


Details

Unstructured is an open-source Python library and managed API for ingesting, parsing, and chunking unstructured documents (PDFs, Word documents, HTML, emails, images) into structured data ready for LLMs. It is widely used in retrieval-augmented generation (RAG) pipelines.
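To make the parse-then-chunk idea concrete, here is a minimal sketch using only the Python standard library. It is not Unstructured's implementation or API: the names `Element`, `partition_html_sketch`, and `chunk_by_heading`, the two category labels, and the `max_chars` default are all assumptions made for illustration. It splits a small HTML string into typed elements, then groups them into chunks that start at each heading.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


@dataclass
class Element:
    category: str  # "Title" or "NarrativeText" (labels chosen for this sketch)
    text: str


class _Partitioner(HTMLParser):
    """Collects heading and paragraph text as typed elements."""

    HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__()
        self.elements = []
        self._category = None  # category of the tag currently being read
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in self.HEADINGS:
            self._category, self._buffer = "Title", []
        elif tag == "p":
            self._category, self._buffer = "NarrativeText", []

    def handle_data(self, data):
        if self._category is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if self._category is not None and (tag in self.HEADINGS or tag == "p"):
            # Collapse runs of whitespace so each element is one clean line.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.elements.append(Element(self._category, text))
            self._category = None


def partition_html_sketch(html):
    """Hypothetical helper: parse an HTML string into a list of Elements."""
    parser = _Partitioner()
    parser.feed(html)
    parser.close()
    return parser.elements


def chunk_by_heading(elements, max_chars=500):
    """Hypothetical helper: start a new chunk at each title, or when adding
    the next element would push the chunk past max_chars. A single element
    longer than max_chars is kept whole as its own chunk."""
    chunks, current = [], []
    for el in elements:
        projected = len("\n".join(current + [el.text]))
        if current and (el.category == "Title" or projected > max_chars):
            chunks.append("\n".join(current))
            current = []
        current.append(el.text)
    if current:
        chunks.append("\n".join(current))
    return chunks


if __name__ == "__main__":
    sample = "<h1>Intro</h1><p>First paragraph.</p><h2>Usage</h2><p>Second.</p>"
    for chunk in chunk_by_heading(partition_html_sketch(sample)):
        print(repr(chunk))
```

Keeping each heading together with the text beneath it is one common choice for RAG, since each chunk then carries its own context when it is retrieved. A production pipeline would also need to handle the other formats listed above, which this sketch does not attempt.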

Tags