Overview
Open-source library and API for converting unstructured documents into LLM-ready data.
Details
Unstructured is a popular open-source Python library and managed API for ingesting, parsing, and chunking unstructured documents (PDFs, Word, HTML, emails, images) into structured data ready for LLMs. Widely used in RAG pipelines.