Announcing LlamaSplit Public Beta: Divide Long Documents into Clear, Targeted Sections
Blog post from LlamaIndex
LlamaSplit is a new addition to the LlamaCloud product line, offering a beta API designed to automate the separation of bundled documents into distinct sections based on user-defined categories. This tool addresses the challenge of dealing with documents that combine multiple distinct files, such as resumes, financial documents, research papers, and court filings, by using AI to identify and categorize individual segments within a single document. Users can upload documents and define categories with natural language descriptions to receive segmented results with page ranges and confidence scores, which can be accessed through the API. LlamaSplit is particularly useful in fields like HR, financial services, legal, healthcare, and real estate, where it facilitates the organization and processing of documents by type. Unlike LlamaCloud Classify, which categorizes entire documents, LlamaSplit focuses on identifying boundaries within a single document, allowing for targeted extraction and workflow routing of the segmented content.
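To make the workflow routing described above concrete, here is a minimal sketch of how segmented results might be handled after they come back. It does not call the LlamaSplit API: the `Segment` class, its field names, the category keys, the numeric 0-to-1 confidence scale, and the `route_segments` helper are all hypothetical, chosen only to mirror the categories, page ranges, and confidence scores mentioned in the post.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical category definitions: a name mapped to a natural language
# description, mirroring the "user-defined categories" in the post.
CATEGORIES = {
    "resume": "A candidate's CV listing work history and education",
    "financial_statement": "Balance sheets, income statements, or similar",
    "court_filing": "A document filed with a court, such as a motion",
}


@dataclass
class Segment:
    """Assumed shape of one split result (not the real API schema)."""
    category: str
    first_page: int
    last_page: int
    confidence: float  # assumed to be a score between 0 and 1


def route_segments(segments, min_confidence=0.8):
    """Group page ranges by category; low-confidence ones go to review."""
    routed = defaultdict(list)
    for seg in segments:
        if seg.confidence >= min_confidence:
            key = seg.category
        else:
            key = "needs_review"
        routed[key].append((seg.first_page, seg.last_page))
    return dict(routed)


# Made-up results for a nine-page bundle, for illustration only.
segments = [
    Segment("resume", 1, 2, 0.97),
    Segment("financial_statement", 3, 7, 0.91),
    Segment("court_filing", 8, 9, 0.62),
]

routed = route_segments(segments)
for category, page_ranges in routed.items():
    print(category, page_ranges)
```

Sending low-confidence segments to a separate review bucket is one possible design choice; the 0.8 threshold is arbitrary and would need tuning against real results.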