Announcing LlamaSplit Public Beta: Divide Long Documents into Clear, Targeted Sections

Blog post from LlamaIndex

Post Details
Company: LlamaIndex
Author: Tuana Çelik
Word Count: 679
Language: English
Summary

LlamaSplit is a new beta API in the LlamaCloud product line that automatically separates bundled documents into distinct sections based on user-defined categories. It targets files that combine several logically separate documents, such as resumes, financial documents, research papers, and court filings, and uses AI to identify and categorize each segment within the file. Users upload a document and define categories with natural-language descriptions; the API returns segments with page ranges and confidence scores. LlamaSplit is particularly useful in HR, financial services, legal, healthcare, and real estate, where documents need to be organized and processed by type. Unlike LlamaCloud Classify, which categorizes entire documents, LlamaSplit identifies boundaries within a single document, so each segment can be extracted or routed to its own workflow.
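To make the output described in the summary concrete, here is a minimal Python sketch of how segmented results might be routed downstream. It does not call the LlamaSplit API. The `Segment` shape, the `CATEGORIES` mapping, the `route_segments` helper, the numeric confidence scale, and the sample values are all hypothetical, chosen only to mirror the "categories in, page ranges and confidence scores out" flow; the real request and response schema may differ.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One section of a bundled document (hypothetical shape, not the real API schema)."""
    category: str      # name of a user-defined category
    start_page: int    # first page of the segment, 1-indexed, inclusive
    end_page: int      # last page of the segment, inclusive
    confidence: float  # assumed here to be a score in [0, 1]


# User-defined categories with natural-language descriptions (illustrative values).
CATEGORIES = {
    "resume": "A candidate's CV listing work history and education",
    "cover_letter": "A letter addressed to a hiring manager",
}


def route_segments(segments, min_confidence=0.8):
    """Group confident segments by category; send everything else to manual review."""
    routed = defaultdict(list)
    for seg in segments:
        known = seg.category in CATEGORIES
        confident = seg.confidence >= min_confidence
        key = seg.category if known and confident else "needs_review"
        routed[key].append((seg.start_page, seg.end_page))
    return dict(routed)


# Made-up result for a 7-page bundle, standing in for an API response.
example = [
    Segment("resume", 1, 2, 0.97),
    Segment("cover_letter", 3, 3, 0.91),
    Segment("resume", 4, 7, 0.55),
]

print(route_segments(example))
```

With the sample data, pages 1-2 and page 3 are routed by category, while the low-confidence segment covering pages 4-7 falls into `needs_review`. Keeping a review bucket is one possible design choice for using confidence scores; the threshold of 0.8 is arbitrary.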