Build an AI Application for Document Classification: A Step-by-Step Guide
Blog post from Orkes
Part 2 of the AI App Development series focuses on creating an application using Orkes Conductor to automate document classification with Large Language Models (LLMs). The tutorial addresses the challenge organizations face with the manual sorting of large volumes of documents by demonstrating how to build a workflow that classifies PDF documents into predefined categories such as W2 forms, driving licenses, and pay stubs. The workflow employs a series of tasks, including checking for PDF files, extracting text using Optical Character Recognition (OCR) for non-text-based PDFs, and using LLMs for document classification. The tutorial also guides users in setting up an OCR worker using Node.js and integrating their preferred LLM provider within the Orkes Conductor platform. It emphasizes the importance of crafting a tailored AI prompt to guide the LLM in accurately classifying documents and concludes by encouraging users to test and refine their workflows, with a promise of further tutorials to expand functionality for additional file types.