Build a smart financial assistant with LlamaParse and Gemini 3.1

Post Details

Company

Google Cloud

Date Published

March 23, 2026

Author

Vishal Dharmadhikari, and Clelia Astra Bertelli

Word Count

1,040

Company Posts That Month

15

Language

English

Hacker News Points

-

Source URL

developers.googleblog.com/build-a-smart-financial-assistant-with-llamaparse-and-gemini-31

Summary

Extracting text from unstructured documents has traditionally been challenging, especially for complex layouts like multi-column PDFs and nested tables, but advancements in large language models (LLMs) are now facilitating reliable document understanding. LlamaParse is a tool that enhances traditional Optical Character Recognition (OCR) by integrating multimodal capabilities and customized parsing instructions, improving text extraction from documents such as PDFs, presentations, and images. Utilizing Gemini 3.1 Pro, LlamaParse offers a robust workflow for parsing brokerage statements, which include dense financial jargon and complex tables, by breaking down the process into stages: ingesting the document, routing and parsing it, extracting text and tables concurrently, and synthesizing a summary using Gemini's capabilities. This approach not only ensures high-quality parsing but also optimizes for both accuracy and cost by using a two-model architecture. The setup involves installing necessary Python packages and configuring API keys, followed by creating a LlamaParse client to parse and extract information from documents. The workflow is designed to be scalable and resilient, running text and table extractions in parallel to minimize latency and enhance performance. This system demonstrates how combining LLM capabilities with dedicated parsing tools can structure complex data, making applications like personal finance assistants more efficient and reliable.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	8	6,078	960	218	+18%