Home / Companies / Unstructured / Blog / Post Details
Content Deep Dive

Transform files in S3 to Pinecone with Unstructured Platform with no code

Blog post from Unstructured

Post Details
Company
Date Published
Author
Nina Lopatina
Word Count
229
Language
English
Hacker News Points
-
Summary

The text provides a concise guide on transforming unstructured data from an S3 bucket into a Pinecone vector database using the Unstructured Platform, highlighting the ease and efficiency of this process with a no-code ETL approach. It details the creation of a source connector for S3 and a destination connector for Pinecone, specifying the embedding dimensions for optimal setup, and emphasizes the use of a VLM transformation strategy for handling complex PDFs with images, code, and formulas. The process involves setting up a workflow that efficiently processes new documents without reprocessing existing ones, demonstrating how 1,290 files were structured in just 10 minutes, illustrating the platform's capability to handle large-scale data transformation swiftly.