Home / Companies / Qdrant / Blog / Post Details
Content Deep Dive

Loading Unstructured.io Data into Qdrant from the Terminal

Blog post from Qdrant

Post Details
Company
Date Published
Author
Anush Shetty
Word Count
464
Language
English
Hacker News Points
-
Summary

Loading data into Qdrant, a vector search engine, from Unstructured.io involves a series of steps starting with data extraction, cleaning, chunking, and generating embeddings before finally loading it into Qdrant. The blog post details the process of ingesting data from Discord channels into Qdrant using the Unstructured CLI, which supports over 20 vetted data sources. It outlines the prerequisites needed, such as a running Qdrant instance, a Discord bot token, and the Unstructured CLI with specific extras. The process involves generating structured data using a Discord bot, setting up Qdrant collections with specific vector dimensions, and loading the data with configurable options for embedding fields, partitioning, and chunking. The post also points to additional resources for setting up and configuring the Qdrant ingestion process, emphasizing Unstructured's capability to function programmatically or via a hosted API.