Home / Companies / GitLab / Blog / Post Details
Content Deep Dive

GitLab's data migration process for Advanced Search

Blog post from GitLab

Post Details
Company
Date Published
Author
Dmitry Gruzd
Word Count
1,410
Company Posts That Month
15
Language
English
Hacker News Points
-
Summary

GitLab has been enhancing its Elasticsearch integration to enable Advanced Global Search features for GitLab.com users, addressing challenges like scalability and efficiency in data migration processes. Initially, reindexing large GitLab instances posed significant issues, prompting the development of a more efficient migration strategy inspired by Rails DB migrations. The new approach includes automated, background-run migrations with a focus on minimizing downtime and strain on system resources. This involves utilizing timestamps for migration files, storing migration data within Elasticsearch, and implementing various configuration options such as batch processing and space requirement checks. Since its implementation in the 13.6 release, the framework has improved performance by successfully migrating various document types like issues and merge requests, while also emphasizing the importance of regular updates to align with ongoing enhancements.

Trends Found in this Post

No tracked trend matches for this post yet.