Shopify cuts execution time from hours to seconds with Iceberg and Trino
Blog post from Starburst
Shopify significantly improved its data processing efficiency by migrating from Hive and JSON table formats to Apache Iceberg and Parquet, using Trino as the compute engine. This transition, prompted by data silos and interoperability challenges, led to execution time reductions from hours to mere minutes, greatly enhancing the productivity of data analysts and scientists. The migration involved rewriting vast amounts of data and overcoming technical challenges with the help of the Trino community, showcasing the benefits of modern table formats and open-source collaboration. The shift resulted in execution speeds that were 1000 times faster, underscoring the importance of adopting efficient data storage and processing frameworks to drive business insights and performance.