Home / Companies / Fivetran / Blog / Post Details
Content Deep Dive

4 methods for exporting CSV files from Databricks

Blog post from Fivetran

Post Details
Company
Date Published
Author
Michel Zurkirchen
Word Count
1,312
Language
English
Hacker News Points
-
Summary

The article provides a comprehensive guide on exporting CSV files from Databricks, detailing four distinct methods to achieve this task. It first outlines how to use Databricks Notebook, allowing users to download datasets directly or export them to DBFS for larger datasets, with options to customize CSV formatting and file size. The second method involves using the Databricks command-line interface (CLI) to transfer CSV files from DBFS to other locations, requiring Python and a personal access token for authentication. The third method utilizes JSpark, a Java-based tool, to execute SQL queries and save results as CSV files directly to a local machine. Lastly, the article suggests using external client tools like Visual Studio Code with a Databricks extension or standalone DBFS Explorer for easy file navigation and downloads. It concludes by recommending Fivetran Activations for a more streamlined data synchronization process if the outlined methods seem cumbersome.