Home / Companies / Zapier / Blog / Post Details
Content Deep Dive

Machine learning made easier with datto package

Blog post from Zapier

Post Details
Company
Date Published
Author
Kristie Wirth
Word Count
1,007
Language
English
Hacker News Points
-
Summary

Zapier is actively exploring various machine learning algorithms to enhance tasks such as recommending Zaps, automating support ticket responses, and classifying spam, acknowledging the "No Free Lunch" theorem, which suggests no single approach suits all machine learning problems. In response to the challenges of repetitive tasks in machine learning, the author created a personal Python package named "datto" (DATa TOols) to streamline processes like data cleaning, feature generation, and model tweaks. This package includes methods for exploratory data analysis (EDA), data preparation, and model training, with functionalities like identifying unique values, excluding certain columns, and removing correlated features. Additionally, datto aids in data type correction, duplicate column removal, and data frame compression, while integrating with other tools such as Altair for visualizations, Featuretools for feature generation, and snorkel for labeling. It also automates model selection using grid searches and provides insights into model performance and biases. The author encourages feedback and contributions to the datto repository for further enhancements.