Home / Companies / GitHub / Blog / Post Details
Content Deep Dive

The State of the Octoverse: machine learning

Blog post from GitHub

Post Details
Company
Date Published
Author
Thomas Elliott
Word Count
566
Language
English
Hacker News Points
-
Summary

In 2018, machine learning and data science were prominent topics on GitHub, with significant contributions to projects like TensorFlow, which had the highest number of contributors, and PyTorch, noted for its rapid growth. Python emerged as the dominant programming language in machine learning repositories, though other languages like C++, JavaScript, and R were also commonly used. The analysis focused on contributions made throughout the year, revealing that popular Python packages such as NumPy, SciPy, and pandas were widely imported in machine learning and data science projects. TensorFlow was notably used in nearly a quarter of these projects, while other packages like Scikit-learn and Matplotlib were also frequently utilized. Additionally, several projects centered on natural language processing and image processing, underscoring the diverse applications of machine learning on the platform.