How CrowdStrike Trains GenAI Models at Scale Using Distributed Computing

Post Details

Company

Crowdstrike

Date Published

Dec. 2, 2019

Author

ProLong

Word Count

2,953

Company Posts That Month

1,408

Language

English

Hacker News Points

-

Post removed?

No

Source URL

www.crowdstrike.com/en-us/blog/how-crowdstrike-trains-genai-models-at-scale-using-distributed-computing

Summary

CrowdStrike is significantly advancing the training of large language models (LLMs) for cybersecurity applications by leveraging distributed computing and cloud-based infrastructures. As threats evolve with the integration of LLMs in cyber attacks, CrowdStrike has made it a strategic priority to develop custom LLMs tailored for cybersecurity challenges. Utilizing resources such as the Google Cloud Vertex Training Platform, the company efficiently manages the training of these models at scale, employing techniques like data, tensor, and pipeline parallelism to optimize resource use and performance. The company focuses on addressing practical challenges in LLM training, such as data diversity and memory management, by implementing synthetic data augmentation and gradient checkpointing. These efforts are part of a broader initiative to enhance the capabilities of their cybersecurity solutions, ensuring they remain at the forefront of AI-driven threat detection and response. CrowdStrike's ongoing research and infrastructure investments aim to improve the efficiency and scalability of their machine learning models, ultimately strengthening their ability to preemptively counteract sophisticated cyber threats.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	24	240	126	2	+5900%
Real-time	2	1,659	640	46	+203%
AI Agents	1	2,394	1,321	1	-
AI Model Fine-tuning	1	No monthly metrics for this publish month.
Data Pipeline	1	120	59	13	+380%
Observability	1	557	139	11	+117%
Zero Trust	1	1,843	1,331	3	+61333%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.