Home / Companies / WhyLabs / Blog / Post Details
Content Deep Dive

Data Logging with whylogs

Blog post from WhyLabs

Post Details
Company
Date Published
Author
WhyLabs Admin
Word Count
1,659
Company Posts That Month
2
Language
English
Hacker News Points
1
Summary

whylogs is an open source tool for data logging that enables users to detect data drift, prevent ML model performance degradation, and validate the quality of their data. The v1 release brings a simpler API, new data constraints, new profile visualizations, faster performance, and a usability refresh. With whylogs, users can generate statistical summaries (termed whylogs profiles) from data as it flows through their data pipelines and into their machine learning models. These profiles enable users to track changes in their data over time, detecting data drift or data quality problems. The tool supports both tabular and complex data and runs natively in Python and JVM environments. It also supports batch processing (e.g., Apache Spark) and streaming (e.g., Apache Kafka). whylogs v1 is built for scale and optimized for massive data sets, with a more than 500x improvement in the speed of generating profiles for large datasets compared to the previous version.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
LLM 12 85 22 10 +130%
Observability 5 954 176 57 +31%
AI Guardrails 4 No monthly metrics for this publish month.
Data Pipeline 2 410 80 33 +22%
RAG 2 9 7 2 +13%
Real-time 1 1,102 330 114 -6%