Content Deep Dive
Advanced Data Analysis in PostgreSQL: Statistical Properties Explored
Blog post from Airbyte
Post Details
Company
Date Published
Author
Arun Nanda
Word Count
2,364
Language
English
Hacker News Points
-
Summary
This article demonstrates how to perform basic statistical analysis using PostgreSQL built-in functions. It covers the mean, variance, standard deviation, coefficient of variation, outliers, covariance, correlation, and regression. The advantages of running these analyses within the database include fewer IT systems to manage and maintain, avoiding passing data back and forth between different systems, and leveraging a mature RDBMS for enforcing data integrity and consistency. The example dataset used in this article is based on cancer statistics.