Six Types of AI Bias Everyone Should Know
Blog post from Seldon
AI bias in machine learning can manifest in various forms, often mirroring the developmental influences seen in human upbringing. Historical bias arises when models are trained on past data that may contain prejudices, as seen in Amazon's recruitment algorithm, which favored male candidates due to historical gender imbalances. Sample bias occurs when training data doesn't represent real-world diversity, exemplified by speech recognition systems that perform poorly for underrepresented groups. Label bias is introduced through inconsistent data labeling practices, affecting model accuracy. Aggregation bias involves simplifying data in ways that distort reality, leading to skewed outcomes, such as in salary predictions across professions. Confirmation bias reflects the human tendency to favor information that aligns with pre-existing beliefs, which can impair the acceptance of AI-driven insights, particularly in fields like healthcare. Evaluation bias emerges when models are tested in limited contexts, leading to poor generalization, as illustrated by voting prediction models that fail outside their initial testing environment. Understanding these biases is crucial for developing fair and effective machine learning systems.