How FiftyOne's Model Evaluation Helped Me Reduce False Positives by 70.9% Without Retraining
Blog post from Voxel51
An ML Engineer shares insights on improving model performance without retraining by leveraging sample-level debugging and the right tools, using the example of a YOLO11 model applied to Ring camera footage. By employing FiftyOne's Model Evaluation tools, the engineer identified an issue with high false positives that were due to Non-Maximum Suppression (NMS) failures, which were resolved by adjusting post-processing parameters rather than retraining the model. This adjustment led to a 16.5% improvement in precision and a 70.9% reduction in false positives. The engineer emphasizes the importance of analyzing data at the sample level to gain actionable insights and highlights the often underappreciated impact of tuning inference-time parameters such as the NMS IoU threshold. The process underscores how sample-level debugging can transform vague aggregate metrics into specific, actionable solutions, ultimately enhancing real-world model performance efficiently.