PredictionProbabilitiesHistogramDrift
Compares prediction probability distributions between reference and monitoring datasets.
Purpose
The Prediction Probabilities Histogram Drift test is designed to evaluate changes in the model’s probability predictions over time. By comparing probability distributions between reference and monitoring datasets using histograms, this test helps identify whether the model’s probability assignments have shifted in production. This is crucial for understanding if the model’s risk assessment behavior remains consistent and whether its probability estimates maintain their original distribution patterns.
Test Mechanism
This test proceeds by generating histograms of prediction probabilities for both reference and monitoring datasets. For each class, it analyzes the distribution shape, central tendency, and spread of probabilities. The test computes distribution moments (mean, variance, skewness, kurtosis) and quantifies their drift between datasets. Visual comparison of overlaid histograms provides immediate insight into distribution changes.
Signs of High Risk
- Significant shifts in probability distribution shapes
- Large drifts in distribution moments exceeding threshold
- Appearance of new modes or peaks in monitoring data
- Changes in the spread or concentration of probabilities
- Systematic shifts in probability assignments
- Unexpected changes in distribution characteristics
Strengths
- Provides intuitive visualization of probability changes
- Identifies specific changes in distribution shape
- Enables quantitative assessment of distribution drift
- Supports analysis across multiple classes
- Includes comprehensive moment analysis
- Maintains interpretable probability scale
Limitations
- May be sensitive to binning choices
- Requires sufficient samples for reliable histograms
- Cannot suggest probability recalibration
- Complex interpretation for multiple classes
- May not capture subtle distribution changes
- Limited to univariate probability analysis