Continuously Monitor the Behaviour of Deployed Models
33 / 57 •
Deployment •
This practice was ranked as medium.
Click to read more. •
This practice helps to increase
the team's agility.
Click to read more. •
This practice helps to increase
the traceability of ML components.
Click to read more.
Click to read more. •
the team's agility.
Click to read more. •
the traceability of ML components.
Click to read more.
Intent
Avoid unintended behaviour in production models.
Motivation
Once a model is promoted to production, the team has to understand how it performs.
Applicability
Monitoring should be implemented in any production-level ML application.
Description
Monitoring plays an important role in production level machine learning. Because the performance between training and production data can vary drastically, it is important to continuously monitor the behaviour of deployed models and raise alerts when unintended behaviour is observed.
The monitoring pipeline should include:
- performance, quality and skew metrics,
- fairness metrics,
- model interpretability outputs (e.g. LIME),
- metrics for the perceived effect of the model, e.g. user interactions, conversion rates, etc.
Monitor the Input Data Pipeline
Model behaviour reflects the quality of the data it receives. Monitor the data pipeline itself, not just the model output:
- track feature distributions at ingestion time and alert on drift relative to the training baseline,
- detect missing, delayed, or malformed data feeds before they silently degrade model performance,
- measure data freshness: flag when upstream data sources stop updating within expected windows,
- log schema violations and type mismatches on incoming records.
Data-pipeline monitoring should trigger the same alert and incident workflows as model performance degradation.
Adoption
Related
- Perform Checks to Detect Skew between Models
- Enable Automatic Roll Backs for Production Models
- Continuously Measure Model Quality and Performance
- Build an ML Observability Infrastructure
- Automate Feedback Loops Between Production Monitoring and Training Pipelines
Read more
- Continuous Delivery for Machine Learning
- Machine Learning Logistics
- Machine Learning Operations: A Mapping Study
- Machine learning: Moving from experiments to production
- Testing and Debugging in Machine Learning
- TFX: A tensorflow-based Production-Scale ML Platform
33 / 57 •
Deployment •
This practice was ranked as medium.
Click to read more. •
This practice helps to increase
the team's agility.
Click to read more. •
This practice helps to increase
the traceability of ML components.
Click to read more.
Click to read more. •
the team's agility.
Click to read more. •
the traceability of ML components.
Click to read more.