Degrees Of Human Intervention For Multimodal Explainability Alignment