Thanks, that makes sense. I took a look at the ratio of instances of mixed scanner time points within subjects for both cases and in case B (hanging failure) ~13% of the subjects contributed mixed data while in case A, 25% of the subjects had mixed scanner within-subject, so perhaps that is the difference.