If it were me, I would create the average template brain (average of the two session high-res images) and align the functional EPI to that. Is there a reason you really want to use the individual session high-res images? I would expect the end result to be similar.