PhD Comprehensive Proposal Examination Notice: Towards a Multi-Scale Collaborative Transformer Architecture for Image Captioning (MSCFT)