CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification | IEEE Conference Publication | IEEE Xplore