Image and Encoded Text Fusion for Multi-Modal Classification | IEEE Conference Publication | IEEE Xplore