Fine-tuned CLIP Models are Efficient Video Learners | IEEE Conference Publication | IEEE Xplore