Caption-aided speech detection in videos | IEEE Conference Publication | IEEE Xplore