Combining text and audio-visual features in video indexing | IEEE Conference Publication | IEEE Xplore