On the Optimization of a Duplicate Document Detection Algorithm Based on SIMD and Document Statistics | IEEE Conference Publication | IEEE Xplore