Compiling a text re-use detection corpus from scientific papers with semi-real cases of plagiarism | IEEE Conference Publication | IEEE Xplore