Abstract—Plagiarism has become a serious problem mainly because of the electronically available documents. An online document retrieval is a weighty part of a modern anti-plagiarism tool. This paper describes an architecture and concepts of a real-world document retrieval system, which is a part of a general anti-plagiarism software. Up to date systems for plagiarism detection are discussed from the source retrieval perspective. The key approaches of source retrieval are compared. The system recommendations stem from design, implementation, and several years of operation experience of a nationwide plagiarism solution at Masaryk University in the Czech Republic. The design can be adapted to many situations. Proper usage of such systems contributes to the gradual improvement of the quality of student theses.
Index Terms—plagiarism detection, plagiarism, source document retrieval, candidate document retrieval, system design, system architecture
Cite: Šimon Suchomel and Michal Brandejs, "Source Retrieval for Plagiarism Detection," Vol. 6, No. 1, pp. 18-26, February, 2015. doi:10.12720/jait.6.1.18-26
Copyright © 2013-2021. Journal of Advances in Information Technology. All Rights Reserved
This work is licensed under the Creative Commons Attribution License (CC BY-NC-ND 4.0)