Plagiarism Detection Techniques for Arabic Script Languages: A Literature Review

Abstract = 38 times | PDF = 264 times


Ribwar Ibrahim Soran Saeed Karzan Wakil


Plagiarism is generally defined as literary theft and academic dishonesty. This considered as the serious issue in an academic documents and texts. There are numerous of plagiarism detection techniques have been developed for various natural languages, mainly English. In this paper we investigate and review the plagiarism detection techniques and algorithms which have been developed for Arabic Script Languages (ASL), and providing a literature review of the utilized methods in terms of techniques and outcomes.  The result of this paper will help the researchers who are going to commence their development and extend their researches in ASL like Arabic, Persian, Urdu, and Kurdish.


Plagiarism Detection Techniques, Literature Review, Arabic, Kurdish, Persia, Urdu.


[1], "glatt plagiarism services," 2017.
[2] UKessays, "A Survey Of Plagiarism Detection Methods Information Technology Essay," 2015.
[3] H. A. Maurer, et al., "Plagiarism-a survey," J. UCS, vol. 12, pp. 1050-1084, 2006.
[4] M. E. B. Menai, "Detection of plagiarism in Arabic documents," International journal of information technology and computer science (IJITCS), vol. 4, p. 80, 2012.
[5], "What is Plagiarism," 2015.
[6] A. Jadalla and A. Elnagar, "A plagiarism detection system for Arabic text-based documents," Intelligence and Security Informatics, pp. 145-153, 2012.
[7] C. Lyon, et al., "Plagiarism is easy, but also easy to detect," Plagiary, 2006.
[8] M. Mirdehghan, "Persian, Urdu, and Pashto: A comparative orthographic analysis," Writing Systems Research, vol. 2, pp. 9-23, 2010.
[9] E. Britannica, "Encyclopædia Britannica Online. Encyclopædia Britannica, 2011," Web. Feb, vol. 10, 2011.
[10] H. A. Maurer, et al., "Plagiarism-a survey," 2006.
[11] A. M. E. T. Ali, et al., "Survey of plagiarism detection methods," in Modelling Symposium (AMS), 2011 Fifth Asia, 2011, pp. 39-42.
[12] A. H. Osman, et al., "Survey of text plagiarism detection," Computer Engineering and Applications Journal (ComEngApp), vol. 1, pp. 37-45, 2012.
[13] A. Bin-Habtoor and M. Zaher, "A survey on plagiarism detection systems," International Journal of Computer Theory and Engineering, vol. 4, p. 185, 2012.
[14] T. A. E. Eisa, et al., "Existing plagiarism detection techniques: A systematic mapping of the scholarly literature," Online Information Review, vol. 39, pp. 383-400, 2015.
[15] S. M. Alzahrani and N. Salim, "On the use of fuzzy information retrieval for gauging similarity of arabic documents," in Applications of Digital Information and Web Technologies, 2009. ICADIWT'09. Second International Conference on the, 2009, pp. 539-544.
[16] A. A. Raza, et al., "N-Gram Based Authorship Attribution in Urdu Poetry," in Proceedings of the Conference on Language & Technology, 2009, pp. 88-93.
[17] S. M. Alzahrani, et al., "Work in progress: Developing Arabic plagiarism detection tool for e-learning systems," in Computer Science and Information Technology-Spring Conference, 2009. IACSITSC'09. International Association of, 2009, pp. 105-109.
[18] C. K. Kent and N. Salim, "Features based text similarity detection," arXiv preprint arXiv:1001.3487, 2010.
[19] M. A. Khan, et al., "Copy detection in Urdu language documents using n-grams model," in Computer Networks and Information Technology (ICCNIT), 2011 International Conference on, 2011, pp. 263-266.
[20] M. E. B. Menai and M. Bagais, "APlag: A plagiarism checker for Arabic texts," in Computer Science & Education (ICCSE), 2011 6th International Conference on, 2011, pp. 1379-1383.
[21] I. Bensalem, et al., "Intrinsic plagiarism detection in Arabic text: Preliminary experiments," in II Spanish Conference on Information Retrieval (CERI’12), 2012.
[22] A. Jadalla and A. Elnagar, "A fingerprinting-based plagiarism detection system for Arabic text-based documents," in Computing Technology and Information Management (ICCM), 2012 8th International Conference on, 2012, pp. 477-482.
[23] L. Ramya and R. Venkatalakshmi, "Intelligent plagiarism detection," International Journal of Research in Engineering & Advanced Technology (IJREAT), vol. 1, pp. 171-174, 2013.
[24] S. Ouamour and H. Sayoud, "Authorship attribution of short historical arabic texts based on lexical features," in Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2013 International Conference on, 2013, pp. 144-147.
[25] A. S. Altheneyan and M. E. B. Menai, "Naïve Bayes classifiers for authorship attribution of Arabic texts," Journal of King Saud University-Computer and Information Sciences, vol. 26, pp. 473-484, 2014.
[26] A. F. Otoom, et al., "Towards author identification of Arabic text articles," in Information and Communication Systems (ICICS), 2014 5th International Conference on, 2014, pp. 1-4.
[27] M. Mahmoodi and M. M. Varnamkhasti, "Design a Persian Automated Plagiarism Detector (AMZPPD)," arXiv preprint arXiv:1403.1618, 2014.
[28] K. Khoshnavataher, et al., "Developing monolingual Persian corpus for extrinsic plagiarism detection using artificial obfuscation," Notebook for PAN at CLEF, 2015.
[30] S. Rakian, et al., "A Persian Fuzzy Plagiarism Detection Approach," Journal of Information Systems and Telecommunication (JIST), vol. 3, pp. 182-190, 2015.
[31] H. Ahangarbahan and G. A. Montazer, "A Fuzzy Approach for Ambiguity Reduction in Text Similarity Estimation (Case Study: Persian Web Contents)," Information Systems & Telecommunication, p. 216, 2015.
[32] M. R. Sharifabadi and S. A. Eftekhari, "Mahak Samim: A Corpus of Persian Academic Texts for Evaluating Plagiarism Detection Systems," in FIRE (Working Notes), 2016, pp. 190-192.
[33] E. Gharavi, et al., "A Deep Learning Approach to Persian Plagiarism Detection," in FIRE (Working Notes), 2016, pp. 154-159.
[34] S. Rafieian, "Plagiarism checker for Persian (PCP) texts using hash-based tree representative fingerprinting," Journal of AI and Data Mining, vol. 4, pp. 125-133, 2016.
[35] L. Gillam and A. Vartapetiance, "From English to Persian: Conversion of Text Alignment for Plagiarism Detection," PAN@ FIRE2016 Shared Task on Persian Plagiarism Detection and Text Alignment Corpus Construction. Notebook Papers of FIRE 2016, 2016.
[36] N. Ehsan and A. Shakery, "A Pairwise Document Analysis Approach for Monolingual Plagiarism Detection," in FIRE (Working Notes), 2016, pp. 145-148.
[37] F. Esteki and F. S. Esfahani, "A Plagiarism Detection Approach Based on SVM for Persian Texts," in FIRE (Working Notes), 2016, pp. 149-153.
[38] M. Mansoorizadeh, et al., "Persian Plagiarism Detection Using Sentence Correlations," in FIRE (Working Notes), 2016, pp. 163-166.
[39] M. Momtaz, et al., "Graph-based Approach to Text Alignment for Plagiarism Detection in Persian Documents," in FIRE (Working Notes), 2016, pp. 176-179.
[40] F. Safi-Esfahani, et al., "English-Persian Plagiarism Detection based on a Semantic Approach," Journal of AI and Data Mining, vol. 5, pp. 275-284, 2017.
[41] Y. A. Abdelrahman, et al., "A Method For Arabic Documents Plagiarism Detection," International Journal of Computer Science and Information Security, vol. 15, p. 79, 2017.
[42] S. M. Alzahrani, et al., "Understanding plagiarism linguistic patterns, textual features, and detection methods," IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 42, pp. 133-149, 2012.