• Home
  • Browse
    • Current Issue
    • By Issue
    • By Author
    • By Subject
    • Author Index
    • Keyword Index
  • Journal Info
    • About Journal
    • Aims and Scope
    • Editorial Board
    • Publication Ethics
    • Peer Review Process
  • Guide for Authors
  • Submit Manuscript
  • Contact Us
 
  • Login
  • Register
Home Articles List Article Information
  • Save Records
  • |
  • Printable Version
  • |
  • Recommend
  • |
  • How to cite Export to
    RIS EndNote BibTeX APA MLA Harvard Vancouver
  • |
  • Share Share
    CiteULike Mendeley Facebook Google LinkedIn Twitter
Journal of Computing and Communication
arrow Articles in Press
arrow Current Issue
Journal Archive
Volume Volume 4 (2025)
Volume Volume 3 (2024)
Volume Volume 2 (2023)
Issue Issue 2
Issue Issue 1
Volume Volume 1 (2022)
Eliwa, E., Essam, S., Ashraf, M., Sayed, A. (2023). Automated Detection Approaches for Source Code Plagiarism in Students' Submissions. Journal of Computing and Communication, 2(2), 8-18. doi: 10.21608/jocc.2023.307054
Essam Eliwa; Shereen Essam; Mohamed Ashraf; Abdelrahman Sayed. "Automated Detection Approaches for Source Code Plagiarism in Students' Submissions". Journal of Computing and Communication, 2, 2, 2023, 8-18. doi: 10.21608/jocc.2023.307054
Eliwa, E., Essam, S., Ashraf, M., Sayed, A. (2023). 'Automated Detection Approaches for Source Code Plagiarism in Students' Submissions', Journal of Computing and Communication, 2(2), pp. 8-18. doi: 10.21608/jocc.2023.307054
Eliwa, E., Essam, S., Ashraf, M., Sayed, A. Automated Detection Approaches for Source Code Plagiarism in Students' Submissions. Journal of Computing and Communication, 2023; 2(2): 8-18. doi: 10.21608/jocc.2023.307054

Automated Detection Approaches for Source Code Plagiarism in Students' Submissions

Article 2, Volume 2, Issue 2, July 2023, Page 8-18  XML PDF (614.16 K)
Document Type: Original Article
DOI: 10.21608/jocc.2023.307054
View on SCiNiTO View on SCiNiTO
Authors
Essam Eliwa email orcid 1; Shereen Essam2; Mohamed Ashraf3; Abdelrahman Sayed3
1Software Engineering, Computer Science, MIU, Egypt
2Computer Science, Misr International University
3Faculty of Computer Science, Misr International University, Cairo, Egypt
Abstract
Code plagiarism is a significant concern in software development, as it compromises the integrity of original work and can lead to ethical and legal consequences. The need for effective plagiarism detection techniques has grown in parallel with the rise in online coding resources and collaborative platforms. The paper analyses existing plagiarism detection tools, comparing their characteristics, functions, and development timelines. Emphasis is placed on essential factors such as additional case detection, direct display of matched pairings, and compatibility with multiple programming languages. By examining these features, educators and software developers can decide which tools best suit their needs.
Additionally, the paper explores various plagiarism detection techniques, including attribute counting, content comparison, string tiling, and parse tree comparison. The advantages and limitations of each method are examined, underscoring the need for continuous improvement and innovation in the field. This paper presents the most widely available plagiarism detection tools that can be seamlessly integrated into learning management systems. In conclusion, the paper highlights critical areas for future research and development in plagiarism detection. These include the integration of plagiarism detection with live learning management systems to streamline the process for educators and students, the enhancement of usability and user experience in plagiarism detection tools to facilitate their adoption, the advancement of detection algorithms to improve accuracy, and the support for multi-language and cross-language comparisons to cater to diverse programming environments.
Keywords
Plagiarism detection; automatic assessment; programming education
References
[1]      Strickroth, Sven. "Plagiarism Detection Approaches for Simple Introductory Programming Assignments." In Proceedings of the Fifth Workshop" Automatische Bewertung von Programmieraufgaben"(ABP 2021), virtual event, October 28-29, 2021. 2021.

[2]      Franca B Allyson et al. "Sherlock N-Overlap: invasive normalisation and overlap coefficient for the similarity analysis between source code". In: IEEE Transactions on Computers 68.5 (2018), pp. 740–751.

[3]      Michael J Wise. "YAP3: Improved detection of similarities in computer program and other texts". In: Proceedings of the twenty-seventh SIGCSE technical symposium on Computer science education. 1996, pp. 130–134.

[4]      Xiao Li and Xiao Jing Zhong. "The source code plagiarism detection using AST". In: 2010 International Symposium on intelligence information processing and trusted computing. IEEE. 2010, pp. 406–408.

[5]      Aleksi Ahtiainen, Sami Surakka, and Mikko Rahikainen. "Plaggie: GNU-licensed source code plagiarism detection engine for Java exercises". In: Proceedings of the 6th Baltic Sea conference on Computing education research: Koli Calling 2006. 2006, pp. 141–142

[6]      EnilPajic ́andVedranLjubovic ́. "Improving plagiarism detection using genetic algorithm". In: 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO). IEEE. 2019, pp. 571–576.

[7]      Ljubovic, Vedran, and Enil Pajic. "Plagiarism detection in computer programming using feature extraction from ultra-fine-grained repositories." IEEE Access 8 (2020): 96505-96514.

[8]      Ludlow, Barbara L. "Virtual reality: Emerging applications and future directions." Rural Special Education Quarterly 34, no. 3 (2015): 3-10.

[9]      M Jiffriya, MA Jahan, and R Ragel. "Plagiarism detection tools and techniques: A comprehensive survey". In: Journal of Science-FAS-SEUSL 2.02 (2021), pp. 47–64.

[10]    Anala A Pandit and Gaurav Toksha. "Review of plagiarism detection technique in source code". In: International Conference on Intelligent Computing and Smart Communication 2019. Springer. 2020, pp. 393–405.

[11]    Dana Sheahen and David Joyner. "TAPS: A MOSS extension for detecting software plagiarism at scale". In: Proceedings of the Third (2016) ACM Conference on Learning@ Scale. 2016, pp. 285–288.

[12]    Dirson Santos de Campos and Deller James Fer- reira. "Plagiarism detection based on blinded logi- cal test automation results and detection of textual similarity between source codes". In: 2020 IEEE Frontiers in Education Conference (FIE). IEEE. 2020, pp. 1–9.

[13]    Ilana Shay, Nikolaus Baer, and Robert Zeidman. "Measuring whitespace patterns as an indication of plagiarism". In: (2010).

[14]    Thomas Lancaster and Fintan Culwin. "A comparison of source code plagiarism detection engines". In: Computer Science Education 14.2 (2004), pp. 101–112.

[15]    Jurriaan Hage, Peter Rademaker, and Nike` Van Vugt. "A comparison of plagiarism detection tools". In: Utrecht University. Utrecht, The Netherlands 28.1 (2010).

[16]    Mayank Agrawal and Dilip Kumar Sharma. "A state of art on source code plagiarism detection". In: 2016 2nd International Conference 135. on Next Generation Computing Technologies (NGCT). IEEE. 2016, pp. 236–241.

[17]    Marwah Najm Mansoor and Mohammed SH Al- Tamimi. "Computer-based plagiarism detection techniques: A comparative study". In: International Journal of Nonlinear Analysis and Applications 13.1 (2022), pp. 3599–3611.

[18]    Arwin, Christian, and Seyed MM Tahaghoghi. "Plagiarism detection across programming languages." In Proceedings of the 29th Australasian Computer Science Conference-Volume 48, pp. 277-286. 2006.

[19]    Franclinton, Ricardo, Oscar Karnalim, and Mewati Ayub. "A Scalable Code Similarity Detection with Online Architecture and Focused Comparison for Maintaining Academic Integrity in Programming." (2020): 40-52.

[20]    S.K. Robinson J.A. Faidhi. "An empirical approach for detecting program similarity and plagiarism within a university programming environment." In: Comput. Educ. 11(1). 1987, pp. 11–19.

[21]    Edward L Jones. "Metrics based plagiarism monitoring". In: Journal of Computing Sciences in Colleges 16.4 (2001), pp. 253–261.

[22]    Chao Liu et al. "GPLAG: detection of software plagiarism by program dependence graph analysis". In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. 2006, pp. 872–881.

[23]    B. KattaJ J. Yasawi. "Plagiarism detection in programming assignments using deep features". In: 4th Asian Conference on Pattern Recognition. 2017, pp. 7–54.

[24]    B. KattaJ J. Yasawi. "Unsupervised learning based approach for plagiarism detection in programming assignments". In: Machine Learning for Source- code Plagiarism Detection. 2017, pp. 9–71.

[25]    Michal Dˇuracˇ ́ık, Emil Krsˇa ́k, and Patrik Hrku ́t. "Scalable Source Code Plagiarism Detection Using Source Code Vectors Clustering". In: 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS). 2018, pp. 499–502. DOI: 10. 1109 / ICSESS. 2018. 8663708.

[26]    A. Aiken S. Schleimer D. Wilkerson. "Winnowing: local algorithms for document fingerprinting". In: European Conference on Technology Enhanced Learning. ACM. 2003, pp. 76–85.

[27]    M. Basavaraju A. Asadullah. "Design patterns based preprocessing of source code for plagiarism detection". In: 2012 19th Asia-Pacific Software Engineering Conference vol. 2012, pp. 128–13

[28]    Jing Dong, Yajing Zhao, and Yongtao Sun. "A Matrix-Based Approach to Recovering Design Patterns". In: IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans 39.6 (2009), pp. 1271–1282. DOI: 10.1109 / TSMCA.2009.2028012.

[29]    G. Cosma O.M. Mirza M. Joy. "Style analysis for source code plagiarism detection—an analysis of a dataset of student coursework". In: IEEE 17th International Conference on Advanced. 2017, pp. 296–297.

[30]    Prechelt, L., Malpohl, G., & Philippsen, M. (2002). Finding plagiarisms among a set of programs with JPlag. J. Univers. Comput. Sci., 8(11), 1016.

[31]    Michael J Wise. "Detection of Similarities in Student Programs: YAP'ing may be Preferable to Plague'ing". In: Acm Sigcse Bulletin 24.1 (1992), pp. 268–271.

[32]    Georgina Cosma and Mike Joy. "An approach to source-code plagiarism detection and investigation using latent semantic analysis". In: IEEE transactions on computers 61.3 (2011), pp. 379–394.

[33]    Paul Heckel. "A technique for isolating differences between files". In: Communications of theACM 21.4 (1978), pp. 264–268.

[34]    Tapan P Gondaliya, Hiren D Joshi, and Hardik Joshi. "Source Code Plagiarism Detection' SCPDet': A Review". In: International Journal of Computer Applications 105.17 (2014).

[35]    Divya Luke et al. "Software plagiarism detection techniques: A comparative study". In: (2014).

[36]    Richard M Karp and Michael O Rabin. "Efficient randomised pattern-matching algorithms". In: IBM Journal of research and development 31.2 (2011), pp. 249–260.

[37]    Ste ́phane Ducasse, Matthias Rieger, and Serge Demeyer. "A language independent approach for detecting duplicated code". In: Proceedings IEEE International Conference on Software Maintenance-1999 (ICSM'99).' Software Maintenance for Business Change' (Cat. No. 99CB36360). IEEE. 1999, pp. 109–118.

[38]    Jinan AW Faidhi and Stuart K Robinson. "An empirical approach for detecting program similarity and plagiarism within a university programming environment". In: Computers & Education 11.1 (2000), pp. 11–19.

[39]    AS Bin-Habtoor and MA Zaher. "A survey on plagiarism detection systems". In: International Journal of Computer Theory and Engineering 4.2 (2012), p. 185.

[40]    David Gitchell and Nicholas Tran. "Sim: a utility for detecting similarity in computer programs". In: ACM Sigcse Bulletin 31.1 (2008), pp. 266–270.

[41]    Boumediene Belkhouche, Anastasia Nix, and Johnette Hassell. "Plagiarism detection in software designs". In: Proceedings of the 42nd annual Southeast regional conference. 2010, pp. 207–211.

[42]    Maxim Mozgovoy et al. "Fast plagiarism detection system". In: International Symposium on String Processing and Information Retrieval. Springer. 2005, pp. 267–270.

[43]    Gaurav Toksha Anala A. Pandit. "Review of Plagiarism Detection Technique in Source Code". In: International Conference on Intelligent Computing and Smart Communication. 2020, pp. 393– 405.

[44]    Mike Joy and Michael Luck. "Plagiarism in programming assignments". In: IEEE Transactions on education 42.2 (2012), pp. 129–133.

[45]    Toshihiro Kamiya, Shinji Kusumoto, and Katsuro Inoue. "CCFinder: A multilinguistic token-based code clone detection system for large scale source code". In: IEEE transactions on software engineering 28.7 (2006), pp. 654–670.

 

Statistics
Article View: 567
PDF Download: 733
Home | Glossary | News | Aims and Scope | Sitemap
Top Top

Journal Management System. Designed by NotionWave.