DS1 spectrogram: Source Code Clone Detection Using Unsupervised Similarity Measures

Source Code Clone Detection Using Unsupervised Similarity Measures

January 18, 20242401.09885

Authors

Jorge Martinez-Gil

Abstract

Assessing similarity in source code has gained significant attention in recent years due to its importance in software engineering tasks such as clone detection and code search and recommendation. This work presents a comparative analysis of unsupervised similarity measures for identifying source code clone detection.

The goal is to overview the current state-of-the-art techniques, their strengths, and weaknesses. To do that, we compile the existing unsupervised strategies and evaluate their performance on a benchmark dataset to guide software engineers in selecting appropriate methods for their specific use cases.

The source code of this study is available at https://github.com/jorge-martinez-gil/codesim

Resources

Stay in the loop

Every AI paper that matters, free in your inbox daily.

Details

  • © 2026 takara.ai Ltd
  • Content is sourced from third-party publications.