Obfuscated code is identifiable by a token-based code clone detection technique

Junaid Akram*, Danish Vasan, Ping Luo

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Recently, developers use obfuscation techniques to make their code difficult to understand or analyse, especially malware developers. In Android applications, if the application is obfuscated, it is hard to retrieve the exact source code after applying reverse engineering techniques on it. In this paper, we propose an approach which is based on clone detection technique and it can detect obfuscated code in Android applications very efficiently. We perform two experiments on different types of datasets including obfuscated and non-obfuscated application's source code. We successfully detected the obfuscated code of two types, including identifier-renaming and string-encryption with a high accuracy of 95%. A comparative study with other state-of-the-art tools proves the efficiency of our proposed approach. Experimental results show that our approach is reliable, efficient and can be implemented at a large-scale level.

Original languageEnglish
Pages (from-to)254-273
Number of pages20
JournalInternational Journal of Information and Computer Security
Volume19
Issue number3-4
DOIs
StatePublished - 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022 Inderscience Enterprises Ltd.. All rights reserved.

Keywords

  • Android applications
  • code clones
  • code reuse
  • malware detection
  • obfuscation handling
  • software security

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Obfuscated code is identifiable by a token-based code clone detection technique'. Together they form a unique fingerprint.

Cite this