Abstract
Recovering function names from stripped binaries is a crucial and time-consuming task in software reverse engineering, particularly for enhancing network reliability, resilience, and security. This paper tackles that task. The absence of syntactic information and the possibility that different code produces identical behavior complicate it. To overcome these challenges, we introduce a novel model, the Bidirectional Encoder Transformer for Assembly Code (BETAC), built on a transformer architecture, which is effective at processing sequential data. BETAC uses self-attention mechanisms and feed-forward networks to capture complex relationships within assembly code for precise function name prediction. We evaluated BETAC against various existing encoder and decoder models on diverse binary datasets, including benign and malicious code in multiple formats. Our model outperformed previous techniques on certain metrics and showed resilience against code obfuscation.
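The self-attention step named in the abstract can be illustrated with a minimal sketch. This is not the BETAC implementation: the instruction tokens, the 2-dimensional toy embeddings, and the helper names `softmax` and `self_attention` are all hypothetical, and queries, keys, and values reuse the embeddings directly instead of learned projections.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(embeddings):
    """Single-head scaled dot-product self-attention.

    Simplifying assumption: queries, keys, and values are the embeddings
    themselves; a trained transformer learns separate projections for each.
    """
    dim = len(embeddings[0])
    outputs, weights = [], []
    for query in embeddings:
        # Score the query against every token, to its left and to its right.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in embeddings
        ]
        attn = softmax(scores)
        weights.append(attn)
        # Each output vector is the attention-weighted mix of all token vectors.
        outputs.append(
            [sum(w * value[i] for w, value in zip(attn, embeddings)) for i in range(dim)]
        )
    return outputs, weights


# Hypothetical assembly tokens and toy embeddings, invented for illustration.
tokens = ["push", "rbp", "mov", "rbp", "rsp"]
toy_embeddings = {
    "push": [1.0, 0.0],
    "mov": [0.8, 0.2],
    "rbp": [0.0, 1.0],
    "rsp": [0.1, 0.9],
}
vectors = [toy_embeddings[t] for t in tokens]
contextual, attention = self_attention(vectors)
```

Each row of `attention` is a probability distribution over all five tokens, so every token draws on context from both directions, which is the sense in which such an encoder is bidirectional. The sketch omits positional encodings, so the two `rbp` tokens receive identical output vectors; a full transformer encoder adds position information and follows attention with a feed-forward layer.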
| Original language | English |
|---|---|
| Title of host publication | 20th International Conference on the Design of Reliable Communication Networks, DRCN 2024 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798350348972 |
| DOIs | |
| State | Published - 2024 |
| Externally published | Yes |
| Event | 20th International Conference on the Design of Reliable Communication Networks, DRCN 2024 - Montreal, Canada. Duration: 6 May 2024 → 9 May 2024 |
Publication series
| Name | 20th International Conference on the Design of Reliable Communication Networks, DRCN 2024 |
|---|---|
Conference
| Conference | 20th International Conference on the Design of Reliable Communication Networks, DRCN 2024 |
|---|---|
| Country/Territory | Canada |
| City | Montreal |
| Period | 6 May 2024 → 9 May 2024 |
Bibliographical note
Publisher Copyright: © 2024 IEEE.
Keywords
- assembly code
- binaries
- CodeBERT
- Reverse engineering automation
- summarization
- Transformers
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Computer Science Applications
- Hardware and Architecture
- Information Systems and Management
- Safety, Risk, Reliability and Quality
- Health Informatics
Fingerprint
Dive into the research topics of 'BETAC: Bidirectional Encoder Transformer for Assembly Code Function Name Recovery'. Together they form a unique fingerprint.