Next Article in Journal
Residual Energy-Based Computation Efficiency Maximization in Dense Edge Computing Systems
Next Article in Special Issue
One-Dimensional Convolutional Wasserstein Generative Adversarial Network Based Intrusion Detection Method for Industrial Control Systems
Previous Article in Journal
Comparison of Different Methods for Building Ensembles of Convolutional Neural Networks
Previous Article in Special Issue
Multimodel Collaboration to Combat Malicious Domain Fluxing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Streamlined Framework of Metamorphic Malware Classification via Sampling and Parallel Processing

1
School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
2
School of Space Information, Space Engineering University, Beijing 101416, China
*
Author to whom correspondence should be addressed.
Electronics 2023, 12(21), 4427; https://doi.org/10.3390/electronics12214427
Submission received: 15 September 2023 / Revised: 15 October 2023 / Accepted: 24 October 2023 / Published: 27 October 2023
(This article belongs to the Special Issue AI-Driven Network Security and Privacy)

Abstract

Nowadays, malware remains a significant threat to the current cyberspace. More seriously, malware authors frequently use metamorphic techniques to create numerous variants, which throws malware researchers a heavy burden. Being able to classify these metamorphic malware samples into their corresponding families could accelerate the malware analysis task efficiently. Based on our comprehensive analysis, these variants are usually implemented by making changes to their assembly instruction sequences to a certain extent. Motivated by this finding, we present a streamlined and efficient framework of malware family classification named MalSEF, which leverages sampling and parallel processing to efficiently and effectively classify the vast number of metamorphic malware variants. At first, it attenuates the complexity of feature engineering by extracting a small portion of representative samples from the entire dataset and establishing a simple feature vector based on the Opcode sequences; then, it generates the feature matrix and conducts the classification task in parallel with collaboration utilizing multiple cores and a proactive recommendation scheme. At last, its practicality is strengthened to cope with the large volume of diversified malware variants based on common computing platforms. Our comprehensive experiments conducted on the Kaggle malware dataset demonstrate that MalSEF achieves a classification accuracy of up to 98.53% and reduces time overhead by 37.60% compared to the serial processing procedure.
Keywords: malware classification; malware family; parallel processing; microsoft kaggle malware dataset malware classification; malware family; parallel processing; microsoft kaggle malware dataset

Share and Cite

MDPI and ACS Style

Lyu, J.; Xue, J.; Han, W.; Zhang, Q.; Zhu, Y. A Streamlined Framework of Metamorphic Malware Classification via Sampling and Parallel Processing. Electronics 2023, 12, 4427. https://doi.org/10.3390/electronics12214427

AMA Style

Lyu J, Xue J, Han W, Zhang Q, Zhu Y. A Streamlined Framework of Metamorphic Malware Classification via Sampling and Parallel Processing. Electronics. 2023; 12(21):4427. https://doi.org/10.3390/electronics12214427

Chicago/Turabian Style

Lyu, Jian, Jingfeng Xue, Weijie Han, Qian Zhang, and Yufen Zhu. 2023. "A Streamlined Framework of Metamorphic Malware Classification via Sampling and Parallel Processing" Electronics 12, no. 21: 4427. https://doi.org/10.3390/electronics12214427

APA Style

Lyu, J., Xue, J., Han, W., Zhang, Q., & Zhu, Y. (2023). A Streamlined Framework of Metamorphic Malware Classification via Sampling and Parallel Processing. Electronics, 12(21), 4427. https://doi.org/10.3390/electronics12214427

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop