Ensemble Malware Classifier Considering PE Section Information

Ren TAKEUCHI; Rikima MITSUHASHI; Masakatsu NISHIGAKI; Tetsushi OHKI

doi:10.1587/transfun.2023CIP0024

IEICE TRANSACTIONS on Fundamentals

Ensemble Malware Classifier Considering PE Section Information

Ren TAKEUCHI, Rikima MITSUHASHI, Masakatsu NISHIGAKI, Tetsushi OHKI

Full Text Views

0

Cite this

Summary :

The war between cyber attackers and security analysts is gradually intensifying. Owing to the ease of obtaining and creating support tools, recent malware continues to diversify into variants and new species. This increases the burden on security analysts and hinders quick analysis. Identifying malware families is crucial for efficiently analyzing diversified malware; thus, numerous low-cost, general-purpose, deep-learning-based classification techniques have been proposed in recent years. Among these methods, malware images that represent binary features as images are often used. However, no models or architectures specific to malware classification have been proposed in previous studies. Herein, we conduct a detailed analysis of the behavior and structure of malware and focus on PE sections that capture the unique characteristics of malware. First, we validate the features of each PE section that can distinguish malware families. Then, we identify PE sections that contain adequate features to classify families. Further, we propose an ensemble learning-based classification method that combines features of highly discriminative PE sections to improve classification accuracy. The validation of two datasets confirms that the proposed method improves accuracy over the baseline, thereby emphasizing its importance.

Publication: IEICE TRANSACTIONS on Fundamentals Vol.E107-A No.3 pp.306-318

Publication Date: 2024/03/01

Publicized: 2023/09/19

Online ISSN: 1745-1337

DOI: 10.1587/transfun.2023CIP0024

Type of Manuscript: Special Section PAPER (Special Section on Cryptography and Information Security)

Category

Authors

Ren TAKEUCHI
  Shizuoka University
Rikima MITSUHASHI
  Shizuoka University
Masakatsu NISHIGAKI
  Shizuoka University
Tetsushi OHKI
  Shizuoka University

Keyword

malware family classification, PE section information, convolutional neural network, ensemble learning

Cite this

Copy

Ren TAKEUCHI, Rikima MITSUHASHI, Masakatsu NISHIGAKI, Tetsushi OHKI, "Ensemble Malware Classifier Considering PE Section Information" in IEICE TRANSACTIONS on Fundamentals, vol. E107-A, no. 3, pp. 306-318, March 2024, doi: 10.1587/transfun.2023CIP0024.
Abstract: The war between cyber attackers and security analysts is gradually intensifying. Owing to the ease of obtaining and creating support tools, recent malware continues to diversify into variants and new species. This increases the burden on security analysts and hinders quick analysis. Identifying malware families is crucial for efficiently analyzing diversified malware; thus, numerous low-cost, general-purpose, deep-learning-based classification techniques have been proposed in recent years. Among these methods, malware images that represent binary features as images are often used. However, no models or architectures specific to malware classification have been proposed in previous studies. Herein, we conduct a detailed analysis of the behavior and structure of malware and focus on PE sections that capture the unique characteristics of malware. First, we validate the features of each PE section that can distinguish malware families. Then, we identify PE sections that contain adequate features to classify families. Further, we propose an ensemble learning-based classification method that combines features of highly discriminative PE sections to improve classification accuracy. The validation of two datasets confirms that the proposed method improves accuracy over the baseline, thereby emphasizing its importance.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/transfun.2023CIP0024/_p

Copy

@ARTICLE{e107-a_3_306,
author={Ren TAKEUCHI, Rikima MITSUHASHI, Masakatsu NISHIGAKI, Tetsushi OHKI, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Ensemble Malware Classifier Considering PE Section Information},
year={2024},
volume={E107-A},
number={3},
pages={306-318},
abstract={The war between cyber attackers and security analysts is gradually intensifying. Owing to the ease of obtaining and creating support tools, recent malware continues to diversify into variants and new species. This increases the burden on security analysts and hinders quick analysis. Identifying malware families is crucial for efficiently analyzing diversified malware; thus, numerous low-cost, general-purpose, deep-learning-based classification techniques have been proposed in recent years. Among these methods, malware images that represent binary features as images are often used. However, no models or architectures specific to malware classification have been proposed in previous studies. Herein, we conduct a detailed analysis of the behavior and structure of malware and focus on PE sections that capture the unique characteristics of malware. First, we validate the features of each PE section that can distinguish malware families. Then, we identify PE sections that contain adequate features to classify families. Further, we propose an ensemble learning-based classification method that combines features of highly discriminative PE sections to improve classification accuracy. The validation of two datasets confirms that the proposed method improves accuracy over the baseline, thereby emphasizing its importance.},
keywords={},
doi={10.1587/transfun.2023CIP0024},
ISSN={1745-1337},
month={March},}

Copy

TY - JOUR
TI - Ensemble Malware Classifier Considering PE Section Information
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 306
EP - 318
AU - Ren TAKEUCHI
AU - Rikima MITSUHASHI
AU - Masakatsu NISHIGAKI
AU - Tetsushi OHKI
PY - 2024
DO - 10.1587/transfun.2023CIP0024
JO - IEICE TRANSACTIONS on Fundamentals
SN - 1745-1337
VL - E107-A
IS - 3
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - March 2024
AB - The war between cyber attackers and security analysts is gradually intensifying. Owing to the ease of obtaining and creating support tools, recent malware continues to diversify into variants and new species. This increases the burden on security analysts and hinders quick analysis. Identifying malware families is crucial for efficiently analyzing diversified malware; thus, numerous low-cost, general-purpose, deep-learning-based classification techniques have been proposed in recent years. Among these methods, malware images that represent binary features as images are often used. However, no models or architectures specific to malware classification have been proposed in previous studies. Herein, we conduct a detailed analysis of the behavior and structure of malware and focus on PE sections that capture the unique characteristics of malware. First, we validate the features of each PE section that can distinguish malware families. Then, we identify PE sections that contain adequate features to classify families. Further, we propose an ensemble learning-based classification method that combines features of highly discriminative PE sections to improve classification accuracy. The validation of two datasets confirms that the proposed method improves accuracy over the baseline, thereby emphasizing its importance.
ER -

IEICE TRANSACTIONS on Fundamentals